Gene Noc_1140 details

Gene Information

Locus tag: Noc_1140
Symbol: thrS
ID: 3706904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1248285
End bp: 1250219
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 51%
IMG OID: 637737643
Product: threonyl-tRNA synthetase
Protein accession: YP_343174
Protein GI: 77164649
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
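
The start, end, and length values in this record agree with each other if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, not part of the original record. It assumes inclusive coordinates and assumes the final codon is an untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1248285
end_bp = 1250219

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1935 644
```

Both results match the Gene Length (1935 bp) and Protein Length (644 aa) listed in the record.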


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000099925
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTTA TTACCCTTCC AGATGGTAGC CAACGCAGTT TTGACCATCC TGTTACCGTT 
TATGACGTAG CGGCCGATAT CGGCCCGGGT CTAGCGAAGG CAGCCCTTGG GGGCAAGATC
GAAGGCCGTT TGGTCGATAG TTCTTATCCC CTTGAGAAGG ATACGAAACT TACCATTATT
ACCGAACGGG ACATGGATGG CTTGGAAATT ATCCGCCATT CCTGCGCGCA TTTGCTGGCC
CAAGCGGTCA AGGCGCTTTA TCCGGAAGCC CAAGTGACTA TCGGACCGGT GATTGAGGAT
GGCTTCTATT ATGATTTTGC CTACCCTAAG GGGTTCACCC CAGAGGATCT TGAAGCCATT
GAAGCCAAAA TGCGAGAATT GGTGGAGCAA GATCTTTCGG TTCATCGAGA GTTGAAGTCC
CGCGAGGAAG CTGTCTCTTT ATTCCGCCGG ATGGGGGAAG AATATAAGGC TGAGATTATC
GCTTCTATCC CCTCGGAGGA GGAAATTTCT CTTTACCGAC AAGGGGATTT TGTGGATCTT
TGCCGCGGGC CCCATGTGCC TTCGACCGCT AGGCTCAAAG CCTTTAAGCT TACCAAGGTG
GCCGGCGCTT ATTGGCGGGG TGATGCCAAT AACGAGATGC TGCAGCGTAT TTATGGCACC
GCCTGGCCTG ATAAAAAAGC CCTTAAGGCT TATCTCCATC GTCTTGAAGA AGCTGAAAAA
CGGGATCACC GCCGGATTGG CGCTGATTTG GATCTGTTTT CCATTCAGGA AGAAGCGGGT
GGCGGCTTGG TATTCTGGCA TCCCATGGGG GCGCGTATCC GGCGGGTGAT AGAGGATTTT
TGGCAGGAGC GTCATACGGC GGCAGGCTAT GAAATGCTCT ATACGCCCCA TATTGCTCAC
GAGGAATTAT GGCAAACTTC CGGGCATACG GATTTTTACC GAGAGTCCAT GTACCAGCCC
ATGGAGGACG ACCACCAACT TTACCAGCTT AAGCCCATGA ATTGCCCTTT TCATGTGCTG
ATATATCAAG GTCGGCTGCG CTCCTATCGG GAATTGCCCA TCCGCTGGGC GGAACTGGGT
ACCGTTTACC GCCATGAAAT GTCTGGTGCT CTGCATGGGT TGATGCGGGT GCGGGGGTTT
ACTCAGGATG ACGCGCATAT TTTCTGTCGC GAAGAGCAGA TTGAGAATGA AATTCTGGGT
ATCCTTGATC TGACTCTAGA AATGCTAGCG GCGTTTGGTT TTGACCGTTA TGAAATTGAC
CTTTCTACGC GGCCGGAAAA ATCGGTGGGG CCGGAAGCAA TTTGGGAGCA GGCAACCCAA
GCGTTGCGTT CAGCATTGGA TAAGAAGGGC TTGGATTACG CTGTGGACGA AGGCGGTGGT
GCTTTCTACG GCCCCAAGAT CGATATTAAA ATCGAGGATG CCATTGGCCG TAAATGGCAG
TGCTCTACGG TCCAGCTAGA CTTTAATCTG CCGGAGCGTT TTGCGATGGA GTATGTGGCC
GAGGATGGCG CTCGCCATCG TCCTATTATG ATCCATCGAG CGGTGTTAGG TTCTTTAGAA
CGTTTTTTTG GTGTACTTAT TGAGCACTAC GAAGGTAAAT TTCCGCCTTG GCTCGCGCCT
GTACAGGTTG TCGTGATGAG CATCACTGAT CGGCAGGAGG GATATGCCCG CCAAGTGGAA
GAAGCGATGA GAAATAAAGG TTTTCGTTCT CTTTTGGACT TGAGAAATGA GAAAATCGGT
TTTAAAATCC GTGAGCACAT TTTGCGCCGG ATTCCTTATT TGTTAGTCAT TGGAGATCGG
GAGGTGGCAA ACCAGACCGT GGCCGTGCGT ACCCGATACA GTCAGGATCT GGGGGCGATG
AGTCTTGATG CCTTTATGGA GCATCTTAGC GTTGACGTTG CTCGTCTTGG TCATAACATT
TCTGAGGAGG ATTAG
 
Protein sequence
MPVITLPDGS QRSFDHPVTV YDVAADIGPG LAKAALGGKI EGRLVDSSYP LEKDTKLTII 
TERDMDGLEI IRHSCAHLLA QAVKALYPEA QVTIGPVIED GFYYDFAYPK GFTPEDLEAI
EAKMRELVEQ DLSVHRELKS REEAVSLFRR MGEEYKAEII ASIPSEEEIS LYRQGDFVDL
CRGPHVPSTA RLKAFKLTKV AGAYWRGDAN NEMLQRIYGT AWPDKKALKA YLHRLEEAEK
RDHRRIGADL DLFSIQEEAG GGLVFWHPMG ARIRRVIEDF WQERHTAAGY EMLYTPHIAH
EELWQTSGHT DFYRESMYQP MEDDHQLYQL KPMNCPFHVL IYQGRLRSYR ELPIRWAELG
TVYRHEMSGA LHGLMRVRGF TQDDAHIFCR EEQIENEILG ILDLTLEMLA AFGFDRYEID
LSTRPEKSVG PEAIWEQATQ ALRSALDKKG LDYAVDEGGG AFYGPKIDIK IEDAIGRKWQ
CSTVQLDFNL PERFAMEYVA EDGARHRPIM IHRAVLGSLE RFFGVLIEHY EGKFPPWLAP
VQVVVMSITD RQEGYARQVE EAMRNKGFRS LLDLRNEKIG FKIREHILRR IPYLLVIGDR
EVANQTVAVR TRYSQDLGAM SLDAFMEHLS VDVARLGHNI SEED
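
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a short script. This is a hedged sketch rather than the method used to build this record. The function names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard genetic code, used here on the assumption that translation table 11 assigns the same amino acids to internal codons and differs mainly in which start codons it permits.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCCTGTTA TTACCCTTCC AGATGGTAGC"
print(translate(first_blocks))  # prints: MPVITLPDGS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein sequence and the listed GC content to be checked the same way.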