Gene Noc_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1038 
Symbol 
ID3707260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1147182 
End bp1148399 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content52% 
IMG OID637737543 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_343076 
Protein GI77164551 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.548207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATGA GTTCCCTACA ATCCCAGCGC GTGCTTTCCG GAATGCGTCC AACCGGGCAG 
CTGCATCTGG GCCACTATCA TGGCGTACTG AAAAACTGGG CCCGGTTGCA ACACGAATAC
AACTGTTTTT TCTTTGTGGC CGACTGGCAT GCTTTGACGA CGGAGTACGA AAATCCCCAG
GTAATTACCG ACAGCGTCTG GGATATGGTC ATCGACTGGC TGGCTGCGGG AATTGAGCCT
TCGGCAGCGA CTTTATTTAT TCAGTCGAAG GTGCCTGAGC ATGCGGAGTT GCATCTTCTA
TTATCCATGA TAACTCCCCT TGGTTGGTTG GAACGGGTGC CGACTTACAA GGACCAGCAG
GAAAAATTAA AAGAGAAGGA TCTGGCGACT TATGGTTTTT TGGGCTACCC GCTACTGCAA
AGCGCCGATA TTCTCGTTTA TAAAGCAACT CGGGTTCCAG TGGGAGAAGA TCAGGTTCCT
CACGTGGAGA TGAGCCGGGA AATTACCCGG CGTTTTAATC ATCTCTATGG CCGTGAACCA
GGCTTTGAAG AGCTGGTGGA AGCGGCTATA AAAAAAATGG GTAAGAAAAA TGCCCAGCTT
TACCGGGAAT TGCGCCGTCG TTTCCAGGAG CAGGGAGATG TGGAAGCCTT GGATAAAGCT
CGTGCTTTTC TAGAAACGCA GCAGAATCTT ACCCTTGGTG ATCGGGAACG TTTATTCGGC
CATCTGGAAG GGGAAGGTAA AGTCATTCTG CCGGAACCGC AGGCCTTGCT GACCCCAGCC
GCTCGTATGC CAGGGCTCGA TGGACAAAAA ATGTCTAAAT CCTACGGCAA TACGATTGCC
TTGCGTGAGC CACCTGAGCA AGTGGAACGG AAGCTCCGCA CCATGCCTAC GGATCCAGCC
CGCGTGCGGC GCACCGATCC CGGCGATCCC GAAAAATGTC CGGTCTGGCA ATTCCATAGG
GTTTACTCTG ACGATGAGGT GAAGGAGTGG GTTCAGAAAG GATGCAGAAC AGCAGGTATT
GGTTGCTTGG ACTGCAAGCA GCCAATTATT GATGCTATTC AGTCTGAACT AAAGCCTATT
CGAGAGCGGG CGCAAGAATA TGCTCACCAT CCCGAGGAGA TCCAACGGAT TATCAAAGAG
GGTAACGAGG CGGCCCGCGA AGTGGCCCGC GAGACGATGG CGGAGGTGCG CCAAGCAATG
GGATTGTCCT ATCGTTAA
 
Protein sequence
MAMSSLQSQR VLSGMRPTGQ LHLGHYHGVL KNWARLQHEY NCFFFVADWH ALTTEYENPQ 
VITDSVWDMV IDWLAAGIEP SAATLFIQSK VPEHAELHLL LSMITPLGWL ERVPTYKDQQ
EKLKEKDLAT YGFLGYPLLQ SADILVYKAT RVPVGEDQVP HVEMSREITR RFNHLYGREP
GFEELVEAAI KKMGKKNAQL YRELRRRFQE QGDVEALDKA RAFLETQQNL TLGDRERLFG
HLEGEGKVIL PEPQALLTPA ARMPGLDGQK MSKSYGNTIA LREPPEQVER KLRTMPTDPA
RVRRTDPGDP EKCPVWQFHR VYSDDEVKEW VQKGCRTAGI GCLDCKQPII DAIQSELKPI
RERAQEYAHH PEEIQRIIKE GNEAAREVAR ETMAEVRQAM GLSYR