Gene Saro_3473 details


Gene Information

Locus tag: Saro_3473
Symbol:
ID: 5077622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 73888
End bp: 76248
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 62%
IMG OID: 640481197
Product: TonB-dependent receptor
Protein accession: YP_001165859
Protein GI: 146275699
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
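
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 73888
end_bp = 76248
gene_length_bp = 2361
protein_length_aa = 786

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 2361 787 786
```

Under these assumptions, the span matches the reported gene length and the codon count minus one matches the reported protein length.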


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACCC AAGCCACGCG GCGGCTGATC GTCGTTGCCC TTACCACCAC TGCAATCGGT 
TTCTCGGTGC CGGCATTCGC CGCCGAAGCC GATGAGCAGC AGGCGGCGCA GACCGCCGAG
GCGACCGATT CAGGCGACAC CGGCGCGATC ATCGTCACTG CCCGTCGCCG TTCGGAAACC
CTGCAGTCGA CCCCGGTTGC CATTACCGCC GTCAACACCG CCATGCTTGA AAGCAAGGCC
GCAGTGAACA TCGGCGATCT TCAGGGCGCG GCTCCTGGCC TTCTCATCAC GCAGCAGAAT
TCGGGCGCGC AGGCGGCCAA CATCTCGATC CGCGGCCTGA CCTATGCGGA TATCGAGAAG
TCGCAGACGC CGACCGTCGG CGTCGTTGTC GATGGCGTGA CCATCGGCAC CAACACCGGC
CAGCTCCAGG ACGCCTTCGA TGTCGCCCAG ATCGAAGTCC TGCGCGGTCC GCAAGGCACG
CTGTTCGGCG CGAACACCAT CGGCGGCGTC ATCAACATCA CGCGTTCGAA GCCCACGATG
GAACCCGGCG CCAAGGCCGA GTTCTCCTAT GGCCGCTGGA ACACGATGTC GCTCAAGGCC
ATCGCCAACT ACGGCGACGG TGATACCTGG GGCGTCAAGG CGTTCTACTT CCACAACGAG
ACCGACGGTT TCTACCGCAA TGTCACGCGC AACACGAATG CGGGCTGGAG CGTTGGCAAC
ACCGTCGGCG GCAGCCTGCT GTTCAAGCCT GCGGGCTCGG GCTTCGACGC GCAGTTGACG
GTCGAGCACG TCAGCCAGAA GTTCGATCCG GTCGTCAGCA ACCTGACCAA CAGCACCGAG
GTGTTCTGCG GCTTCATTCC TGAGCGTGAG TGCAACCGCA ACAACACGAC CGATCTTTAC
ACCACCTTCG GTGACTATGC CGAGAGCACC TACAATGCTC CCGACGCAAC GCTGGAAATG
AACTACGATC TGGGTGGAGT GAAGCTGACC TCGATTACCG GTTGGCGCCA TTCCAAGGAG
GCGCAGACTC AGGACTTCGA CGGTTCATCG ACCGACCTGT ATTACGTCGA TCGTCGCCAG
CACTACACGC AGTGGAGCCA GGAGCTGCGC GCTGCAGGGA ATCTCTTCGA CGGCTTCGAC
TATGTCGTTG GCGGTTACTT CTTCAGCTCG AAGTATGACC TGACGCAGTG GAGCCGAGTA
TTCGGCTTCG ATTCTTCAAC CCCTCCGACC AAGTTCGACA CGGCCGCGCA GCACGTCGAA
GGGAAGACCA AGAGCTATGC GTTCTTCGGC GACTTCAACT GGGCTTTCGC GCCGGGCTTC
CGCCTCTCGT TCGGCGGCCG TTTCAGCCAC GACAACAAGA AGCTCAGCAA CGGCTTTGCC
GATGGCGTCC TGCTCGATCC CGACAACCTC GATCTCAGCA AGATCGCGCT GGTCGGCAAG
GGCGATGCCA GCTTCAACAA GTTCACTCCC AAAGTCGGCA TCGACTGGCG CCCGACGCCG
GACCTGATGG TCTATGCCTC GTGGTCGCGT GGCTATCGTT CGGGCGGTTT CAGCCCCCGC
GCCGCTACCG CTGCAACGGC CAGCACGCCG TTCCAGCCCG AAACGGTCGA CGCCTACGAA
GTCGGCGTGA AGCTGGCAGC TTTCGATCGC AAGCTTGAGC TGAACGTCGC CGGCTTCGTG
TCCGACTACA AGGACATGCA GCAGAACCTG ACCGTGCCTG GCGGCCCCAC CGGCAACCAG
ACGATCACCG GCAACGTTCC GGGTGGCGCG CTGATCAAGG GCATCGAAGT CGACGGCACT
GTCCGCGTGA CCGAAAACTT CAAGCTCACC GGCTCGATCG CGGTGATGGA CTCGCACTTC
CGCAACTTCG TCACCTGTGG CGCCTATGCC GGCGGTGCGG TGGCGACCAA CGATTGCGGC
ACCGGTCTCG TTCCATTCGA CTATTCGAAG AACCGTCTGA TCTACGCGCC TGATTTCACC
GCTTCGCTCA GCGCGGAATA CACCCTGCCG ACGAGCTTCG GCGACGTTTC GGCCAACGTC
GGCTGGCGCC ACATCTCGCC CTATGACGAA CAGCTCTCCG CTGCTTCGCT TACCCCTACG
CTCAACGGCG ATGGCGAAGC GACGCGGATC ACCGTCGAAG GCAACGATCC GCGCGTCCGC
ACCACCACGC AGGATCTGGT CGATGCGGCG CTGACCTTCA ACTTCGATTT CGACAATACC
AAGGCCTATG TCCGCGTCTT CGGCCGCAAC CTGCTGAACG AGAAGACCAC GACCCACGCA
TTTACCGTCG CGGGACTGTG GTCGTTCGGC ATGGCGCTCG AACCGCGCAC CTATGGCGCG
ACGCTGGGGG TCAAGTTCTG A
 
Protein sequence
MKTQATRRLI VVALTTTAIG FSVPAFAAEA DEQQAAQTAE ATDSGDTGAI IVTARRRSET 
LQSTPVAITA VNTAMLESKA AVNIGDLQGA APGLLITQQN SGAQAANISI RGLTYADIEK
SQTPTVGVVV DGVTIGTNTG QLQDAFDVAQ IEVLRGPQGT LFGANTIGGV INITRSKPTM
EPGAKAEFSY GRWNTMSLKA IANYGDGDTW GVKAFYFHNE TDGFYRNVTR NTNAGWSVGN
TVGGSLLFKP AGSGFDAQLT VEHVSQKFDP VVSNLTNSTE VFCGFIPERE CNRNNTTDLY
TTFGDYAEST YNAPDATLEM NYDLGGVKLT SITGWRHSKE AQTQDFDGSS TDLYYVDRRQ
HYTQWSQELR AAGNLFDGFD YVVGGYFFSS KYDLTQWSRV FGFDSSTPPT KFDTAAQHVE
GKTKSYAFFG DFNWAFAPGF RLSFGGRFSH DNKKLSNGFA DGVLLDPDNL DLSKIALVGK
GDASFNKFTP KVGIDWRPTP DLMVYASWSR GYRSGGFSPR AATAATASTP FQPETVDAYE
VGVKLAAFDR KLELNVAGFV SDYKDMQQNL TVPGGPTGNQ TITGNVPGGA LIKGIEVDGT
VRVTENFKLT GSIAVMDSHF RNFVTCGAYA GGAVATNDCG TGLVPFDYSK NRLIYAPDFT
ASLSAEYTLP TSFGDVSANV GWRHISPYDE QLSAASLTPT LNGDGEATRI TVEGNDPRVR
TTTQDLVDAA LTFNFDFDNT KAYVRVFGRN LLNEKTTTHA FTVAGLWSFG MALEPRTYGA
TLGVKF
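
The relationship between the gene sequence and the protein sequence above can be sketched in plain Python. This is a hedged illustration, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for elongation codons. Table 11 differs in its permitted alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAAGACCC AAGCCACGCG GCGGCTGATC GTCGTTGCCC TTACCACCAC TGCAATCGGT"
print(translate(first_line))  # prints: MKTQATRRLIVVALTTTAIG
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow comparison with the reported protein sequence and GC content.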