Gene Saro_3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3668 
Symbol 
ID5077816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp297891 
End bp300164 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content65% 
IMG OID640481391 
ProductTonB-dependent receptor 
Protein accessionYP_001166053 
Protein GI146275893 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.859497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGTGT CGAGGCTACT TTTGTCCGCA GCGGTCTGCG CGTTGGTTCC ATCTTTCGCA 
TTCGCGCAAA ATTCTGGTCA GGCCGATTCC GGCTTGGAAG AAATCATCGT CACCGCGCAA
AAGCGCGAGC AGAACATGCA GGACGTGCCG GTCGCGGTCA CCGCGCTGTC CGCCGAGACC
CTCACCAACC GCAACGTCGC CTCGGTTGCA GACCTGCCCC GCCTCGCTCC CAGCCTCACG
CTGACCCAGG GCAACGTGCC CACCAACAAC TCGCTCAACT TGCGCGGCAT CGGCACGATC
GCCTTCAGCA CCGCCATCGA ACCTTCGGTC GCGGTGGTCG TCGACGACGT TGCCTTGCTC
CAGCAGGCCC AGGCCTTCTC GGGCCTCAGC GACATCTCCC GCATCGAAGT GCTGCGCGGC
CCCCAGGGAA CGCTGTTCGG CAAGAACGCA TCGGCGGGCG CGGTCAACAT CGTCTCGCAG
GGCGCGTCCG ACGTGTTCAC CGGCGCGGTC ACCGGCACTG CCACCACCGA CGATGAATAT
CGCGTCGACG CGTCGCTGGC CGGCCCGCTC GGCGAAAACG CCGGTTTCCG TGTCAACGCC
TTCTACGGCG ACCGCAAGGG CTACATCCGC AATCTTGAGG ATGGCTCGCG CCTCAACAAC
GACAAGAGCT ACGGCTTCCG CGGCCGCCTC GAACTAAAGC CCACCGAAAC GATCAAGGTA
GACCTGATCG CCAGCCACTC GATCAGCGAA AGCGACGGCT TCGCCCGCAC CTTCCGGGCC
GCGCCGACCG GCGCCGCCGT GTTCGGCACC CCGCTGACCG ACAGCCTCGT CGGCATCACG
CCGGGAGAGG ACAACTACTC CGTCCGGCTC GACAAGCCGC TGTTCAACAA GAGCAAGCAG
ACCACCGTCT CGGGCCGCGC CACGCTCGAT CTCGGATTTG CCGACCTGAT TTCGGTCACC
AGCTACCAGG ACTGGCGCTT CCAGTTCGAG GAAGACTTCG ACTACACCGT GTCGGACGTG
CTCGGCATCC CCGGCGGAAT CGTGGCCGAC AGCACCTATC ACGCCACCCA GTTCGCGCAG
GAACTGCGCC TCGTCTCGCC CAGCAAGGGC CGCTTCAGCT ACGTGCTCGG CCTGTTCTAC
GCCGACGGCA AGACCGACCG CGAATTCGAA CGCGGCCCCT CGGGCCCGGT CGTCGCGAGC
TGGGCCTCGC AGAGCCGCAC CGAAAGCTAC GCCGCCTTCG GACAGGCCAC CTTCAACCTG
ACCGACACCA CGCACATCGA TGCCGGGGTG CGTTTCAACC ACGAAAAGGT CGGCGCCAGC
TTCCTCAATC GCGTGCCCAA CGCCTCGCCC CCGGCCGATA ACGCCACCTG CCTCACCACC
TGCGTCGGCA ATGCCAAGGA CAGTGTCGTG ACCTGGAAGA CCGCCCTGCG CCAGGATATT
GGCGATGCGG TCATGGTCTA TGCCTCGTTC GCGCGCGGAT ACAAGGGCCA GGGCTTCGAC
ATCAGCACCG GATTTAACCC GCGACGGGCA GCCTTCCCGG TGCGTCCGGA AACGTCCAAT
GCCTATGAAG TGGGCATCAA GTCACGCTTC CTCGACAACA AGGTCCAGCT CAACATCGCA
GGCTTCTGGA GCGATTTCCG CGACTTCCAG GCCCAGTCCG GCATTCTGCT GCCCGACAAC
ACGGTCCTGC TCACGCTGAA CAACGTCGGC AAGGTCCGCA CCCGCGGCAT CGAGGCAGAA
CTTACCGCCA AGCCCACGGC GGCCCTGACG CTCGACAGCG CGGTCAGCTT TACCGACACC
CGCATCATGG AATTCCCGGG CGCCCAGTGC TACACCGGCC AGACCACTGG CTGCGTCGAT
CTCGACGGCG ATGGCCCGGC GACCGTCAAG GGACAGGACC TTGCCGGAAA GCGCCTTCCC
AACGCGCCGC GCCTCAAGTT CAACGCGGGC TTCAACTACG ACGTGTTCCT GCCTTCGGCA
CCGTTCGATG CCTTTGTCCA GGCCGACGTT TCCTACCAGA GCAAGGTCAA CTTCGACCTC
CTCGGCAATC CGCTGACGGT CCAGGATGGC TATGCGGTGG TCAACGGCAG TATCGGCATC
GACCAGAACG AGCGCGGCGG AATGCGCGTG GCCCTGTTCG TCAACAACCT GTTCGACAAG
CACTACGCCT CGAACGTCAG CATCGCCTCG GGGGGCTCGG CCGGCCTGCT CAGCCAGGCT
CTCGACCGCA AGTCCCGCCG TTACTTCGGC ATCCGGGCCC GCTACCAGTT CTGA
 
Protein sequence
MRVSRLLLSA AVCALVPSFA FAQNSGQADS GLEEIIVTAQ KREQNMQDVP VAVTALSAET 
LTNRNVASVA DLPRLAPSLT LTQGNVPTNN SLNLRGIGTI AFSTAIEPSV AVVVDDVALL
QQAQAFSGLS DISRIEVLRG PQGTLFGKNA SAGAVNIVSQ GASDVFTGAV TGTATTDDEY
RVDASLAGPL GENAGFRVNA FYGDRKGYIR NLEDGSRLNN DKSYGFRGRL ELKPTETIKV
DLIASHSISE SDGFARTFRA APTGAAVFGT PLTDSLVGIT PGEDNYSVRL DKPLFNKSKQ
TTVSGRATLD LGFADLISVT SYQDWRFQFE EDFDYTVSDV LGIPGGIVAD STYHATQFAQ
ELRLVSPSKG RFSYVLGLFY ADGKTDREFE RGPSGPVVAS WASQSRTESY AAFGQATFNL
TDTTHIDAGV RFNHEKVGAS FLNRVPNASP PADNATCLTT CVGNAKDSVV TWKTALRQDI
GDAVMVYASF ARGYKGQGFD ISTGFNPRRA AFPVRPETSN AYEVGIKSRF LDNKVQLNIA
GFWSDFRDFQ AQSGILLPDN TVLLTLNNVG KVRTRGIEAE LTAKPTAALT LDSAVSFTDT
RIMEFPGAQC YTGQTTGCVD LDGDGPATVK GQDLAGKRLP NAPRLKFNAG FNYDVFLPSA
PFDAFVQADV SYQSKVNFDL LGNPLTVQDG YAVVNGSIGI DQNERGGMRV ALFVNNLFDK
HYASNVSIAS GGSAGLLSQA LDRKSRRYFG IRARYQF