Gene Saro_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1888 
Symbol 
ID3917109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1995165 
End bp1997885 
Gene Length2721 bp 
Protein Length906 aa 
Translation table11 
GC content64% 
IMG OID640444632 
ProductTonB-dependent receptor 
Protein accessionYP_497162 
Protein GI87199905 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0505133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTC GCACACTATC CACCGTTTCG GGCGCTTCGC TCGCAGCAAT GACCGTCGCG 
CTCGCGGCGA GCCCGGTCAT GGCGCAGGAA GCCGCCGGTG ACGCCCAGGC GCAAGCGACA
GAGGCCGCGC CGACCGAAGG CCAGGAGCAG GCCATCGTCG TCACGGGCTT CCGCGCCGCG
TTGGCCACTG CGGTCAACGT GAAGAAGACC GCACCTGTGA TCGTGGAGTC GGTTTCGGCG
GAAGACATCG GCCGCCTGCC CGATGCCTCG ATCGGTGAAT CGATCGCCCG CCTGCCGGGT
CTGACGACGC AGCGCCTGTT CGGCCGCGCC AACTCCATCG CGATCCGCGG TTCGAGCGCG
GACCTTTCCT CGACCACGCT TAACGGCCGT CCGCAGACAT CGACCGGCGA GCAGCGCAAT
GTCGAGTTCG ACCAGTTCCC GTCGGAAATC GTCAGCCGCG TCGACGTCTA CAAGGCGCCC
CAGGCCAATC TCGTCCATCA GGGCCTGGTG GGCACGGTCG ACATCAAGAC GATCCGTCCG
CTGGAAATGG GCAAGAGCCT GCTGTCGGTC GGCGCGCGCG GGACTTATGC CGATCTCGGC
AAGGTCAATG CCGACAGCCA TGACAAGGGC TATCGCCTGA CCGGTACCTA TGTCGGCCAG
TTCATGGAGG ACCGCCTCGG CGTCGCGCTG TCGGCTGCCT ATACCGACGA GCCGTACCAG
GCACAGGAAT TCGAGGCCTG GGGCTATGCC GACGGTCCCG ACAGCACCAA GGTCATCGGC
GGAATGAAGC CGTTCGGCGT TTCCACCCAG CTAAAGCGGC TTGGCATCCA GGGCACCGTC
CAGTTCAAGC CGGTCGACGA ACTGACGCTG ACGGTCGATG CCTTCTACGG CAACTTCAAG
GACAAGCAGA TCAAGCGCGG CGTGGAGTTC CCGCTGGCGT GGAGCGGTGC GCAGCTTTCC
CCGACGGGCA TAGAGACCAC CGGCAACCTG ATCACCGGGG GCACGTTCAC CGGCGTCGAG
GCGGTGGTCA ACAACCACGG CTACGAGCGC AATTCAGACA TCTTCTCGGG CGGCTTCAAC
GCGGCCTGGC AAGGTGACGA TGGCTGGTCG GCCTCGTTCG ACTTCGGCTA CTCCAAGACC
GACCGCAACG AACTGACGCT CGAAACCAAT GCCGGCACTG GCCCGGGCGG CGGCGTGGGC
GCAACCGACA CCCTGACTTT CGTGAGCGAC GGCAAGGGCA CGCACTTCAC CGACCACACG
CTCGATTACG GCGATTTCAA CTCGATTGTG CTGACCGACC CGCTGGGCTG GGGCGGCGGC
GCACCGGCCG GCCACCAGGA AGGCTACTAC AACAACCGCA TCATCGACGA CGAGATCAAG
AGCTTCCAGG TCGAGGTCGG CAAGGAGCTT GAGGACAGCT TCCTGTCGAA GCTCTCGGTG
GGCACGGCCT ACGTCGACCG CACCAAGGCC AAGACCCCGG AAGAATACTT CCTCAATCTC
GCCGATGGTG CCCGCTCGCT CGTCGTGCCG GATCAGTACC GCACCGGGAC CACCGACCTG
TCGTTCATCG GCGCCGGCCC GATCGTCAGC TACGATCCGT TCAAGATGCT GGACGACGGC
GTCTACGTGA AGACGCTGAA CCCAAGCAAG GACGTGCCGG CCAAGGCCTA CGCGGTTACC
GAGCGGGTCA TGTCGATCTA CCTCAAGGGC GACCTCAAGG CGGCGTTCGG CGATATCGAG
ATGGACGGCA ACATCGGCGT TCTCGCGCAG AATACCGAGC AGAAGTCGCG CGGGTACGTG
AACCTTGCCG CAGCCAGCCT GGTGCCCGTG ACGCGCGGTG CGCGCTACTG GGATGTCCTG
CCGAGCCTGA ACCTGAACTT CCGCATCCCC GGCGACTGGG TGGTGCGCGT GGCTGCGGCG
CGCGAGATCC AGCGTCCGCG CTTCGAGGAC ATGAAGGTCA GCCTGGACTA CAGCTACAAC
ACCGCCAGCG GCATCATTTC GGGCAACGGC GGCAATCCGG AGTTGCGCCC GTACCGCGCA
TGGGCCGCCG ACCTGAACAT CGAGAAGTAC TTCGGCCGCA AGGGCTACAT CGGCGTCCAG
ATGTTCTACA AGAAGCTTGA CAACTACATC TACACCGATG TCGTGCCGTA CGACTACTCG
GGCCTGCCGG TGACCGCACC GGTGCCGATC ACTAACTACG TCGGCACGCT CAAGACCGCG
GTCAACGGTT CGGGCGGGAA GCTCTATGGC ATCGAACTTG CGGGCACGCT GCCGTTTGAG
GTCATCACCC CGGCACTGGA AGGCTTCGGC TTCACCGGCG GCGTTGGCTA CACCAAGACG
TCGATCAAGC CGGGCGTGGA CGCGAAGGCG CAGGACCTTC CGGACTATTC GCGCTGGGTG
GCCTCGGGTA CGCTGTTCTT CGAGAAGGCC GGCTTCAATG CGCGCGTCTC GGCCCGTCAT
CGTTCCTCGT TCCAGGGCAT CTTCGTGGGC TTCGGTGGCG AGCGTGAACT GCGACGCGCG
CTGAAGGAGA CGATCGTCGA CGCCCAGATC GGCTATGACT TCCAGGAAAG CAGCAAGCTT
CACGGCCTGT CGCTGTTCCT GCAGGGCCAG AACCTGACGG ACGAGCCGTT CGTCTCGGTC
GATACGGGCA CGACGCTGCA AATCCGCAAC TACCAGACCT ATGGCCGCCG CTTCATGGCG
GGCTTCAATT ACCGCTTCTG A
 
Protein sequence
MAIRTLSTVS GASLAAMTVA LAASPVMAQE AAGDAQAQAT EAAPTEGQEQ AIVVTGFRAA 
LATAVNVKKT APVIVESVSA EDIGRLPDAS IGESIARLPG LTTQRLFGRA NSIAIRGSSA
DLSSTTLNGR PQTSTGEQRN VEFDQFPSEI VSRVDVYKAP QANLVHQGLV GTVDIKTIRP
LEMGKSLLSV GARGTYADLG KVNADSHDKG YRLTGTYVGQ FMEDRLGVAL SAAYTDEPYQ
AQEFEAWGYA DGPDSTKVIG GMKPFGVSTQ LKRLGIQGTV QFKPVDELTL TVDAFYGNFK
DKQIKRGVEF PLAWSGAQLS PTGIETTGNL ITGGTFTGVE AVVNNHGYER NSDIFSGGFN
AAWQGDDGWS ASFDFGYSKT DRNELTLETN AGTGPGGGVG ATDTLTFVSD GKGTHFTDHT
LDYGDFNSIV LTDPLGWGGG APAGHQEGYY NNRIIDDEIK SFQVEVGKEL EDSFLSKLSV
GTAYVDRTKA KTPEEYFLNL ADGARSLVVP DQYRTGTTDL SFIGAGPIVS YDPFKMLDDG
VYVKTLNPSK DVPAKAYAVT ERVMSIYLKG DLKAAFGDIE MDGNIGVLAQ NTEQKSRGYV
NLAAASLVPV TRGARYWDVL PSLNLNFRIP GDWVVRVAAA REIQRPRFED MKVSLDYSYN
TASGIISGNG GNPELRPYRA WAADLNIEKY FGRKGYIGVQ MFYKKLDNYI YTDVVPYDYS
GLPVTAPVPI TNYVGTLKTA VNGSGGKLYG IELAGTLPFE VITPALEGFG FTGGVGYTKT
SIKPGVDAKA QDLPDYSRWV ASGTLFFEKA GFNARVSARH RSSFQGIFVG FGGERELRRA
LKETIVDAQI GYDFQESSKL HGLSLFLQGQ NLTDEPFVSV DTGTTLQIRN YQTYGRRFMA
GFNYRF