Gene Saro_2215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2215 
Symbol 
ID3916531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2357459 
End bp2360374 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content62% 
IMG OID640444970 
ProductTonB-dependent receptor 
Protein accessionYP_497487 
Protein GI87200230 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.169336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGGCA AGCTTTCCCT GCTGGCCGGC GTGTGCGTAG CGGCCATTTC CACCCCCGTA 
TTCGCCCAGT CGGCGAACGA GAATGTGGGT CTTGAAGAAA TCATCGTTAC TGCGCAAAGG
CAGGCGCAAT CGCTTCAGGC CGTGCCTATC GCGGTGTCCG CATTCTCGGC GGAGAACCTC
GAGAAGCAGC AGATCCAGAA CTCTTCCGAT CTTCAGCTCT CGCTGCCGAA CGTAACCTTC
ACCAAGACCA ACTTCACCTC CTCCAGCTTC ACCATCCGCG GCATCGGCGA CCTTTGCGTC
GGCGTCTCGT GCGATGCCGC CACGGGCATC CACCTGAACG ACATGCCGAT GGTATCGAGC
CGCCTGTTCG AAACCGAATT CTTCGATCTC GAACGCATCG AAGTTCTGCG CGGTCCGCAG
GGCACCCTGT TCGGGCGCAA CGCCACTTCC GGCGTGGTCA ACATCATCAC CGCCAAGCCT
GATCTTTCCG GCTTCGGCGC GTCGGGCGAG GCCGACTACG GCAAGTACAA TTCGGTCCGC
GTCAAGGGCA TGATCAACGT TCCGCTGGGC GAAACGCTCG GCGTGCGCGT TGCGGGCATG
TACACAAAGC GCGATGGTTA CACCTACAAC ATCGGCTCCA ACAGCCGCAT CGACGGTCGC
GACATGTACG CCCTGCGCGG CTCGCTGCGC TGGGAGCCGA GCCCGGATAC CACCATCGAC
CTTTCGGCTT ACTACTTCCG CGAGAAGGAC AGCCGCAGCC GCGTCCAGAA GCAGCTCTGC
CATCGCGACC CGACCGGCAT ACTTGGTTGC TTGCCCGACA AGCTGGCCAA CGAGACAACC
AACGCGGACT CGACCCTCGC GGCACTGCTG ACGTCGAAGG AATTTTTCGG CATTGCCGTT
GCACCGAGCT TTGCTGCGCT CGGCCTCGGC AGCATCTATG GCGACGACGG CGACAATTTC
TCGGGCGCGG TCAATCCGTC CGACGTTCGT ACCGTCAACA TCGACTACCT GCCGACCTAC
TTCGCCGAGG AAGAAGTCTA CCAGGGCAAG CTCCAGCAGG CTCTCGGCAA CAACCTGACG
TTGAATGTCA CCGCTGGCTA TTCGCGCAAC GAGGTTCGCA GCCGTACCGA CTACAACCTC
GCGACCGAGC GCAGTCTCGT CGGCAACGCG GGCCTCTATG CCCTCGCGGC CTTCGCGCAG
AGCCCGACCC TGGGCTTTGC CTTCGCGCCT ATCGCGGCTC GCCTCATCCC TAACGGCCCG
ACCGGCCAGA TCTGCCAGTC CGACATCGAC CCCAACAACG TTGGTGTCTT CGGCGCGGGC
AACAAGGCGA TCTGCGCGAA CACCTCGCTC GACTTCGATG AATCGAGCCA GACCTACCAG
CAGTACGCTG CCGAAGCGCA CATCGACTCC GACTTCGACG GCATGTTCAA CTTCCTGATC
GGCGCGAACT ACCTGCGCGG CGTGACGACG AACAACAGCT ACTACGTCAA CTCGTTCGGT
CTGGACTACG CATCGGGCCT GCTCGGTGCT GCGGCGGCGA CGACCGGCGC CTTCGGTGCC
AATGCGGTGT TCCGTCCTTC GCCGTTCTTC CGCAGCAACA CCGACAAGGG CACGCTCGAC
AGCTACGGCA TCTTTGGTGA GACCTACTTC AAGTTCAACG ACAAGATTAA GCTCACCCTC
GGCCTGCGCT ACAACCACGA CAAGAAGTTC ACCCGAGCCC GCACCACCTT GTTGAGCGAC
GGCCTGTCGA GCGAATTCGC AGGTGGCAAC GCAATCTATG CGCTGGTCGG GGCGGCCTCG
CTCGAGGATG CTCTGAACTG GGCCACTGCC GACTTCGACA AGGGCACGGA TGACGTCCAG
GCCTTCCAGG AACGCTCGGT CAGCTTCGGT CGCATGACCG GCCGCGCGGT GCTGGATGTC
CAGCTCACGC CCGACAACCT GCTCTACCTG TCCTACTCGC GTGGGTACAA GTCGGGCGGC
ATCAATCCGC CGCTCTCGGT CGGGTCGGTC AACGACTCGT TCAAGCCGGA ATCGGTCGAT
GCCTTCGAAA TCGGTTCCAA GAACCGCTTC GGCGCGCTGC AGCTCAACCT CTCGGCGTTC
TACTACCGCT ACAAGGATCT TCAGCTCAGC CGCATCACCG CGCGCACTTC GGTCAACGAC
AACATCGATG CCAACATCTA CGGCGTCGAG GCCGAAGCGA TCATGGCGCC CACGCGCAAC
CTGCTGGTCA ATTTTTCGGC CAGCTACCTC AAGACCAAGG TCGTCGGGGA CCAGTTCTTC
GTCGACACCC GCGACGTTTC GGCGGGCCGT TCGGACACGG TGATCATCAA GGACATCACC
CTCGGCTCGA ACTGCGCGGT TACCGGGGCA AGCGCGGCGG CGGCCAATGC CTTCGTCAAC
ACCATCAATA GCGGGGTCGG GCTGCGCGGC ACGACGCCGA TCCCCGGCAC CAACACCACC
GGTGCCTTCT CGATCTGCGG CGTTCTGGCA ACCCAGGCCG CGGCGGTCGG CAGTGCATTC
GGCGGCATCA CCGTCAGCTC GGGCATCGAA AAGAACGTGC GCGGCAACCA GCTCCCGCAG
GCCCCTGAAT TCAAGTGGTC GGCGGGCATC CAGTACACGC TCGAACTGGG CGGGATGACC
CTCGTTCCGC GCTTCGACAT CAACTACACG GGTGAAAGCT ACGGCACGAT CTTCAACGGC
AACATCAACC GGATCAAGGG CTACGAGGTG ATGAACGCCC AGATCCAGCT GAATGGCCGT
GACGACCGCT GGTTCCTGCG CGGCTACATC CAGAACATCG GCAACAACAA CGCCACGACC
GGCCTCTACG TGACCGACCA GTCCTCGGGC CTGTTCACCA ACATCTTCAC GCTGGAACCG
CGCCGCTACG GCGTCGCGGC CGGGTTCAAG TTCTGA
 
Protein sequence
MRGKLSLLAG VCVAAISTPV FAQSANENVG LEEIIVTAQR QAQSLQAVPI AVSAFSAENL 
EKQQIQNSSD LQLSLPNVTF TKTNFTSSSF TIRGIGDLCV GVSCDAATGI HLNDMPMVSS
RLFETEFFDL ERIEVLRGPQ GTLFGRNATS GVVNIITAKP DLSGFGASGE ADYGKYNSVR
VKGMINVPLG ETLGVRVAGM YTKRDGYTYN IGSNSRIDGR DMYALRGSLR WEPSPDTTID
LSAYYFREKD SRSRVQKQLC HRDPTGILGC LPDKLANETT NADSTLAALL TSKEFFGIAV
APSFAALGLG SIYGDDGDNF SGAVNPSDVR TVNIDYLPTY FAEEEVYQGK LQQALGNNLT
LNVTAGYSRN EVRSRTDYNL ATERSLVGNA GLYALAAFAQ SPTLGFAFAP IAARLIPNGP
TGQICQSDID PNNVGVFGAG NKAICANTSL DFDESSQTYQ QYAAEAHIDS DFDGMFNFLI
GANYLRGVTT NNSYYVNSFG LDYASGLLGA AAATTGAFGA NAVFRPSPFF RSNTDKGTLD
SYGIFGETYF KFNDKIKLTL GLRYNHDKKF TRARTTLLSD GLSSEFAGGN AIYALVGAAS
LEDALNWATA DFDKGTDDVQ AFQERSVSFG RMTGRAVLDV QLTPDNLLYL SYSRGYKSGG
INPPLSVGSV NDSFKPESVD AFEIGSKNRF GALQLNLSAF YYRYKDLQLS RITARTSVND
NIDANIYGVE AEAIMAPTRN LLVNFSASYL KTKVVGDQFF VDTRDVSAGR SDTVIIKDIT
LGSNCAVTGA SAAAANAFVN TINSGVGLRG TTPIPGTNTT GAFSICGVLA TQAAAVGSAF
GGITVSSGIE KNVRGNQLPQ APEFKWSAGI QYTLELGGMT LVPRFDINYT GESYGTIFNG
NINRIKGYEV MNAQIQLNGR DDRWFLRGYI QNIGNNNATT GLYVTDQSSG LFTNIFTLEP
RRYGVAAGFK F