Gene Saro_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1053 
Symbol 
ID3916348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1093126 
End bp1095501 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content63% 
IMG OID640443787 
ProductTonB-dependent receptor 
Protein accessionYP_496332 
Protein GI87199075 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGC AAAGCAATCT GCAGTCGGCC AATGCGCGCA AGATCAACTC GTACAAGGCG 
CTGCTTGCCG GAACGGCCGT TCTGGCCGTG GTGTCTCCTG CTGCCGCGCA GGAAGCGGCC
AGCGAGGATC CGGGCGAAAT CGTCGTGACC GCAACCCGGC AGTCCGAAAC GATTTCGAAG
GTGCCGCTCA GCATCGCCGC CTACAGCCAG GAAAAGCTCG ATCAGCAGGG CGTACGGCGG
GTTGACGACG TCGCTCGCCT GACTCCGGGC GTCACCTTCT CGCGAGGGGA CCAGCGCAAT
GCAGGCGCGG CGAACATTTC CATCCGCGGC ATTTCGTCGG CGGCAGGGTC TTCCACCACC
GGCATCTATA TCGACGATAC GCCCATCCAG ATCCGCAACG TCGGCTTCAG CGCCTTCACG
CCGTTTCCCG CAGTCTTTGA CCTGCAGCGG GTCGAGGTGC TGCGCGGACC GCAGGGCACG
CTGTTCGGCG CAGGGTCCGA AGGCGGCACC GTCCGTTTCA TTACGCCCAG CCCGGATTTC
GATGAAATGA AGTTCTATGC GCGAAGCGAA CTGGCCACGA CCAAGAGCGG CGAGGCAAGC
TTCGAGGCGG GCGCTGCGAT CAGCGTGCCG CTCGTCAAGG ACAAGCTCGC CGCGCGGGTC
AGCGGCTACT ACCGGCGCGA CGGTGGCTAT ATCGACCGCG TCGACTACAC CAGCGGCAAC
GTGCTGGACA AGAATTCGAA CTGGCAGGAC ACCAAGGTGG CCAGCGCATC GCTCGCATGG
AAAGCCACCG ACGCGGTCAC AGTTACCCCT TCGGTCTATT TCCAGGAAAC CTGGAACAAC
GACGCGGGCA CCTATTGGGA AGTGCTGTCC GACGCCGGGA AGGGCCAGTT CAACAACGGC
AATGCGGTTG CCAACTGGAA TCGCGACCGT TTCGTGCTGC CCGCGCTGAA GATCGAGGCG
GAGCTGGGCG ACGTGTCACT GATCTCCAAC ACGTCCTACT TCTACCGCGA CCAGAAGGCC
CAGAACGATT ACACCGTGTT CGAGGCCGCG CTCTGGACCG GCAACCCGTT CTATCCCGCA
GGAATGTATG CGCCGGCGTT CCAGTACAAC AGGCAATCCA ACTTCACCCA GGAACTTCGC
CTGCAGTCCA ACAATCCGGA CAGCCCCTTG CGCTGGGTGA TCGGCGGCTT CTATGCGCAC
AATCGCCAGA CTGCCCGCCA ATTCGTCCAG GACACATTCC TGCCGGACCT GTTTGAATCG
GTGACGGGCG TGCCGTTCGT GGCCGTGTTC GGCCAGGGGC TGGTGGACGA CAAGTACACG
TTCGTGCTCG ACAAGGCGGC ATCGACCGAC GAACAGATCG CCGGCTTCGG GCAGGTCGAC
TACAACCTGA CCGAACAGTT CAAGCTCACA GCCGGCCTGC GCGTGGCGCA TACCAAATTC
AGCACCAGCG CGCAGTTCGT CGGGCCGGTG GTCGGCCCGG ATGTCGATGA CACCGGCAAG
CAGAGCGAAA CCCCGGTGAC GCCGAAGTTC GGCTTGTCCT GGCAAGCCAA CGAGGACAAC
CTCGTCTACG CCACGGCATC GAAGGGCTTC CGCATCGGTG GCTACAATCC GGCCGTGGGC
CTGCCTTGCG GTGTCTCCAG CTCTCCGGTC GCCGGAACTG CCTTGGGCAA CCTCGGTCTT
TCCGATCGCC CGCAGCAGTT CGGATCGGAT TCGGTCTGGA GCTACGAGAT CGGATCGAAG
AACAAGCTGT TCGGCCGCGC GCTGACCCTG GAAAGCAGCG CGTTCCTGAT CGACTGGAGC
AATATCCAGC AGCAGGTCCA GCTCAACGCC TGCGGCTTCA ATTTCACCCA GAATCTCGGC
AAGGCGCGCA GCAAGGGCTT CGACGTGCAG TTCCAGCTCA AGGCAGCGGC CGGCCTGACC
CTTGGCGGGT CGATCGGCTA CACCAAGGCC GAGTTCACCC AGACGGTGAA GGGCGGTCCC
GCCGCCACGC TCAACCTTGT CACCAAGGGC GACGACATTC CGCTGAACCC CTGGCAGATC
GTGCTCAATG CCCAGTACGA TTTCGCCGTG GGTGGCAAGG ATGCCTATGT CCGGGCGGAC
TTCCAGCATC TGTCCCGCCA GAATGCGGAC ACGCCAGCCC GCAATCCCGC CAACGGCGTT
GCCGACCTGA CCATTCCGGG CGTGGCCGAG GTCAACAACC TCAATCTCCG GGCAGGCGTG
CGCTTCGATC TGGTGGAACT GGCGGTGTTC GCCAACAACG TCACCGACGC GACGCCGCTG
CTGCTGCGCC AGCATGATGT CGGCTTCTCG ACGCTTTACC GCAACGCCAC GTTGCGCCCG
CGCACCATCG GCCTGACCGC CACTGTACGT TACTGA
 
Protein sequence
MNKQSNLQSA NARKINSYKA LLAGTAVLAV VSPAAAQEAA SEDPGEIVVT ATRQSETISK 
VPLSIAAYSQ EKLDQQGVRR VDDVARLTPG VTFSRGDQRN AGAANISIRG ISSAAGSSTT
GIYIDDTPIQ IRNVGFSAFT PFPAVFDLQR VEVLRGPQGT LFGAGSEGGT VRFITPSPDF
DEMKFYARSE LATTKSGEAS FEAGAAISVP LVKDKLAARV SGYYRRDGGY IDRVDYTSGN
VLDKNSNWQD TKVASASLAW KATDAVTVTP SVYFQETWNN DAGTYWEVLS DAGKGQFNNG
NAVANWNRDR FVLPALKIEA ELGDVSLISN TSYFYRDQKA QNDYTVFEAA LWTGNPFYPA
GMYAPAFQYN RQSNFTQELR LQSNNPDSPL RWVIGGFYAH NRQTARQFVQ DTFLPDLFES
VTGVPFVAVF GQGLVDDKYT FVLDKAASTD EQIAGFGQVD YNLTEQFKLT AGLRVAHTKF
STSAQFVGPV VGPDVDDTGK QSETPVTPKF GLSWQANEDN LVYATASKGF RIGGYNPAVG
LPCGVSSSPV AGTALGNLGL SDRPQQFGSD SVWSYEIGSK NKLFGRALTL ESSAFLIDWS
NIQQQVQLNA CGFNFTQNLG KARSKGFDVQ FQLKAAAGLT LGGSIGYTKA EFTQTVKGGP
AATLNLVTKG DDIPLNPWQI VLNAQYDFAV GGKDAYVRAD FQHLSRQNAD TPARNPANGV
ADLTIPGVAE VNNLNLRAGV RFDLVELAVF ANNVTDATPL LLRQHDVGFS TLYRNATLRP
RTIGLTATVR Y