Gene Saro_3508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3508 
Symbol 
ID5077657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp117083 
End bp119605 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content65% 
IMG OID640481232 
ProductTonB-dependent receptor 
Protein accessionYP_001165894 
Protein GI146275734 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCATGGGT GCCGCCGGCA CTCCGTCGCC GCATGGCGCG CCATCCCCAC CGGCCTCCGC 
GCGCGGGACA ACGGGAGGCA AGACATGATC AGCAATGCCA GGTTCACCGC TTTCACCTTC
GCCGCGCTTG CCGGAGCATC GCTGCTTCCG CTGGCATCGG CCCATGCGCA GGCGGCCCCG
CAGGACGTCG CCGCCGACAG CGAGCAGGCC AGCGACGGTT CCGGCATCGG CGACATCATC
GTCACCGCCC GCCGCCGCGA GGAACGCCTC CAGGACGTGC CGGTCTCGGT CACCGCGCTT
TCGGCCGAAC AGATCCGCAA GTATGACATG ACCAGCCTGG AGAAGATCTC CACGCAGACG
CCGCAGCTTA CCATCGGTCG CGCTTCGAAC GGTTCTGGCG CGCAGCTCAC CCTGCGCGGC
ATCGGTTCGT CGTCGACCTC GATCGGCATC GAGCAGTCGG TCGCGGTCAT TCTCGACGGC
GTCTACTACG GCCAGGGCCG CGTCATCAAC GAAGGCTTCC TCGACCTTGC CGGCGTGGAA
ATGCTGAAGG GCCCGCAGGC CCTGTTCTTC GGCAAGAACG CTACCGCCGG CGTGATCTCG
CTGCGTTCCG CCAATCCGTC GAACAGCCCG GAATTCATGG CCCGCGCGGG GTACGAGTTC
AAGGCGAAGA ACCTTGTCGG CGAAGTCATG GGCTCGGGCC CGCTGTCCGA TACGCTCGGC
ATCCGCATCG CGCTGCGTGG GTCGAACATG TTCGGCGGCT ATTTCAAGAA TGGCGGCATC
GACAAGACCT ACAACACCAC CAACATCAAC ACCGGGGTCG TGACCCCGCA CCTTGCCCCC
GCGCTCACCG GCGACAATCC CGGCGAACGC GAAGTGCTGG GCCGCGTGAC TTTGCAGTAC
AAGCCGACCG ATCGCCTCAC CGCGACGCTC AAGGCCAATG CCTCGTTCAA CGACAACGAC
AACAACAGCT GGAACTACGT TCCGGTCGCC TGCGCCAATG GCACTTATGC GCTGAACCCG
GCGATCAAGT GCGGCAAGCA GTTCACGATC TACCAGAACC GCTTTCCCGA GGATCTCGCC
GGCACCAATC CGTTCAGCCG CGCCGATGGG GGGCTCTACA ACCGCTATCG CAGCTGGGCC
GTCACCGGCA CGCTCGATTA TGCGCTCGAC GATCTGACGC TGACGTGGAT CAACAACTTC
AACCGCAACG TCAACCAGTG GGCCTGCGAC TGCACGTTCG TCTCGTCGGA CGCCGCCGCC
GCACCTTCGA CGGAGAAGTC GAAGTACCAC GCCTTCTCGT CGGAACTGCG GGCGCAGACC
TCCTATGACG GTCCGGTCAA CGTCCTTGCC GGCATCTATT ACCAGAAGAC CAGGCGCGAC
CACACGCAGA CCGGATCGTT CGGCAATGTC GAGGACGATA CCGCACCCGC CGCCTACCGC
TATCTCGGCT ACCTGAAGCG GTCGGAAACC GACGGCGAAA CGATTTCGGG CTATGGCCAG
GCCACGTGGA AGCTGGTCGA AGGGCTCGAG GCGACCGCGG GCGTACGCTA CACGCACGAG
ACGAAGGACA GCTATCTCGT CCAGCCCTAT GTCAACGCGG CGCTCCAGTT CCTCTTCCCG
CAGGACAAGG TGATCCGCGC CGGGCAGGCC TTCGACAACT GGTCGCCCGA AGCCACGCTC
ACCTGGAAGC CCACCAGCGA CATCACCGTC TATGGCGCCT ACAAGACCGC CTACAAGTCG
GGCGGCTTCT CGAACTCGGG CTTCGTCAGC ACCGGCACCG TGCCGAGCGA CGTCGCCTTC
AATCCCGAAA AGGCGCGCGG CTTCGAAGCG GGCATCAAGA CCACGCTGCT CGACCGCCAG
CTTCGCGTCA ATCTCGGCGT CTACACCTAC AAGTACGTCG ACCTGCAGGT GGACTTCTTC
AATTCCAACA CCTTCGCCTT CATCACCACC AATGCCGGCG GCGTGCGGAC GAGGGGCGTC
GAGCTGGAGT TCGAGTTCGC ACCGCGCGCC CTCGATGGCT TCAACCTGCA TGGCACGGTG
AACTATAACC GCGCCCGCTA CACCAACTAC ATTGCCCCCT GCTATGGCGG GCAGAGCGTG
GATGCGGGCT GCGACACCGT GTTTCAGGGC GCGGGCGGCC AGGACCTCAG CGGCAAGCCG
ACCGCCGTGG CGCCCGCATG GACCGGCTCG TTCGGCTTCA GCTACGAGAC GCCGGTTGCG
GACAACCTCA ACTTCGGCCT TTCCGCCGAC AGCCGCTACT CCGGCTCCTA CCTTGCATCC
AGCTTCGCCC ACCCGCTGTC GCGCCAGGAC GAATACCTTA CCATCGACGC CAGCGTCCGC
CTGCGCACCG CCGACGACAA GTACGAACTC GCGCTCATCG GCAAGAACCT GACCAACCGC
TTCATCGTGG GCGGCGTGGT CGATGCGCCC AACACGCCTG CCGTTGTCGG TCAGGTTGCT
GACCAGATGG GCTTCGTCTC GCTGCCGCGC ACCGTGCAGG TCCAGGCGAC CGTCCGCTTC
TGA
 
Protein sequence
MHGCRRHSVA AWRAIPTGLR ARDNGRQDMI SNARFTAFTF AALAGASLLP LASAHAQAAP 
QDVAADSEQA SDGSGIGDII VTARRREERL QDVPVSVTAL SAEQIRKYDM TSLEKISTQT
PQLTIGRASN GSGAQLTLRG IGSSSTSIGI EQSVAVILDG VYYGQGRVIN EGFLDLAGVE
MLKGPQALFF GKNATAGVIS LRSANPSNSP EFMARAGYEF KAKNLVGEVM GSGPLSDTLG
IRIALRGSNM FGGYFKNGGI DKTYNTTNIN TGVVTPHLAP ALTGDNPGER EVLGRVTLQY
KPTDRLTATL KANASFNDND NNSWNYVPVA CANGTYALNP AIKCGKQFTI YQNRFPEDLA
GTNPFSRADG GLYNRYRSWA VTGTLDYALD DLTLTWINNF NRNVNQWACD CTFVSSDAAA
APSTEKSKYH AFSSELRAQT SYDGPVNVLA GIYYQKTRRD HTQTGSFGNV EDDTAPAAYR
YLGYLKRSET DGETISGYGQ ATWKLVEGLE ATAGVRYTHE TKDSYLVQPY VNAALQFLFP
QDKVIRAGQA FDNWSPEATL TWKPTSDITV YGAYKTAYKS GGFSNSGFVS TGTVPSDVAF
NPEKARGFEA GIKTTLLDRQ LRVNLGVYTY KYVDLQVDFF NSNTFAFITT NAGGVRTRGV
ELEFEFAPRA LDGFNLHGTV NYNRARYTNY IAPCYGGQSV DAGCDTVFQG AGGQDLSGKP
TAVAPAWTGS FGFSYETPVA DNLNFGLSAD SRYSGSYLAS SFAHPLSRQD EYLTIDASVR
LRTADDKYEL ALIGKNLTNR FIVGGVVDAP NTPAVVGQVA DQMGFVSLPR TVQVQATVRF