Gene Saro_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3042 
Symbol 
ID3916654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3255880 
End bp3258405 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content65% 
IMG OID640445822 
ProductTonB-dependent receptor 
Protein accessionYP_498311 
Protein GI87201054 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCC TGGTCGCCGC GTCGACCATC GCGATCGGCA GCGTCGCGCT GGCCAGCGCC 
GCCCACGCCC AGTCGACCGG TTCGGTCGAC GTCGAAGAGG CGATCGTCGT GACCGGCACG
CGCGCGGATG CCGCCGTCAA CGGCTTCAAG GCCCCCGAAA CCCCCAAGGC CAAGGCCGTC
CTGACCCAGG AACTCGTCGC TCGGCAGAAC CCCGGCAAGG CGATCTTCGA CACGATCAAC
ATCGTGCCGG GCGTCAATTT CACCAGCACC GACCCCTATG GCGCCGCAGG CGGCAACTTG
CGCATTCGCG GCTTCGACGG CGCGCGCATC TCGGCCACGT TCGACGGCGT CCAGGTCAAC
GATTCGGGCA ACTATTCGCT CTACACCAAC CAGCAGCTCG ACTCCGAACT GATCGAGCAG
GTCAACATCA ACTTCGGCGC GACCGACGTC GACAGCCCGA CCGCGAGTGC TGCGGGCGGC
ACCGTCAACT ACCGCACCCG CCTGCCCAAG GAAGAGCTTG GCGCCGCGAT CAACTATTCG
CACGGCACCT TCAACTACAA CCGCGTGTTC GGCGTGATCG ACACCGGCGT GTTCACGCCC
TTCGGCACCC GCGCGTTCTT CTCGGCCAGC GACACCAAGT ACGACCAGTT CCGCGGCCCA
GGCGGCATCC ACAAGCAGCA GTACAACGTC CGCGTGTACC AGCCGATCGG CGAAAACGGT
GATTTCGTGA GCCTCGCAGG CCACTACAAC GAGAACCGCA ACAACTTCTA TCGCCGCGTC
GGCATCAACG ACATGCGCAC CCTGCTGGGC TCCGCCACGA TCCCGGCGTC GGCCAGCATC
ACTCCCGCTT CCCCGCTCGA TCTCGGCAAC CTGACCGACG CGCAGCAGGA GACCATCTTC
AACTTCAACA ACGATGCCAC CTGCACGCTG CCCAGCAGCA GCGGTGGTGC GGGCCAACAG
TCGGACGCGT CGTCCTGCGC CAACTACTAC AATACCTCGA TCAACCCTTC GAACACCGGC
AACATCCGCT TCAACTCGCG CTTCGCGCTG AGCGAGAAGC TGATCGCGAC GCTTGATGCC
AGCTATCAGT ATGTCCTCGC CAATGGCGGC GGCACCTCGG TCTTCGCGGA ATCCGACGCC
AACCGCAGCG TCTCGGGCGT CTACAGCCGC CAGGGAACGC GCATCGGCGG TGGCGTCGCG
AGCGGCGTCG ACATCAATGG CGATGGCGAT ACGGATGATC TCGTGCGTCT GCTGTTCCCG
TCGAACACCC GCACGCACCG TCTCGGCGCC ACGCTTTCGC TGCGCTACGA AGCGTCGCCG
GAAAATACCT TCCGCGTTGC CTATACCTGG GACCGCGCCA AGCACCGCCA GACCGGCGAA
GCGGGTCGGC TCGACCAGCT CGGCAACCCG CTGAACGTCT TCGGCGGCAT CGGCGACGAT
GATTCCGCGG TGAAGGATGC GGCCGGCAAC GTCCTCCAGA AGCGCGACCG CCTGTCCTAC
GCGATCCTGC ACCAGGTCGC GGGCGAATAC ATCGGCAAGT ACTTCGACAA TACGCTGACC
GTGCAGGCCG GCGTGCGCGC GCCGTTCTTC CGCCGCAACC TGACCAACAA CTGCTGGACT
ATCGCCGGTA GCTCGAACGA CGCCTACTGC ACCTCGGAAA GCGATGCCGT GGTCGAAGCC
AAGTACCCGG CCTATGCCGC GCCCTACGCC GCCCGCAAGG TGGCCTACAG CGCAGTCCTG
CCCAATGCCG GCTTCGTCTA CAAGATCACG CCGCAGGTCA ACGTGTTCGG CAACTTCAGC
CAGGGCTTCT CGGCCCCGCG TACCGACAAC CTCTACGGGT TCGATGGCGT GAAGATCCAG
CCGACCTCGC TGGTGAAGCC GGAACGCACC AACAGCTTCG ACCTCGGCGC GCGCTATACC
AGCCGCGTCG TCCAGGCCCA GGCCAGCGCA TGGTACATCG GGTACAAGAA CCGCATCATC
TCGTCGCAGG TGCTGCTCGA GGACGGCAGC TCGCTCAACC TCGACCGCAA CGTCGGTCGC
GTGCGCAGCT ACGGCTTCGA CGCCAGCGTC GCGGTGCGCC CGGTCGACAT GTTCTCGCTC
TACACCTTCG CGTCCTACAC CAACGCGAAG CTGAGGGACG ACGTGGTCTC GCCCGCCGGC
GCGATCCTCT CGCCGACCAA GGGCAAGTTC GTGGCCGAAA CGCCGAAGTG GCAGGTCGGC
GGCCGCGCCC AGTTCGACTA CGAGCCCGTC TCGATTGGCG CGCAGGTCAA GTACGTGGGC
GACCGCTTCC TGACCGACAT CAACGACGTG ATCGCACCGT CCTACACCAC GGTCGATCTC
GATGCGCGCG TCAACCTCGG CAAGGTCAAC GACAAGGGTT CGATCTACCT GCAGCTCAAC
GTGATCAACC TGTTCGACAA GTTCTACATC GGCAACCTGT CGACGCAGGC TGCCGCCTCG
AACAACCCGC AGGTCGAGTT CGGCTCGCCG CGCACTTTCG TCGGCTCGAT CCACTTCGAG
TTCTGA
 
Protein sequence
MKFLVAASTI AIGSVALASA AHAQSTGSVD VEEAIVVTGT RADAAVNGFK APETPKAKAV 
LTQELVARQN PGKAIFDTIN IVPGVNFTST DPYGAAGGNL RIRGFDGARI SATFDGVQVN
DSGNYSLYTN QQLDSELIEQ VNINFGATDV DSPTASAAGG TVNYRTRLPK EELGAAINYS
HGTFNYNRVF GVIDTGVFTP FGTRAFFSAS DTKYDQFRGP GGIHKQQYNV RVYQPIGENG
DFVSLAGHYN ENRNNFYRRV GINDMRTLLG SATIPASASI TPASPLDLGN LTDAQQETIF
NFNNDATCTL PSSSGGAGQQ SDASSCANYY NTSINPSNTG NIRFNSRFAL SEKLIATLDA
SYQYVLANGG GTSVFAESDA NRSVSGVYSR QGTRIGGGVA SGVDINGDGD TDDLVRLLFP
SNTRTHRLGA TLSLRYEASP ENTFRVAYTW DRAKHRQTGE AGRLDQLGNP LNVFGGIGDD
DSAVKDAAGN VLQKRDRLSY AILHQVAGEY IGKYFDNTLT VQAGVRAPFF RRNLTNNCWT
IAGSSNDAYC TSESDAVVEA KYPAYAAPYA ARKVAYSAVL PNAGFVYKIT PQVNVFGNFS
QGFSAPRTDN LYGFDGVKIQ PTSLVKPERT NSFDLGARYT SRVVQAQASA WYIGYKNRII
SSQVLLEDGS SLNLDRNVGR VRSYGFDASV AVRPVDMFSL YTFASYTNAK LRDDVVSPAG
AILSPTKGKF VAETPKWQVG GRAQFDYEPV SIGAQVKYVG DRFLTDINDV IAPSYTTVDL
DARVNLGKVN DKGSIYLQLN VINLFDKFYI GNLSTQAAAS NNPQVEFGSP RTFVGSIHFE
F