Gene Saro_3147 details


Gene Information

Locus tag: Saro_3147
Symbol:
ID: 3918189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3356399
End bp: 3358726
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 66%
IMG OID: 640445931
Product: TonB-dependent receptor
Protein accession: YP_498416
Protein GI: 87201159
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
            [TIGR03304] outer membrane insertion C-terminal signal
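
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3356399, 3358726)
print(length, protein_length_aa(length))
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.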


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.774521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCGTA TGTCCACGCG CGCCGTTCTG GCCGCATCTG CCGCGCTGTT CGCGCTGCCT 
GCCGTCCCCG CCATTGCGCA GGAAGCACCG GCTGACGAAA CCACCGCCTC GAACGAGATC
ACCGTCATCG CCCGCCGCCG CGAGGAGCGC CTGCTCGACG TGCCGATCGC GATCTCCGCG
CTCAGCACCG AAGCGCTCGA CAAGGCCGGC GCCAAGGATC TCTCCGGCGT CCAGGGCGCG
ATCCCCAACG TCAACATCGT GCAGGGCCGC GGCTCGGCCA GCAGCGCCAA CTTCTACATC
CGCGGCATCG GCCAGCCGGA CGCGCTGCAG ACCTTCGACC CGGCTGTCGG CGTCTATGTC
GACGGCGTTT ACCTCAGCCG CATCCAGGGT GCGCTGCTCA ACCTGTTTGA CGTCCAGCGC
GTCGAAGTCC TGCGCGGGCC GCAGGGCACG CTCTATGGCA AGAACACCAT CGGCGGCGCG
GTCAACGTCG TCTCGAAGAA GCCCGACCTC AACGACTTGC GCGGCGAAGC GTCGATCACC
TACGGCCGCT TCGACGAAGT GACCGCCAAG GGCTACGTCT CGGCCCCGCT TGTCGCGGAC
AAGCTCGCCC TCTCGGTCGC GGGGGTCTAC GACGACCGCG ACGGCATCGT CACCGATCCC
GCCACCGGCC GGAAGTACAA CGACCGCAAC AACCTCTCGG GCCGCGCAAT CCTTCGCGCC
CAGCCGACCG ACACCGTCGA GGTCCTGATC TCGGGCGATT ACACCCGCCA GCGCAACTCG
CTGACCATGG GCCAGGCGAC CGCCCCGCTG ATCGGGTTCG ACTACAACGC CGACTTCAGC
GCCGTCACGC CCTTCGTGAT CGCGCCCGCC GCCACCGGCG AATGGGACTA CAAGGCCTCC
AGCAGCTTTG CCGGCGACAA GGGCCAGAAG CTCGACCACT GGGGCGTTTC GGGCACGATC
AACGTCGACC TGTCCGATAC CCTGCAACTC GTCTCGATCA GCGCCTACCG CAAGCTCAAG
ACCGACTTCT TCGTCGACAT CGACGCAACC ACCGCCGAAG TGGGCGACGT CTTCGTCGGC
ACCCGCCAGC ACCAGTTCAG CCAGGAACTC CAGCTCAAGC TCGATGCCGA CAAGCTCAAG
GGCGTGCTCG GAGTCTACTA CCTGAACGAG CACGTGACCT CGCACCAGGA AGCCTATGCC
GACAGCTACC TGCGCTATGT CGGCACGCCG CTCAACTTCC TGCGCACCAT CGATGACGAG
CAGGACACCA AGTCCTACGC CGCCTTCGGC CAGCTTACCT ACGACTTCAC CGATGCGGTC
TCGCTGACCG GCGGCCTGCG CTACACGCGC GAGACGAAGG AATACTTCCG CACCACCACG
GCCACCACGT CGAGCCCGAT CTTCCCGGCC CTGGTCATCA AGGGCACCTT CACCTTCCCG
ACCAACCTGC CCGCCCCCTA CAACACGCTC GACAGCGTGA CCTACGAGGC GTGGACCCCC
TCGGCGACGC TCAGCTACAA GCCCTCGCGC AACACGATGC TCTACGGCTC GGTCAGCCGC
GGCTTCAAGT CGGGCGGCTT CAACGGGCGC GTCAACGGGC TCGGCGACGT CACCCAGGTG
GTCGACGGCA CGACCGTCGT CGTACCGACC TTCAAGCCTG AAACCGTGTG GACCTACGAA
GTCGGCGCCA AGGGCTCGTT CCTCGACGGG CGCGTGAACA TCTCGGGCGC GGCGTTCTAT
TCCGACTACG CGAACTTCCA GGCGCGCGTC GGCGGCGGCA ACACCGGCAT CAACGGCGGC
AGCTTCCCCG TGCTCAACGC CGGCAAGCTG CGCATCCAGG GCTTCGAGTT CGACGTCAAC
GTGCGGCCCG CCGATCCGGT CACGCTGTTC GCCTCGGTCG GCTATCTCGA TGCCGACTAC
AAGGAGTTCA ACGACGGCCG CCGGGCGCCC GCGTTCTCGT GCAACCCGAC TGGCGCGAAG
GTGACCTGCA AGCCCGCCTT CGCCCCGCCG CTTACCCTTC GCGCGGGCGG CGAATACCGC
GTGCCGCTGG GCGATGCGAC GCTGAGCCTG GGCGGCGACG TCCGCTTCGT CGACAAGCAT
TACCTGTCGG TCGACAACCG CCCCGGCCTC ACCGAAGACG GCTACCTGAT CGGCAACCTC
TATGCCCAGG TGGACTTCGA CAAGTTCTAC CTGCGCGGCG CGGTCCGGAA CGTAGGCAAC
ACGCTCTACA AGACCGACGG GCAGGAATTC AGCTCGGTCG GCAACATCCA GACCGTCTAC
TATGGCGACC CGCGCACGTG GAACGTCACG CTCGGCGTCC GCTTCTGA
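
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a rough sketch of how the reported GC content could be re-derived, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and rounding to a whole percent is an assumption about how the record reports the value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence into `raw` and compare with the GC content field.
raw = "ATGACCCGTA TGTCCACGCG"
print(clean_sequence(raw), round(gc_percent(raw)))
```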
 
Protein sequence
MTRMSTRAVL AASAALFALP AVPAIAQEAP ADETTASNEI TVIARRREER LLDVPIAISA 
LSTEALDKAG AKDLSGVQGA IPNVNIVQGR GSASSANFYI RGIGQPDALQ TFDPAVGVYV
DGVYLSRIQG ALLNLFDVQR VEVLRGPQGT LYGKNTIGGA VNVVSKKPDL NDLRGEASIT
YGRFDEVTAK GYVSAPLVAD KLALSVAGVY DDRDGIVTDP ATGRKYNDRN NLSGRAILRA
QPTDTVEVLI SGDYTRQRNS LTMGQATAPL IGFDYNADFS AVTPFVIAPA ATGEWDYKAS
SSFAGDKGQK LDHWGVSGTI NVDLSDTLQL VSISAYRKLK TDFFVDIDAT TAEVGDVFVG
TRQHQFSQEL QLKLDADKLK GVLGVYYLNE HVTSHQEAYA DSYLRYVGTP LNFLRTIDDE
QDTKSYAAFG QLTYDFTDAV SLTGGLRYTR ETKEYFRTTT ATTSSPIFPA LVIKGTFTFP
TNLPAPYNTL DSVTYEAWTP SATLSYKPSR NTMLYGSVSR GFKSGGFNGR VNGLGDVTQV
VDGTTVVVPT FKPETVWTYE VGAKGSFLDG RVNISGAAFY SDYANFQARV GGGNTGINGG
SFPVLNAGKL RIQGFEFDVN VRPADPVTLF ASVGYLDADY KEFNDGRRAP AFSCNPTGAK
VTCKPAFAPP LTLRAGGEYR VPLGDATLSL GGDVRFVDKH YLSVDNRPGL TEDGYLIGNL
YAQVDFDKFY LRGAVRNVGN TLYKTDGQEF SSVGNIQTVY YGDPRTWNVT LGVRF
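
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown in this record.
print(translate("ATGACCCGTA TGTCCACGCG CGCCGTTCTG GCCGCATCTG CCGCGCTGTT CGCGCTGCCT"))
```

Translating the first and last printed lines of the gene sequence this way gives the first 20 and last 15 residues of the protein sequence listed above.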