Gene Saro_3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3778 
Symbol 
ID5077926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp417242 
End bp419710 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content63% 
IMG OID640481501 
ProductTonB-dependent receptor 
Protein accessionYP_001166163 
Protein GI146276003 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGGAT CGGACATGAA GGGACTTCGC ACGGCATTCC TGATCGGCAC GGCGCTATGT 
GCGGTGGCCG CTCGACCGGC GCTGGCTGAC GAGGCGGAGC AGCCTCAGGC GTCGGCCACC
CCCGGCCCCG AAGAGATCAT CGTGACGGCG CAGCGTCGCG CGGAATCGCT GAACGATGTG
GGCATGGCGA TCCAGGCGGT CACCGCCGAC ACGCTCCAGG CACTGCGCGT CACCGACGTG
CGCGACCTTA CGACCGTCGC CCCGAGCTTT ACCGTATCGC AAAGCTACCA GGGCGTGCCG
ACCTATACCC TGCGCGGCAT CGGCTTCAAC ACGATCAACC TTTCGGCCAC CTCCACCGTC
GGCACCTATG TCGATGAAGT GGCCTATGCC TATCCGATCA TGAACACCGG CCCCGTCATG
GACCTCGAGC GGGTGGAAGT ACTCAAAGGC CCTCAGGGCA CGCTATACGG GCGCAACACA
ACGGCCGGCC TCATCAACTT CATTACCGCC AAACCGACCG ACACGTTCGA AGGCGCGGTC
AGCGCGGACG TCGGCAACTA CCGCACATGG AACCTGGGCG GGCATGTCTC GGGCCCGCTG
GGCGAAGGGA TCGCCGCGCG CATCGCGGTA CGGTCAGAAC AGTCGGACAA GGGCTGGCAG
GTGAGCAACA CCCGGGGCGA CCGCCTGGGC AAGATCGACA AGCTGGGCGT ACGCGGCGCG
GTCGCCATCG ATCCTTCGGA CAAGACGCAC ATCGACCTGT CGGTGTTCTG GTGGCGCAAC
CGTTCCGACA CTGTGGCGGG GCAAGGCATC GGCTTCACCC CGGCAACCGA TCCGGTAACC
GGGACGAGCC TGTCGCACCT GTTCAACGCT CCCGGCCTTG CCGACTACAT CGCCAACAAT
TTCCCGACCA GCGCGACCCA GGCCGACTGG GCGCCGGAGG CCAGCCGTTC CGCCGATGTC
GGCACAGGCC TCGGCCTTGA CGGACCCTTG CGCGAGAACA ACCGCTTCTG GGGCCTGAAG
CTGCGGTGGG ACCAGTACAT CGGCGACACG ATGAAGTTCG TCAGCCTGAC CAGCTACAAC
GATTTCAAGC GCGACGCTCT TTCCGACTGG AGCGGCGCGC CCTTCGAGGT CCTGCTCCAG
AACACCGTCG GCCGGATCAA GAGCTTTGCG CAGGAAGTTC ACCTCGAAGG CGAGACCGAC
AAGATGAACT GGCTGGTCGG CGGATACTAT GCCAACGACC GGATCATCGA TTCCAACCGA
ACGATGCTGG GCCAGAACGC CAATGTCGGG CTGATCCGCG CGGTTGGGTC TACCCTGCTG
GGTACGCCGT TCAACTCGAA CGGATACACC CTGACCGAAA TGCTCCAGGC GTTCCGCAGT
TACGAAGACT TCGGCCGCAT TCGCACGCGG ACATGGAGCC TGTTCGCCAA TGCCGACTGG
CAATTCACCG AACAGCTCAA GCTGACCGCA GGCGTCCGCT ATACCGAGGA CCGCCAGCGC
TACAACGGCT GCTCGCGCGA CTTCAACGGA AACATGCTGC CCAACGTGAA CGTTGTGAAC
CGGGCGCTCT ACTTCCAGTC CTATGGGGTG CTTGCCGCGC CGATCACCCA GGGGCAGTGC
AACACCTTCG ATCCGGACAG CGGGACCTTC GGCGAAGTCC GCTCGGTCCT TTCGGAGAAC
AACGTCGCCT GGCGCGTTGC GCTGGACTGG TCGCCCAATG ACGACACGCT TCTCTACGGA
TCGGTCTCGC GCGGGTACAA GTCTGGCACG ACGCCGATCA ACGCGGCGAA CCTTGCCCGC
CAGAACGCCC CCGTCACGCA GGAAAAACTG ACCGCCTATG AACTGGGCAT CAAGGCCAGC
CTTGCCGACC GCCGGGTACA GGCGAACCTT TCGGCCTTCT ACTACGACTA CCGCGACAAG
CAGATCAGCA CCTACTTTGC CGATCCGATC TACACCGCCC TTTCGCGACT GGACAACGTG
CCCGATTCCG AGGCCTATGG CGTAGAGGCC GAACTTGTCG TGCGACCGGT GCAGGGCCTG
ACGATGACCG GCAACGCGCT GTGGCTCAAG ACGCGGATCA ACGGCTACAA CGGCACCAAT
GCCGCCGGCG AGGCCCAGAA CTTCGATGGC GCGGAATTCA TCTACAGCCC GCACTTCCAG
GGCAGCGCGA CAATTGCCTA CGATGCGCCG GTGGGCAGCG GCCTGAGCGC GACCGGGGCG
GTGAGCCTGC GCTACCAGTC TGAGTCCAAC ACGATCTTCG AGGACCTCGC GCTCTACAAG
ATCAACTCCT ATGCCACCGT CAATGCAAGC ATCGGCTTGA AGAGCGAAAG CGGCTGGTCG
GCTTCGATCT GGGCAAAGAA CCTGTTCGAC AAGTATTACT GGTCTGCCGT GGCCAGCAAC
GCCAACGTCG TCGTGCGTTT CCCCAACCCG CCGCGCACGT TCGGCGTGAC GCTTGGCTAC
AACTTCTGA
 
Protein sequence
MRGSDMKGLR TAFLIGTALC AVAARPALAD EAEQPQASAT PGPEEIIVTA QRRAESLNDV 
GMAIQAVTAD TLQALRVTDV RDLTTVAPSF TVSQSYQGVP TYTLRGIGFN TINLSATSTV
GTYVDEVAYA YPIMNTGPVM DLERVEVLKG PQGTLYGRNT TAGLINFITA KPTDTFEGAV
SADVGNYRTW NLGGHVSGPL GEGIAARIAV RSEQSDKGWQ VSNTRGDRLG KIDKLGVRGA
VAIDPSDKTH IDLSVFWWRN RSDTVAGQGI GFTPATDPVT GTSLSHLFNA PGLADYIANN
FPTSATQADW APEASRSADV GTGLGLDGPL RENNRFWGLK LRWDQYIGDT MKFVSLTSYN
DFKRDALSDW SGAPFEVLLQ NTVGRIKSFA QEVHLEGETD KMNWLVGGYY ANDRIIDSNR
TMLGQNANVG LIRAVGSTLL GTPFNSNGYT LTEMLQAFRS YEDFGRIRTR TWSLFANADW
QFTEQLKLTA GVRYTEDRQR YNGCSRDFNG NMLPNVNVVN RALYFQSYGV LAAPITQGQC
NTFDPDSGTF GEVRSVLSEN NVAWRVALDW SPNDDTLLYG SVSRGYKSGT TPINAANLAR
QNAPVTQEKL TAYELGIKAS LADRRVQANL SAFYYDYRDK QISTYFADPI YTALSRLDNV
PDSEAYGVEA ELVVRPVQGL TMTGNALWLK TRINGYNGTN AAGEAQNFDG AEFIYSPHFQ
GSATIAYDAP VGSGLSATGA VSLRYQSESN TIFEDLALYK INSYATVNAS IGLKSESGWS
ASIWAKNLFD KYYWSAVASN ANVVVRFPNP PRTFGVTLGY NF