Gene Saro_2509 details


Gene Information

Locus tag: Saro_2509
Symbol:
ID: 3916830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2712217
End bp: 2714607
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 65%
IMG OID: 640445266
Product: TonB-dependent receptor
Protein accession: YP_497779
Protein GI: 87200522
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
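
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2712217
end_bp = 2714607
gene_length_bp = 2391
protein_length_aa = 796

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
expected_aa = span_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codon count matches protein length:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.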


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.538781
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGCGA TGCTCGCCGG CCTTGCCACT CCGGCTCTCG CCGAGGAACA GCAGGTTGCC 
CAATCCGACA CCGGCCTGGC CGAGATCATC GTCACCGCCC AGCGACGCAC CGAGAACCTT
CAGGACGTGC CGATCGCGAT CACTGCGGCA AACTCCGAAA CGCTCGCCCA GGCACGCGTC
GAAAACGTTG CCAACATCCA GGCGATCAGC CCCTCGATCA GCTTCCGCGT GACCAACATC
GCCACGTCGA GCGCCAACCT CATCATCCGC GGCCTCGGCA CGACCGGCAA CAGTCGTTCG
TTCGAAGGCT CGGTCGGCGT GTTCATCGAC GGCGTCTACC GCACCCGCGC GGCAGCGGCG
CTCCAGAACT TCCTCGACAT CGACAATCTC CAGGTCCTGC GCGGCCCGCA AGGCACCCTG
TTCGGCAAGA ACACCACCGC CGGCGCGCTC CTGCTCAGCT CCGCCGCGCC CTCGCTCAAC
GACGTCAACG GCTCGGTCGA GGCGACCTAC GGCAACTATG ACGGCCTGAT CGTACGCGGA
GCCATCAACG CGCCGCTGTC CGATACGGTC GCCTTCCGCA TCGCGGGCCT CGCGTCCAGC
CAGAACGGCT TCTACACCGA CAGCACCACC GGCGACGATC TCAACGGCAA CAAGACCCGC
GCCGCAAAGG CGCAGCTCCT GTTCGAGCCG AGCGAGAACC TTACGGTCCG CGTGATCGGC
GACTACTCCT ACAGCAACGG CAATTGCTGC TACGCCACTT CGGCCTTCAT CGATGGCCCG
ACCCAGCCGC TGATCGACCT GCTCACGCTC TACCAGCCGT CCAGCAGCGC CCAGCTTCTC
GGCGTACTGA CCGGCGCGCT GCCCGCTTCG TCGATGACGC CGACCGGCCG CACCCTGCCC
TCGCGCGATG CCTCGAAATG GGAGCAGACG CTGAACGGAA ACGGCAAGCA GACCATCGAG
GACTACGGCG GCACGCTGCT CGTCGATGCC TCCATCGGCG AAGGCACGCT GAAGTCGGTC
ACCGCCGTGC GCAAGTTCAA GGTCGATCAG GTCGACCTCG ACCCCGATTT CTCGGGCGCG
GACATCTTCC GCTACAACGA AAGCTTCGAA AGCCGCTTCA TCTCCCAGGA ACTGACCTAC
AACACCAAGA TTACGGCGCT CAATGCCGAG GCGGTCTTCG GCCTGTTCTT CTCGGATGAA
AAGCTCAAGA TGGGCCGCAG CCTGCCCTGG GCCGACCAGG CCCAGTACTA CTGGGACGTG
ATCTTCGCGC AGCTCGGCGT CGCGCCCGGC ACGGCCAACG CCGCCCCCGG CACCTGGACG
AGCGAACGCA TGGGCGGTTC GGCGAAGTCC TACGCCGGCT TCGCGCATCT CGATTTCGCG
GTGAACGACA AGTTCAACGT GATCGCCGGC CTGCGCTATT CGGTCGAGAA GAAGCGCGGC
TTCTTCAACA ACTCGTTCTA TCGCTCCTCG CCGTTCGACG TGTTCACCCT GCTCGGCATC
GCACCGGCGC CGGCCTATGA CGCGACTTCG ACCGACAAGG CGCTGTCCGG AACCTTCGGC
CTCCAGTACC GCCCGACCGA CGACATCATG CTCTATGCAA CGTACAACCG CGGCTTCAAG
GCGGGCGGCG TGAACATGGA CGTGAACGCA GCCGGTACGC TGATCAACAA TGCAGAGGCA
TACAACGCCC TGCCCGCCCC GATCCGCGCC GCCTTCTTCG GCAATGCCGA GGCCAAGGAC
CCGCTGAATC CCCGCTACAA GCCCGAGAAG GTCAACGCCT TCGAGGTCGG CGGCAAGTTC
CAGTACCTTG ACGGCCGCGC GCGCACCAAC ATCGCGTTCT TCTACTACGA CCTGTCCGAT
CTCCAGATCG CTCAGTTCAT CGGCCTGCGC TTCACCGTGC TCAACGCCAA GTCCGCCAAG
GACTACGGCG TCGAGATCGA GAACATGTTC CAGCTCACCG ATGGCCTGAC GCTCGGCCTC
GATGGCACCT GGATCCCGCA TGCGCAGTAC GCGAAGGACG CGAACATCGA CCCGGTCCTG
TCCGGCTCGC GCTTCCGCTT CAGCCCCAAG TTCTCGGGCA ACGCGACGCT GAACCTCGAC
CAGCCGATCA ACGACAACCT CAGCCTGCTC GCCCGCGCAC AGGTCCAGTA CCAGAGCCGC
CAGCTCATAA GCACGGCGAC CACGGCGGAA CAGGGCGCGG TGACGCTGGT CAACGCCAAC
CTCGGCTTCA AGCTGCCGCA GACGGGGCTG CTGATCGAAG GCTGGGTGCA GAACCTGTTT
GACAAGACGT GGTTCACCCA GTCCTTCCCA ACGCCGCTCC AGACCGGCGA CCAGAACGCC
TACCCGGGTG CGCCGCGCAC CTACGGCATC CGCGTCCGCG CGACGTTCTG A
 
Protein sequence
MGAMLAGLAT PALAEEQQVA QSDTGLAEII VTAQRRTENL QDVPIAITAA NSETLAQARV 
ENVANIQAIS PSISFRVTNI ATSSANLIIR GLGTTGNSRS FEGSVGVFID GVYRTRAAAA
LQNFLDIDNL QVLRGPQGTL FGKNTTAGAL LLSSAAPSLN DVNGSVEATY GNYDGLIVRG
AINAPLSDTV AFRIAGLASS QNGFYTDSTT GDDLNGNKTR AAKAQLLFEP SENLTVRVIG
DYSYSNGNCC YATSAFIDGP TQPLIDLLTL YQPSSSAQLL GVLTGALPAS SMTPTGRTLP
SRDASKWEQT LNGNGKQTIE DYGGTLLVDA SIGEGTLKSV TAVRKFKVDQ VDLDPDFSGA
DIFRYNESFE SRFISQELTY NTKITALNAE AVFGLFFSDE KLKMGRSLPW ADQAQYYWDV
IFAQLGVAPG TANAAPGTWT SERMGGSAKS YAGFAHLDFA VNDKFNVIAG LRYSVEKKRG
FFNNSFYRSS PFDVFTLLGI APAPAYDATS TDKALSGTFG LQYRPTDDIM LYATYNRGFK
AGGVNMDVNA AGTLINNAEA YNALPAPIRA AFFGNAEAKD PLNPRYKPEK VNAFEVGGKF
QYLDGRARTN IAFFYYDLSD LQIAQFIGLR FTVLNAKSAK DYGVEIENMF QLTDGLTLGL
DGTWIPHAQY AKDANIDPVL SGSRFRFSPK FSGNATLNLD QPINDNLSLL ARAQVQYQSR
QLISTATTAE QGAVTLVNAN LGFKLPQTGL LIEGWVQNLF DKTWFTQSFP TPLQTGDQNA
YPGAPRTYGI RVRATF
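
Both sequences above are printed in blocks of 10 residues, 60 per row, so the spaces and line breaks have to be removed before the text is used as a plain sequence string. The following is a minimal sketch of that cleanup, with a GC-fraction helper added for illustration. The function names `ungroup` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as input, so the printed GC fraction describes that row and not the whole gene.

```python
def ungroup(seq_text: str) -> str:
    """Drop the spaces and line breaks that group residues into blocks of 10."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied as displayed.
first_row = "ATGGGCGCGA TGCTCGCCGG CCTTGCCACT CCGGCTCTCG CCGAGGAACA GCAGGTTGCC"

dna = ungroup(first_row)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Passing the full gene sequence text to `ungroup` instead of `first_row` would give a value to compare against the GC content field.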