Gene Saro_1691 details

Gene Information

Locus tag: Saro_1691
Symbol:
ID: 3916266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1773187
End bp: 1775418
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 64%
IMG OID: 640444432
Product: TonB-dependent receptor
Protein accession: YP_496965
Protein GI: 87199708
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
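
The start, end, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: start_bp and end_bp are 1-based, inclusive coordinates,
    and the last codon of the CDS is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(1773187, 1775418))  # (2232, 743)
```

Under these assumptions the result agrees with the listed gene length (2232 bp) and protein length (743 aa).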


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0224052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGAAC GTCTTTACCG TGGAATCGCG CTGGCAGGGG TAGCCGCCGC ATCCATTTTC 
GCCGCAGCGC CCGTATTGGC GCAGGAAGTG CAATCCGACA GGGCCGTGCC CGACGGCGAG
ATCGTCGTGA CCGCAACCAA GCGTGCGGAA AGCCTGCAAT CGGTCCCCAT CTCGGTCTCG
GCCATCGGCG GCGATACGCT TGCCAAGGCG CGGGTCGGCA GCGTGGACAG CCTGGTGACC
AAGGTCGCCA ATCTTCAGCT CACGTCGATC GTGGGCGACA ACACGCCGAT CTTCGCGCTG
CGCGGCGTAT CGATGTCCGA CTACAGCCTC AACCAGTCGA GCCCCGTCGC AACCTATTAC
GACGAAGTCT ACAAGGGCAA CTTCGCCTTC CTCGGCGTCA CCATGTTCGA CCTTGAGCGC
GTCGAGGTAC TGCGCGGGCC GCAGGGCACG CTCTATGGCA AGAACACCAC CGGCGGCGCG
GTCAACATCA TCGCCAACAG TGCGAAGCTG GGCGAGACCA GCGGCTATTT CAGCGCCGGC
TATGGCAACT ATGACCGCTT CGACCTCAAC GGCGCGGTCA ACGTGCCGCT GGGCGAAAAG
GCGGCCCTGC GCATCGCGGG CACTTATGCC CGGGCCGATG GCTGGTTCAA GAACGTGGTC
CCGGGCAAGC CCGATCTCGC CTCGACCGAC GAGTACGCCA TCCGAGGCAC GCTGAACTTC
GAGGCGAGCG ATACCGTCCG CTTCGTCCTG CGCGCGTCGA CCAGCTACCA GAACCCGCAG
AACTATGGCA TCTATGCCCA GCCCGAAGAC GTCAATCGCC CCGGTCTCGA CCGCTGGGAG
ATCGCCTCGA ACATCGCCAC CAAGCGCAAG GCGCGCACCT ATTCGGTCGC GCTTACCAGC
AACTTCGACG TGTCGGATAC GCTGACCGTC ACCAGCATCA CCTCGTGGGA CAAGGGCAAC
CTGTTCTTCT ACGAGGACAC CGACGGCACC GCCTCGCAAC TGCTCGAAAT CCCCTATACC
GACCGCGCCA CCCAGTTCGC GCAGGACCTG CGCCTGACCA GCGACACCGG TGGCCCGTTC
GATTTCATCC TCGGCGCCTA CTTCAACCGC GAGAAGGTCT ACAACGAGAC TGCCTTCGAG
ATCGGCAAGG ACATTGACCT TACCGGCGAC AACATCGTCA CGGCAGACGA CTGCGTCGAA
GGCCTGACCA ACGAAGATGG CAGCGACGAT GGCATCGCCT GCCTCTTCCG CAACCGCTTC
GACCAGGTGA AGAAGAGCTA CGCGATCTAT TCGGACCTCA AGTACCAGGT CACCGATGCG
GTGACCCTGC GTGGCGGCCT GCGCTATACC CACGATACGG GCCGGCAGAG CGGCTTCCGC
TCCGATGCGC TGGGCGTCGA CGGGTCCGAG GTGGCCAACC TGATCCCCTT GTCCTCGCTC
AGCTATTCGC AGGACAACCT CTCCGGGAAG ATCGGCCTCG ACTACAAGCT GGCCGATGGC
AACCTGCTCT ACGCCAGCGT CAGCCGGGGC TACCGCGCGC CCAGCTTCAA CGCGCAGGCC
TTCTTCGATC CGTCGGAGCT TTCGGTCGCC AAGCCCGAGC AGGTGACCTC GTACGAAGTC
GGCGCGAAGA CGCAGTTCCT CGACCGCCGC ATCACGCTCA ACGTGGCCGG GTTCTACTAC
GACTACCGCA ACCAGCAGTT CATCAACGTC GACCCGGTAC TGGGCTCGCA GACGCTGCTG
AACATTCCCA AGTCGCGCAT CTATGGCGGC GAGGCCGAGC TGACGATCCG CGCCAGCGAC
CGGCTGACCC TGCACAGCGG CATGGGCGTC CTTGCCACAA AGATCCAGCG CGGCAGCGTG
AGCGGCGTGG ACGTTTCCGG CAACCGCCTG TCCAACGCAC CGACCTTTAC CTTCAACGCC
ACGATCGACC TGACGCTGGT CGATGGCGAC ATGGGCAAGC TCTCGGTCCA CCCGGACGTG
GCCTACCAGT CGAGCCAGTT CTTCGAAGTG CTCAACATCC CCCGCCTGCG CCAGACTTCC
TACGCGCTGG TCGGCGGGCA CATCGACTGG GAAAGCGCCG ACGGGCGCTT CAATGCCTCG
GTCTGGGGCA AGAACCTGTC CAACAAGTTC TACTTCACCT CGCGCGTGGA CCTGCTGGCG
GGCTTCGGCT TCGACTACAA CCACATCGGC AATCCGCGCA CTTACGGCGT GACAGTGGGC
GCGAAGTTCT GA
 
Protein sequence
MRERLYRGIA LAGVAAASIF AAAPVLAQEV QSDRAVPDGE IVVTATKRAE SLQSVPISVS 
AIGGDTLAKA RVGSVDSLVT KVANLQLTSI VGDNTPIFAL RGVSMSDYSL NQSSPVATYY
DEVYKGNFAF LGVTMFDLER VEVLRGPQGT LYGKNTTGGA VNIIANSAKL GETSGYFSAG
YGNYDRFDLN GAVNVPLGEK AALRIAGTYA RADGWFKNVV PGKPDLASTD EYAIRGTLNF
EASDTVRFVL RASTSYQNPQ NYGIYAQPED VNRPGLDRWE IASNIATKRK ARTYSVALTS
NFDVSDTLTV TSITSWDKGN LFFYEDTDGT ASQLLEIPYT DRATQFAQDL RLTSDTGGPF
DFILGAYFNR EKVYNETAFE IGKDIDLTGD NIVTADDCVE GLTNEDGSDD GIACLFRNRF
DQVKKSYAIY SDLKYQVTDA VTLRGGLRYT HDTGRQSGFR SDALGVDGSE VANLIPLSSL
SYSQDNLSGK IGLDYKLADG NLLYASVSRG YRAPSFNAQA FFDPSELSVA KPEQVTSYEV
GAKTQFLDRR ITLNVAGFYY DYRNQQFINV DPVLGSQTLL NIPKSRIYGG EAELTIRASD
RLTLHSGMGV LATKIQRGSV SGVDVSGNRL SNAPTFTFNA TIDLTLVDGD MGKLSVHPDV
AYQSSQFFEV LNIPRLRQTS YALVGGHIDW ESADGRFNAS VWGKNLSNKF YFTSRVDLLA
GFGFDYNHIG NPRTYGVTVG AKF
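
The gene and protein sequences above are printed in blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips the block spacing, translates the DNA codon by codon, and computes a GC fraction that can be compared with the listed GC content. It is a sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference is not handled here).

```python
BASES = "TCAG"
# Standard amino-acid assignments in NCBI ordering (first base T, C, A, G;
# then second base; then third base). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGAGGGAAC GTCTTTACCG TGGAATCGCG CTGGCAGGGG TAGCCGCCGC ATCCATTTTC"
print(translate(first_row))  # MRERLYRGIALAGVAAASIF
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison for the whole record.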