Gene Rfer_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_1097 
Symbol 
ID3963481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp1174624 
End bp1175874 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content56% 
IMG OID637915918 
Productextracellular solute-binding protein 
Protein accessionYP_522369 
Protein GI89899898 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000486653 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAAC TTTCCAAATT GTCTGCAGCC ATTGCATTTG TAGTGGTTGG CGCATCGGCC 
CTGGCCGGTG AAGTTGAAGT GCTGCATTGG TGGACATCGG GCGGCGAAGC CAAGTCGGTC
AGCGAATTGA AAAAAATCAT GCAGAGCAAG GGCCACACCT GGAAAGACTT TGCGGTGGCC
GGCGGCGGCG GTGACAACGC CATGACCGTG CTCAAAAGCC GTGTGGTGGC CGGCAACCCG
CCCGCTGCAG CCCAGATCAA AGGCCCTGCG CTTCAAGAAT GGGCCTCTGA AGGGGTGCTG
GCGAATCTGG ATGCCACTGC AAAAGCTGAA AACTGGGACA GCCTGCTGCC CAAGGCGATA
GCAGACGGCA TGAAATACAA AGGGAACTAC ATCGCTGTTC CGGTCAATGT GCACCGCGTC
AACTGGCTGT GGGCCAATGC GGCCGTGTTG AAAAAATCTG GCGTGGCGGG CATGCCCAAA
ACCTGGACTG AATTTTTTGC CGCTACCGAC AAGATCAAGA AGGCGGGATT TATCCCGGTT
GCAACGGGTG GCAATGCCTG GAATGACCTC ACCAACTTTG AGCCTGTGGC GCTCGGTGTC
GGCGGTGTCA AGTTTTACAA CGATGCGTTT GTCAAACTTG ACCCTAAAGC ACTGAACAGC
GATGCCATGA AGAAATCGCT GGAGACCTTT CGCAAGCTCA AAGGCTACAC CGATGCCGGT
GCCGTGGGTC GCGACTGGAA TATCGCCACG GCGATGGTGA TCCAGGAGAA AGCCGGCTTC
CAACTCATGG GCGATTGGGC CAAGGGCGAA TTTGTCGCTG CCGGCAAGGT GCCGGGCAAA
GATTTCCTGT GTGCCGCAGC ACCTGGCAAT GCAGGCACCT ACACCTTTAA CGTGGATTCA
TTCGCCATGT TCAAGCTCAA GGATGCGGCA GCCCAAAAGG CGCAGGCAGA CCTTGCCGTT
GCCATCATGG GCCCTGAATT CCAGGAAGTA TTCAACCTGA ACAAAGGCTC CATCCCGGTT
CGCTTGAACA TGAACATGGA CAAGTTTGAC GAATGCGCCA AGCTGTCGGC CAAGGAATTT
GTTGACACCG CCAAGTCGGG TGGCCTGGTT CCTTCCGTTT CCCAGGATAT GGCGCTCAAG
CCCGCCGCGA CGGGTGCCCT GAAAGATGTC GTCAGCCAGT TCTGGAACGA TGACAAGATG
ACGCCGGAAA CGGCCATGAA GAATATGGTC AAGGCGGCTA CGACCAAATA G
 
Protein sequence
MLKLSKLSAA IAFVVVGASA LAGEVEVLHW WTSGGEAKSV SELKKIMQSK GHTWKDFAVA 
GGGGDNAMTV LKSRVVAGNP PAAAQIKGPA LQEWASEGVL ANLDATAKAE NWDSLLPKAI
ADGMKYKGNY IAVPVNVHRV NWLWANAAVL KKSGVAGMPK TWTEFFAATD KIKKAGFIPV
ATGGNAWNDL TNFEPVALGV GGVKFYNDAF VKLDPKALNS DAMKKSLETF RKLKGYTDAG
AVGRDWNIAT AMVIQEKAGF QLMGDWAKGE FVAAGKVPGK DFLCAAAPGN AGTYTFNVDS
FAMFKLKDAA AQKAQADLAV AIMGPEFQEV FNLNKGSIPV RLNMNMDKFD ECAKLSAKEF
VDTAKSGGLV PSVSQDMALK PAATGALKDV VSQFWNDDKM TPETAMKNMV KAATTK