Gene Rfer_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_1110 
Symbol 
ID3964348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp1189895 
End bp1191130 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content60% 
IMG OID637915931 
Productextracellular solute-binding protein 
Protein accessionYP_522382 
Protein GI89899911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0139598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGTTT TAAGAAAAAT GGCGACGGCA CTGGCCGCTG GCCTGGGTGT TTGTGTTGCC 
GCGCACGCGG TCGGCCCTAA AGCCGAGGTG ATTCACTGGT GGACCTCCGG CGGCGAATCC
GCTGCGGTGA AAGTGTTTTC CGACGCCTAC ACCACCACCG GCGGTGTCTG GGTCGATACG
GCCGTGGCAC TCGGTGAGCA GGCGCGCTCC GTGGCCATCA ACCGTATCGT CGGGGGTAAC
CCACCGACAA TGGCGCAGTT CAATACTTCC AAGCAGTTCT TGGACGTCGT CGAGCAGGGC
ATGCTCAACA ACGTGGACGA GGTCGCCATC CGCGACGGGT GGGACAAATT CCTACCCGAA
ACCGTGCTCA ATGTGGTGAA AGTCAAAGGG CACTACTACG CCGCGCCCGT GAACATCCAC
ATGCCGACCT GGTTCTGGTA TTCCAAGGCC GCGTTCAAAA AGGCGGGCAT CGCGGCCGAG
CCCAAAAATA TGGACGAGTT GTTCGCGGCG CTGGACAAGC TCCAGGCCGC CGGGCTGATT
CCCCTGGCGC ACGGCGGCCA ACCGTGGCAG GAGAACACTA TTTTTACCGC CGTGCTGGCG
AACGTCGGCG GCACCGAGCT CTACCTGAAA GTGATGCGTG ACCGCGATGC GAAAGCGATT
CAGGGCGAGG CTTTCAAGAA TGTGTTGCTG ACCTACAAGC GACTTCAATC CTACGTGGAC
AAGGGTTCTC CGGGGCGCAA CTGGAACGAC GCCACCGCGC TGCTGATCAC GGGCAAGGCC
GGTGTGCAAA TCATGGGCGA CTGGGCCAAG GGCGAGTTCA CGGCGGCAAG GCAGGTGCCG
GGCAAGGACT ATGGCTGCAT TGCGGGTTTT GGGTCCAAGT CACCCTACAT CATTCAGGGC
GACGTGTTCG TCTTTCCCAA GACCACGAAT CCTGAGCAGA TCAAGGCGCA AAAGGTGCTG
GCCGGCCTCA TGCTGGCACC CGCCGTGCAG GTGGAGTTCA GCAACAAAAA AGGTAGCATT
CCCGTGCGCA CCGACGTGGA CTCGTCCAAG ATGGACATCT GTGCCCAGCA GGGACTGGCC
ATCATGAAAG ATAAATCGCG CCAGTTGGGC AACGGCGAGA TCTACCTCAA CCCCGATCAG
AATGGCGCGC TGGCTGACAT CCTGACCGCC TACTGGAACA AGAGCGTTCC CGTCGAAAAG
GTCCAGAAGG ACATTGCAAA CGCGTTGAAG AACTGA
 
Protein sequence
MVVLRKMATA LAAGLGVCVA AHAVGPKAEV IHWWTSGGES AAVKVFSDAY TTTGGVWVDT 
AVALGEQARS VAINRIVGGN PPTMAQFNTS KQFLDVVEQG MLNNVDEVAI RDGWDKFLPE
TVLNVVKVKG HYYAAPVNIH MPTWFWYSKA AFKKAGIAAE PKNMDELFAA LDKLQAAGLI
PLAHGGQPWQ ENTIFTAVLA NVGGTELYLK VMRDRDAKAI QGEAFKNVLL TYKRLQSYVD
KGSPGRNWND ATALLITGKA GVQIMGDWAK GEFTAARQVP GKDYGCIAGF GSKSPYIIQG
DVFVFPKTTN PEQIKAQKVL AGLMLAPAVQ VEFSNKKGSI PVRTDVDSSK MDICAQQGLA
IMKDKSRQLG NGEIYLNPDQ NGALADILTA YWNKSVPVEK VQKDIANALK N