Gene Rfer_1099 details

Gene Information

Locus tag: Rfer_1099
Symbol:
ID: 3963483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodoferax ferrireducens T118
Kingdom: Bacteria
Replicon accession: NC_007908
Strand:
Start bp: 1177505
End bp: 1178767
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 61%
IMG OID: 637915920
Product: extracellular solute-binding protein
Protein accession: YP_522371
Protein GI: 89899900
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
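
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the final codon is a stop codon:

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
start_bp = 1177505
end_bp = 1178767

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # codon count, including the terminal stop
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, "bp")              # compare with "Gene Length"
print(protein_length, "aa")           # compare with "Protein Length"
```

Under these assumptions the results agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).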


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000491601
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTTTA ATGCTGCCCG TACCGCAGGC AAAATTCTTT CCCTGACGGT GCTGCTCGTC 
AGCACCAGCC TGGCCCAAGC CCAGAGCCAG ATACTGTCCG TGACGGCTTA CCCGGCGGTG
GACGAGATCA TCAAGGCCGC GATGCCACAG TGGAAGAAGA CGCATCCCAA CGTCGAGATC
AAACTGGTCA GCCGTGCATT TGAGGATCAT CACACCGCCA TGACCACGGC CCTGTCCACC
TCAAGCAATC TGCCGGACGT GATGGCGCTG GAATTTGCCT ACGTGGGCCG CTTTGGCGCG
GGAGGTGGAC TGGAAGATCT GTCTCAGTCG CCTTACCGAA TCAAGGACAC GCAAATGCGT
TTTGTGCCCT TTGCTTTCAG GCAGGCCACC CTCAGTACTG GCGCGGTGGT GGCCGCGCCC
ACTGACATCG GCCCGGGCAC CTTGCTGTAC CGGACCGATC TGCTTAAAAA AGCCGGTGTC
AGCGAAGCGG AGCTGACGCA GTCCTGGGAC TCCTTTGTGG CGTCGGGCGT GAAGATCAAG
GCCACCACGG GCGCCTACCT AATGGCGCAC GCGCGCGATA TCAAGGACAT CCTGATCCGC
TCCAACGTCA AGCCGGGCGA TGGCCTGTAC TTTGATGCCG CCGGCAAGGT GGTGGTGGAT
TCGTCGCGCT TTGTGCGCGC GTTCGAACTG GCGCGCAGGG TGCGCCAGCA ACAGCTCGAC
GGCAAGATCA GCGGCTGGTC GACCGCATGG TCTCAGGGCT TCAAGAACGG CAACATCGCC
ACGCAGATGT CGGGTGCCTG GCTGGCCGGG CAAATGGCAA GCTGGATTGC ACCCACCACA
AAGGGTCTCT GGCGTGCCTC GCAGCTGCCC GAAAAAGCCT GGGGCGCTTG GGGCGGCACT
TTTTATGCGA TCCCGAAGGC GGCGAAGAAC AAGGCGCTGG CCTGGGAGTT CATCCAGTTC
ATGACGCTCA ACCGCGACGC ACAACTCAGC GCGTTCAAGG TGCAGGACGC TTTTCCGGCC
TTGCTCGAAG CGCACACTGA TCCGTTTTAC GACCAGCCGA TTGAATTCCT GGGCGGGCAG
AAAGCGCGTC TGCTGTGGCG CGAAGCGGCA CTGAAAATCA ACGCCATCGA CGTGAACAAG
CTGGACCCGA TTGCCGACGA AATCGTCAAC ACCGAGCTCG ACAAGGTGCT GGACCAGGGC
AAGGACATTC CCAAGGCGCT GGCTGATGCC AAGGCCTTGC TGGAGCGGCG TGCGCGCCGC
TAA
 
Protein sequence
MLFNAARTAG KILSLTVLLV STSLAQAQSQ ILSVTAYPAV DEIIKAAMPQ WKKTHPNVEI 
KLVSRAFEDH HTAMTTALST SSNLPDVMAL EFAYVGRFGA GGGLEDLSQS PYRIKDTQMR
FVPFAFRQAT LSTGAVVAAP TDIGPGTLLY RTDLLKKAGV SEAELTQSWD SFVASGVKIK
ATTGAYLMAH ARDIKDILIR SNVKPGDGLY FDAAGKVVVD SSRFVRAFEL ARRVRQQQLD
GKISGWSTAW SQGFKNGNIA TQMSGAWLAG QMASWIAPTT KGLWRASQLP EKAWGAWGGT
FYAIPKAAKN KALAWEFIQF MTLNRDAQLS AFKVQDAFPA LLEAHTDPFY DQPIEFLGGQ
KARLLWREAA LKINAIDVNK LDPIADEIVN TELDKVLDQG KDIPKALADA KALLERRARR
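
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a rough sketch, not the tool that produced this record. It assumes the sequence is pasted as whitespace-separated 10-base blocks, as shown, and it uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCTTTTTA ATGCTGCCCG TACCGCAGGC AAAATTCTTT CCCTGACGGT GCTGCTCGTC"
print(translate(first_line))  # MLFNAARTAGKILSLTVLLV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.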