Gene GM21_2204 details


Gene Information

Locus tag: GM21_2204
Symbol:
ID: 8137540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2575293
End bp: 2576285
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 60%
IMG OID: 644869819
Product: protein of unknown function DUF6 transmembrane
Protein accession: YP_003022014
Protein GI: 253700825
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
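The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that does not appear in the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2575293
end_bp = 2576285

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 993 0 330
```

Under those assumptions the result agrees with the listed gene length of 993 bp and protein length of 330 aa.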


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0000000000373443
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCGCCGG TCTTCTGCAA GATGCTCATC GGGGACATGT CGCCGGCTCT GCTGGCGGGG 
CTCCTCTATC TCGGCTCGGG CCTCGGCTTG CAACTCGTCC TCTTCTTCCA GCGCAAAAAC
TCCCTCCACG AACTGGCTCA TCTCTCGCCG CGCCACCGGC TCAAGCTGAT CGGCGCCATC
ATCTCTGGCG GCATCATAGC GCCTCTATGC CTCGCTTTCG GCATCAAGTA CGGCACGGCT
TCGGAGGTCT CGCTGCTGCT CAACCTGGAA ACGGTGGCGA CGACCATAAT CGCCTGGCTC
GTTTTCAAGG AGTACATCGG CCCCTATGTC TGGACCGGTA AGGTACTCAT ACTCATAGGC
GCCGGCCTGG TGGTGCTGAA AGCTGAGGGG GGTATGTCCT TCTCCACCTC CGGGCTCCTC
GTTATCTGCG CCTGCATCTT CTGGGGCATC GACAACAATC TGACCCGGGA CGTGGAGGAG
CTTTCATCCA CGGTGCTCGC CTCGGTGAAA GGTTTCGCCG CCGGTCTCTT CTCCATCTTT
TTGGCGCTCG CTTTTACCTC TGGATTGGCA ACTCCCTCGC AAATCTCCGG GGCCTTGGCT
ATCGGCGCTC TTAGTTACGG ATTGAGTCTG GTCCTCTTTG TCGAGGCCCT GCGGAAGATC
GGTGCGGCGA GAACCGCCAC TTTCTTCGCC GTAGGTCCTT TCTTCGGCAC GCTCCTCTCC
GTGGCGCTTC TGGGTGAGCG CCCCCCTGCT GCCTACTGGA TCGCCACGGT GCTGATGCTC
GCGGGGATCG CCCTTTTGTA CCTGGAACTG CACCGGCACA GCCACGCGCA TGAGGAAATG
GCTCATGCCC ACCCTCACAT CCACGACGAG CACCACAATC ACGAGCATCC GGAAGGGGAG
GTTGATCTCT CTCACGACCA TTACCATGTC CATCGTCCCA TGAGCCACTC GCACGTCCAC
TGGCCGGACA TTCACCACCA GCATCCTCAC TGA
 
Protein sequence
MSPVFCKMLI GDMSPALLAG LLYLGSGLGL QLVLFFQRKN SLHELAHLSP RHRLKLIGAI 
ISGGIIAPLC LAFGIKYGTA SEVSLLLNLE TVATTIIAWL VFKEYIGPYV WTGKVLILIG
AGLVVLKAEG GMSFSTSGLL VICACIFWGI DNNLTRDVEE LSSTVLASVK GFAAGLFSIF
LALAFTSGLA TPSQISGALA IGALSYGLSL VLFVEALRKI GAARTATFFA VGPFFGTLLS
VALLGERPPA AYWIATVLML AGIALLYLEL HRHSHAHEEM AHAHPHIHDE HHNHEHPEGE
VDLSHDHYHV HRPMSHSHVH WPDIHHQHPH
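
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first line of the gene sequence. The function name translate_cds and the constant names are hypothetical helpers written for this example, and the codon table is built from the standard amino acid ordering that table 11 shares with the standard code.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks.

    Hypothetical helper: the first codon is read as M if it is a
    permitted start codon, and translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGTCGCCGG TCTTCTGCAA GATGCTCATC GGGGACATGT CGCCGGCTCT GCTGGCGGGG"
print(translate_cds(first_line))  # MSPVFCKMLIGDMSPALLAG
```

The output matches the first 20 residues of the protein sequence, MSPVFCKMLI GDMSPALLAG.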