Gene GM21_0065 details

Gene Information

Locus tag: GM21_0065
Symbol:
ID: 8135364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 81700
End bp: 82704
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 64%
IMG OID: 644867682
Product: thiamine biosynthesis protein
Protein accession: YP_003019910
Protein GI: 253698721
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain
TIGRFAM ID:
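
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions and the gene length is taken to include the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(81700, 82704)   # coordinates from the record
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the coordinates listed here, this gives 1005 bp and 334 aa, matching the Gene Length and Protein Length fields.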


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.96842e-23
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGAGAA AAGCCATAGC CCTTCTGTCG GGCGGACTCG ATTCCACCCT CGCGGTCAAG 
GTCCTCCTCG ACCAGGGGAT CGCCGTCGAG GCCCTCAACT TCACCTCCCC TTTTTGCACC
TGCACCGGGA AAAACGCCGG CTGCAAGTCG GAGGCGGTCC GCGTGGCGGA AGATTTCAAG
ATCCCCATCA AGGTGATGCA CAAGGGGGCG GACTACCTCG AGGTGGTCAG AAACCCCAAG
CACGGCCACG GCAAGGGGAT GAACCCCTGC ATCGACTGCC GCATCTTCCT TCTCAAAAAG
GCCAAGGAGT ACATGCTGGA ATCCGGCGCC GATTTCGTCT TCACCGGGGA GGTCCTGGGA
CAGCGCCCCA TGAGCCAGCG CCGCGACACC CTGCGCATCA TCGAGAAGGA GAGCGGCCTT
GAGGGGCTCC TTTTGCGCCC CCTCTCGGCT AAGCACTTCC AGCCCACCAT CCCGGAGCAG
GAAGGGTGGG TCGACCGCGA GAAGCTCCTC TCCATCCAGG GGAGGTCCCG GAAGGAGCAG
TTCGAGCTCG CGGCCGAGTT GGACGTGAAG AACTACCCCT GCCCCGCCGG CGGCTGCCTT
TTGACCGAGC TCTCCTTCGT CGGCAAGATT CGCGACGTCT TCGACCACTC GGACGAACTC
AACATGAGGG ACTTCCGGCT CCTCAAGCTC GGGCGGCATT TCAGGATCGG ACCCCGGACC
AAGGTTATCC TCGGCCGCAA CGAGGGGGAG AACGAACTCC TGGAGCGGGC CGTCCAGCCC
GGGGAGGCAA CGCTTCGCTG GGTCGAGGGA ATGAGCCCGC TCGCCGCGGT CATGGGGGAA
ACCACCGATC ACCTTTTGGA AAAGGCGGGG CAGATACTTT TGCGCTACAC CAAGGCGGAG
CCGGGCTCCC CGGCCACCCT GAGCGTTTTG CGCGACGGCG GCGAAACGGA GCTTAAGACG
GTGAACGCTC TCGACGAGGC GGCCGTGGAG GCGCTCAGGC TCTAG
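
A GC content figure like the one in the Gene Information block could be computed from a sequence listing in this format along the following lines. This is a sketch only: the function name is hypothetical, and it assumes the percentage is taken over the gene sequence itself, which the record does not state.

```python
def gc_content(seq):
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence above, used as a small example.
example = "ATGCAGAGAA AAGCCATAGC"
print(round(gc_content(example), 1))
```

Pasting the full gene sequence into `gc_content` would give the value for the whole gene.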
 
Protein sequence
MQRKAIALLS GGLDSTLAVK VLLDQGIAVE ALNFTSPFCT CTGKNAGCKS EAVRVAEDFK 
IPIKVMHKGA DYLEVVRNPK HGHGKGMNPC IDCRIFLLKK AKEYMLESGA DFVFTGEVLG
QRPMSQRRDT LRIIEKESGL EGLLLRPLSA KHFQPTIPEQ EGWVDREKLL SIQGRSRKEQ
FELAAELDVK NYPCPAGGCL LTELSFVGKI RDVFDHSDEL NMRDFRLLKL GRHFRIGPRT
KVILGRNEGE NELLERAVQP GEATLRWVEG MSPLAAVMGE TTDHLLEKAG QILLRYTKAE
PGSPATLSVL RDGGETELKT VNALDEAAVE ALRL
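
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal Python sketch of that step follows; the names are hypothetical, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here).

```python
from itertools import product

# Codon assignments of NCBI translation table 11, bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq):
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCAGAGAA AAGCCATAGC CCTTCTGTCG"))
```

For the first 30 bases this yields MQRKAIALLS, the first ten residues of the protein sequence above.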