Gene GM21_1226 details


Gene Information

Locus tag: GM21_1226
Symbol:
ID: 8136551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1432561
End bp: 1433670
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 65%
IMG OID: 644868840
Product: riboflavin biosynthesis protein RibD
Protein accession: YP_003021045
Protein GI: 253699856
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
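
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the gene length includes one stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1432561, 1433670)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Under these assumptions both values agree with the listed 1110 bp and 369 aa.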


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.078845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACC TACACCTTAA AATGATGCGC CTGGCGCTTT GCGAGGCCAG AAAGGGAGTC 
GGCAAGACGG CCCCCAACCC GGCCGTCGGC TGCGTCATCG TCCGGGACGG CGAAGTGGTC
GGGACCGGCT GGCACAAAAA GGCGGGGACC CCGCACGCCG AGGTGCATGC GCTTAAGGCC
GCCGGCGAGA AGGCGGCCGG CGCCGACGCC TACGTGACCC TTGAGCCCTG CTCCCATTTC
GGCAAGACCC CCCCCTGCGC GAAAGCGCTC ATCGAGGCGA AAGTGGCGCG CGTCTTCGTC
GCCATGGTCG ACCCCAACCC GCTTGTCTCC GGGAAGGGGA TCCAGATGCT CAAGGACGCG
GGGATAGCAG TCGAGGTGGG ACTCTTGGAA GAGGAGAGCC GTGAGCTGAA CCTTCCCTTC
ATCAAGTGGA TCCAGACCAG GCTCCCCTTC GTGGTGCTGA AGAGCGCGCT AACGCTGGAC
GGCAAGAGCG CCACGGCAAG CGGCGACTCC AAGTGGGTGA CCAGCGACCG GGCCCGGCGT
GAGGTGCACC GGCTGCGCGG CCGCCTGGAC GCCATCATGG TCGGCGTCGG TACCGTGGCG
AAGGACGATC CGCTTTTGAC CTGCAGGGTC CCCGGCGGCA AAGATCCGCT GCGGGTGATA
GTCGACTCGA CCCTCAGGAT ACCGCTGCAC GCCGCGGTCC TCGGAGTGCC TTCCAAAGCT
CAGACGATCA TCGCCACCTG TAGCGGCGAC GAGGCAAAGA TGCAAGCGCT CAAGGCGCAC
GGCGTCGAGA TCCTCACTTG CTGCGAGAGC GACGGGCGGG TCGACCTTGC CGATCTCTTC
GTAAAGTTGG GTGCGCGCGG CGTGCAGTCC GTGCTGCTCG AAGGCGGAAG TCACCTGGCA
GGGGCAGCTC TTCGTGCCGG GCTCATCGAC AAATGCATGA TCTTTCTGGC GCCGAAGCTC
GTGGGAGGAG CAGGCATGGG GCTCTTCGCC GGCGAGGGGG CGACGCTGAT GGCAGACGCC
ATACGGTTGG AGCAGATGAG GGTAAAGCGA GTGGGAGTCG ACCTCCTGGT GGAGGGAGTC
CCCGCAAAAA CAAAAAGTCA GAACCACTAA
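
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used as a short demonstration.
demo = clean_sequence("ATGTCCGACC TACACCTTAA")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.45
```

Applied to the full sequence, the same functions should give a length that matches the listed gene length and a GC fraction close to the listed GC content.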
 
Protein sequence
MSDLHLKMMR LALCEARKGV GKTAPNPAVG CVIVRDGEVV GTGWHKKAGT PHAEVHALKA 
AGEKAAGADA YVTLEPCSHF GKTPPCAKAL IEAKVARVFV AMVDPNPLVS GKGIQMLKDA
GIAVEVGLLE EESRELNLPF IKWIQTRLPF VVLKSALTLD GKSATASGDS KWVTSDRARR
EVHRLRGRLD AIMVGVGTVA KDDPLLTCRV PGGKDPLRVI VDSTLRIPLH AAVLGVPSKA
QTIIATCSGD EAKMQALKAH GVEILTCCES DGRVDLADLF VKLGARGVQS VLLEGGSHLA
GAALRAGLID KCMIFLAPKL VGGAGMGLFA GEGATLMADA IRLEQMRVKR VGVDLLVEGV
PAKTKSQNH
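
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation and starts at the first base of the start codon. It builds the codon table by hand rather than using a library. The sense-codon assignments of translation table 11 are the same as in the standard code; the alternative start codons that table 11 permits are not handled here, which does not matter for a gene beginning with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence listed above.
print(translate("ATGTCCGACC TACACCTTAA AATGATGCGC"))  # MSDLHLKMMR
print(translate("CAGAACCACTAA"))                      # QNH
```

The two outputs match the first ten and last three residues of the protein sequence.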