Gene GM21_0888 details

Gene Information

Locus tag: GM21_0888
Symbol:
ID: 8136209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1060356
End bp: 1061516
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 65%
IMG OID: 644868504
Product: hopanoid biosynthesis associated glycosyl transferase protein HpnI
Protein accession: YP_003020713
Protein GI: 253699524
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI
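
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1060356
END_BP = 1061516
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1161
print(expected_protein_length(GENE_LENGTH_BP))  # 386
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.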


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 3.0492799999999998e-21
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGAAGG AGCTGCTTCC ATTCCTGGTT ACCGCACCGG CTCTCGGCTA TGCCGCTTTC 
ACGCTCTACT GCGGCCGCAG CTTCTTCGCG GGGCAAAGGC CCCTGCCCGA GCACACCCCT
CCCGTCTCCA TCCTGAAGCC GGTCAAAGGG GTGGACGGCG ACAGCTTCGA AAACTTCGCC
TCGTTCTGCA GACAGGAGTA CCCAACGTTT CAGATCGTCT TCGCCGCGGC CTCTCCCGCC
GATCCCGTCA TCCCCATCAT CGAGCGCCTC ATCGCCGCCT TCCCGCAGGT CGACATCTCG
CTGGTCGTTG ACGGCGCCGT CCATGGCGCG AACTACAAGG TGTGCAACCT GATGCATGCC
TGCGCGAAGG CCAAGTATCC CCTGCTCATC GTCTGCGACA GCGACATCCG GGTCGACAGC
CAATATCTGC GCCGGGTCTG CGCGCCTTTC GCCGACCCCC AAGTGGGGCT CGTGACATCG
CTTTACCGCA GCTCCAGCGT GAAAGGGGTC GGCTGCGCCA TAGAGGCGCT CGGCTTTTGC
AGCGAGATGG TCCCGAACGT CATGGCCGCG GTGAAACTGG AGGGTTTGAG CTTCGCTCTG
GGCGCCTCGA TGGCGCTGCG CCGGGAGGCG CTGGAGCGGA TAGGGGGCTT CGAGGCCCTG
GTGGACTACC TGGCCGACGA CTACCAACTG GGGAACATGA TCCACAACAA CGGGTTCCGC
CTGGAACTCT CGCCGCACTT CGTGGAGAGC GTCATGCGCG GCGACGAGAC GGTATCGGAG
GTGATGGCGC GGCAGCTTCG CTGGGGGAGG ACCATGCGGG TTTCCCGCCC CGGCGGTTAT
CTCGCCTCGG GGATAACGCT CCCCTTCCCG GCGGCCCTGC TGGCGCTGCT TATCTCCGGC
TTCACCGCGG CGGGTTGGCT GGCCGCGGCG CTGCTCTACC TGGTGCGTTT TGCCGTGTCA
CTCGCTTACA GCCAACTGTT GGTGCGGGAC CGGCTGTTGC CGCGCTGGCT TTGGCTCCTG
CCGCTGCGCG ACGCGCTCGC TTTCGCGGTA TGGGCGCTTT CGCTCCTGGG AAACCGGGTG
CGTTGGCGCG GAGAGCTGTT CCAACTGGAC AACGGGGGGA AGATCCGGTC GATAGGGAAA
AGGGGACAGG CTGGTTTTTG A
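
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "ATGCTGAAGG AGCTGCTTCC ATTCCTGGTT ACCGCACCGG CTCTCGGCTA TGCCGCTTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the complete gene sequence through the same two functions should give the 1161 bp length and a value comparable to the GC content listed for this gene.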
 
Protein sequence
MLKELLPFLV TAPALGYAAF TLYCGRSFFA GQRPLPEHTP PVSILKPVKG VDGDSFENFA 
SFCRQEYPTF QIVFAAASPA DPVIPIIERL IAAFPQVDIS LVVDGAVHGA NYKVCNLMHA
CAKAKYPLLI VCDSDIRVDS QYLRRVCAPF ADPQVGLVTS LYRSSSVKGV GCAIEALGFC
SEMVPNVMAA VKLEGLSFAL GASMALRREA LERIGGFEAL VDYLADDYQL GNMIHNNGFR
LELSPHFVES VMRGDETVSE VMARQLRWGR TMRVSRPGGY LASGITLPFP AALLALLISG
FTAAGWLAAA LLYLVRFAVS LAYSQLLVRD RLLPRWLWLL PLRDALAFAV WALSLLGNRV
RWRGELFQLD NGGKIRSIGK RGQAGF
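
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the compact NCBI-style amino acid string for the standard code. Translation table 11, listed for this gene, uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons. The `translate` helper is illustrative: it does not handle ambiguous bases and does not convert alternative start codons to methionine.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids listed in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First five codons and last seven codons of the gene sequence in this record.
print(translate("ATGCTGAAGG AGCTG"))         # MLKEL
print(translate("AGGGGACAGG CTGGTTTTTG A"))  # RGQAGF*
```

The two fragments reproduce the first five and last six residues of the protein sequence, followed by the TGA stop codon.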