Gene Gdia_1291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1291 
Symbol 
ID6974696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1438123 
End bp1439301 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content70% 
IMG OID643390820 
Producthopanoid biosynthesis associated glycosyl transferase protein HpnI 
Protein accessionYP_002275688 
Protein GI209543459 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.522898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCT TGCACGCCCT TGCCTCACCC GCCGGTCTTG CCGCCCTTGT CGCGGCGGCC 
GGATGCGTGC AGGCGACGCT GGGGACCGCA CTGGTCGCGC GATTCCGCTG GCGCGAACGC
CATATTCCCC CGGACAGCAC GCTGCCGCCG GTCACGATCC TGAAGCCGCT GCATGGCGAC
GAGCCGCTGC TGGAGGAGGC GCTGGAGAGT TTCTGCACCC AGGATTATCC CTGCATGCAG
ATCGTCTTCG GCGTGCAGGA TGCCGGCGAT CCGGCGATCG CGATCGTCAG GCGCCTGCGG
GAACGCCATC CCGGCCTGGA CATCGCCCTG GTGGTCGATC CCGCCGTCCA TGGCGTCAAC
CGCAAGATCG GCAACCTGAT CAACATGCTG CCCCGGGCCC GGCACGACGT GCTGGTCATC
TCGGACTCCG ACATCCACGT GGCGCCGGAT TACCTGCGGC ATGTCGTCCA TGCGCTGGCG
CGGCCGGGCG TGGGCCTGGC CACCACGCTG TATGCCGGCC TGCCGGCGAC GCGCAGCCTG
CCGCGCCTGC TGGCGGCATG CCAGATCAAC CACAATTTCC TGCCCGGGGT CATGCTGTCG
CGCTATCTGG GGCGGCAGGA CTGCCTGGGC GCCACCATGG CCCTGCGGCG CGAGACGCTG
GATGCGGTGG GCGGCCTGGC GGCGCTGGCA CCGCATATCG CCGACGACGC CATGCTGGGC
CGCCTGGTGC GCGAGCACGG GCTGCACATC GCCATCGCCC CCTGCATGAC CTGGACCACG
GTGGGCGAGC CGACCTTCAA CGACGTCATG CTTCACGAAC TGCGCTGGGG CCGGACGGTC
AAGACGCTGG AGCCGGCGGG CTATGCCGCG TCCGCCATCC AGCTTCCGCT GTTCTGGGCG
GCGGCTGCCG TCCTGCTGCA ACCGCATGCG CGCTGGACCT GGGCGGTCTT CCTGACCGTC
TGGGCGGTGC GGGCGGCCAG CGCCTTCCTG ATGGACCGGC TGCTGGCGCA GCGCAGCCTG
GTCCCGCTTC TGCTGCTGCC GCTACGCGAC TGGCTTTCCG CCGCGATCAT GGTCGGCAGC
GTCAGCGGCA CGCGTGTCGC GTGGCGTGGA CAGACGATGC ATATCGCGCC ACATTCGGTT
ATGACGCCTC CATCCCATCC TGTCGTGCCG GGCGACTGA
 
Protein sequence
MFALHALASP AGLAALVAAA GCVQATLGTA LVARFRWRER HIPPDSTLPP VTILKPLHGD 
EPLLEEALES FCTQDYPCMQ IVFGVQDAGD PAIAIVRRLR ERHPGLDIAL VVDPAVHGVN
RKIGNLINML PRARHDVLVI SDSDIHVAPD YLRHVVHALA RPGVGLATTL YAGLPATRSL
PRLLAACQIN HNFLPGVMLS RYLGRQDCLG ATMALRRETL DAVGGLAALA PHIADDAMLG
RLVREHGLHI AIAPCMTWTT VGEPTFNDVM LHELRWGRTV KTLEPAGYAA SAIQLPLFWA
AAAVLLQPHA RWTWAVFLTV WAVRAASAFL MDRLLAQRSL VPLLLLPLRD WLSAAIMVGS
VSGTRVAWRG QTMHIAPHSV MTPPSHPVVP GD