Gene Gdia_0643 details


Gene Information

Locus tag: Gdia_0643
Symbol:
ID: 6974040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 730517
End bp: 731758
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 66%
IMG OID: 643390174
Product: glycosyl transferase group 1
Protein accession: YP_002275050
Protein GI: 209542821
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
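
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; both are assumptions about the database's conventions rather than something the record states. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(730517, 731758)
print(length, protein_length_aa(length))  # prints: 1242 413
```

Under those assumptions the result agrees with the listed gene length (1242 bp) and protein length (413 aa).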


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.984184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTCCA ACCCGATCCT GATGGATATT TCCCGGCTGC TGTCGCGTGC CGGCAGCGCG 
GTACCGACGG GAATCGATCG TGTCGAGCTG GAATACGCGC TGTATCTGTC ACGGAATTTC
CCGGCGCGGG TGACCTTCGT GGCCTATCAC CCGCTGGGGC GGATCGGGGT CCTGCCGCGC
CGTCCCACAA CCTGGTTCCT GCGGATGCTG GCGCGGGCCT GGACGAACGG GACGCGGCCC
TGGCGCGGGG CGTCACTGAT GGCCGGGCTG CTGCTGCAGG CGGCGGCGGC CGGCACGGTT
GGCGAGACCG CGCGGCGCCG GCTGTATCTG CTGCTGTCGC ATCATCACCT GATGAAGCGT
GACGTGATCG CCGGTTTCAT GGATCGGGCG AATGCCGGAT TGGCCGTCAT GGTCCACGAC
CTGATACCGA TCGACTATCC GGAATATGCC CGCCCGACCG AACCGGACCG CCATCGGCAG
CGTATGGACA CGGTGGGCCA ACTGGCGGAT TCCGTCATCG TGCCCTCGCA GGCCGTGGCG
GACTCCCTGC GGCACTACCT GGCGCCGGGT GGCCGCTGCC CGCCGATCGG CGTGGTGCAC
CATGGATGCC ATGTCGATCC AATCTCCACG ATCCCGGTCA GCGGCCTGGC GCCGGATGTC
CCGTATTTCG TCGTGCTGGG AACGATCGAA CCGCGAAAAA ATCATTTATT ATTGCTTAAT
ATCTGGCGGC ATATGGCAAC GGGACGTGAC CGGAGCCGGT TGCCGCATCT GGTCGTCATC
GGGCGCCGCG GCTGGGAAAA CGAAAATATA CTCGACATGA TGGAACGTTG CCCCGCCCTT
CAGGGGGTCG TGCACGAATA CGCGACATTG TCCGATGTGG TGGTCGCTGA TCTGGTCCGG
GGGGCGCGCG CCCTGCTGTT TCCCTCGTTC GCCGAAGGAT TCGGCCTGCC GTTGCTGGAG
GCCCTGTCCA TGGGCACCCC TTGCCTGTGC AGCGACCTGC CGGTCTTTCG CGAAATCGCC
GGAGATCTGC CCTGTTACCT GGACCCGCTG GATGGACCCG GCTGGCAGCG CAGGATCCTG
GACCTGGCCG GAGAGGACAC GCGCGAGAAC GGCGAGACGC CGGTCTTTGC CGACTGGCCG
GCCCAGGTGG CGGCCGGGTT GGCGACGATC GACGGCGCGG TGCGTGCCGA TGCGGCGGCG
GCCGTTCGTC CGATGAACGT GGAAGCGAGG ACGGCGTGCT GA
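
The GC content field can be recomputed from the gene sequence as printed here. This is a minimal sketch, assuming the sequence is pasted in as a string with the space-separated 10-base blocks and line breaks still in place; `gc_content` is an illustrative helper, not part of any database tool.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence between the quotes, blocks and all.
first_line = "ATGATGTCCA ACCCGATCCT GATGGATATT TCCCGGCTGC TGTCGCGTGC CGGCAGCGCG"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Running it on the complete sequence, rather than only the first line, gives the figure to compare with the GC content field.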
 
Protein sequence
MMSNPILMDI SRLLSRAGSA VPTGIDRVEL EYALYLSRNF PARVTFVAYH PLGRIGVLPR 
RPTTWFLRML ARAWTNGTRP WRGASLMAGL LLQAAAAGTV GETARRRLYL LLSHHHLMKR
DVIAGFMDRA NAGLAVMVHD LIPIDYPEYA RPTEPDRHRQ RMDTVGQLAD SVIVPSQAVA
DSLRHYLAPG GRCPPIGVVH HGCHVDPIST IPVSGLAPDV PYFVVLGTIE PRKNHLLLLN
IWRHMATGRD RSRLPHLVVI GRRGWENENI LDMMERCPAL QGVVHEYATL SDVVVADLVR
GARALLFPSF AEGFGLPLLE ALSMGTPCLC SDLPVFREIA GDLPCYLDPL DGPGWQRRIL
DLAGEDTREN GETPVFADWP AQVAAGLATI DGAVRADAAA AVRPMNVEAR TAC
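
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand instead of relying on a library. It uses the standard amino-acid assignments, which translation table 11 shares; table 11 differs from the standard code in its permitted start codons, which does not matter here because the gene begins with ATG. `translate` and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATGTCCA ACCCGATCCT GATGGATATT"))  # prints: MMSNPILMDI
```

The first 30 bases of the gene give MMSNPILMDI, matching the first block of the protein sequence; the final codons ACG GCG TGC TGA give TAC followed by a stop, matching its last three residues.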