Gene Gdia_0624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0624 
Symbol 
ID6974021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp699467 
End bp700606 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content70% 
IMG OID643390155 
Productglycosyl transferase group 1 
Protein accessionYP_002275031 
Protein GI209542802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.225322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TCGAAATCAC CAATGTCGAT TTTTCGCTGC GGCACTTCCT GCTGCCGCTG 
ATGCGGGGCC TGCGCGCCGA TGGGCACGAG GTCGTCGGCG TCTGCGCGGA CGGTCCCCTG
CTGGCCGATG TGCGCGGCGA GGGATTCCGC GTCGAGACGG TGCCGCTGGT CCGATCCTTC
TCGCCCCTGG CGCAGATGCA GGCGCTGATC GCGCTGGTCC GGCTGATCCG GGAGGAAAAG
CCGGACATCG TCCACGCCCA CATGCCGATC AGCGGCCTGC TGGGCCGCCT GGCGGCGTGG
CTGTGCCGCG TGCCGTGCGT GGCCTATACC TGCCATGGCT TCCTGTTCAA CCAGCCGGGG
CCCGCCCCAC GGCGCGGTCT GGCGCTGGTG CTGGAATGGC TGGCCGGGCG GATCACCGAC
CGGTATTTCA CCGTATCGGT GCAGGAGGCC GAGGACGCCC GGCGCCTGAA GATCCACCCG
GCGCCGCTGG CGGTGGGCAA TGGGCGCAAC CCCTCCCTCT TCCAGCCCGA TCCCGAGGCA
CGGCGGCGGA TTCGCGCCGA ACTGGGGGTG GCGGAAGGGG CGGTGGTCAT CATCGCCGTG
TCACGGCTGG TGCGGCACAA GGGCTATCCG GAACTGCTGA AGGCGATGGA GCAGGTGTCC
GGCGCGATGC TGTGGGTGGT GGGCGAACGC CTGGAGTCCG ACCATGGAGA ATCGCTCGAT
TCGTGCTTCG AGGAGGCGCA GCGGGTACTT GGCGCGCGGC TGCGGTGCCT GGGCTATCGC
GAGGACGTTC CGGCCCTGCT GGCGGCGGCG GATATCTTCA CCCTGCCCAG CCATTTCGAG
GGACTGCCGA TGTCGGTGAT CGAGGCGATG CTGACCGGCC TGCCGGTGGT GGCCAGCGAT
ATTCGCGGCC CGCGCGAACA GGTCGTGAAC GGCCGTACCG GGCTGCTGGT TCCCCCGGGC
GAGGCCGTGC CGCTGGCGCG CTCCCTCGGC TGCCTGGTCC GCGACCCGGA CCTGCGCTAT
CGGATGGGCG AGGTCGGGCG TGAGAGGGCC CGCGCCCGGT ATGACGAGGA CATCGTGGTC
GGCCGCACCA AGATGGCCCT GCTGGCACCC GGGACGACGC CGACCGACGA TGCCGGCTGA
 
Protein sequence
MKILEITNVD FSLRHFLLPL MRGLRADGHE VVGVCADGPL LADVRGEGFR VETVPLVRSF 
SPLAQMQALI ALVRLIREEK PDIVHAHMPI SGLLGRLAAW LCRVPCVAYT CHGFLFNQPG
PAPRRGLALV LEWLAGRITD RYFTVSVQEA EDARRLKIHP APLAVGNGRN PSLFQPDPEA
RRRIRAELGV AEGAVVIIAV SRLVRHKGYP ELLKAMEQVS GAMLWVVGER LESDHGESLD
SCFEEAQRVL GARLRCLGYR EDVPALLAAA DIFTLPSHFE GLPMSVIEAM LTGLPVVASD
IRGPREQVVN GRTGLLVPPG EAVPLARSLG CLVRDPDLRY RMGEVGRERA RARYDEDIVV
GRTKMALLAP GTTPTDDAG