Gene Gdia_0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0334 
Symbol 
ID6973728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp375652 
End bp376713 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID643389866 
ProductKpsF/GutQ family protein 
Protein accessionYP_002274745 
Protein GI209542516 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.139935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.203565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAC CCGCCTTCCA TGACGATGCC GGCGCACCCG CGGACGAGGC CCGGATCCAG 
TCCGACATCA CCGCCGCGTG CCGCGTTCTG GCGCGCGAGA GCGACGGACT GGCCCGGCTG
GCGGCCGCCC TGCGCGCCCC GGCGGTATCC GACGAGCCGG AAGGCATCAC GCCGCTGGGC
ACGGCCTTCG CACGGGCGGT CGATGCGTTT TCGACGCTGG CGGGCCGGGT CATCGTGACG
GGTATCGGCA AGTCCGGCCA TGTCGGGCGC AAGATCCAGT CCACCCTGGC ATCGACCGGA
ACGCCCTCGG TCTTCGTGCA TCCGTCCGAA GCGTCGCACG GCGACCTGGG CATGATCCAG
CGGGGCGATG CGGTGCTGGC CCTGTCCAAT TCCGGCGAAA CGGCGGAACT GGCCGACATC
GTGGCGCACG CCCGCCGCTA CGGCCTGCTG CTGGCCGCCA TTACCGCCGC ACCCGATTCG
ACCCTGGCCC GGGCCGCCGA CATCGCCCTG ATCGTGCCGA AGGCGCCGGA GGCCTGCCCG
ATGGGCCTGG CCCCCACCAC GTCGACGACC ATGCAGATGG CGCTGGGCGA CGCGCTGGCC
GTTGTGCTGC TGGAGCGGCG AAATTTCTCC GCCACCGATT TCGGCGTCTT CCATCCCGGC
GGCCGGCTGG GCACGCGGCT GCGCCGCGTC AGCGACCTGA TGCATCGGGG CGCCGCCATG
CCGCTGGGCA CGCCGGACAT CGCGATGCGC CAGGTCATCA TGGAAATGAC GCGCAAGGCC
TTCGGCTGCA TCGGTGTCGT CTCCCCGGAC GGGCGGCTGC GCGGGCTGAT CACCGACGGC
GACCTGCGCC GGGCGCTGGA CCGCGACCTG GACAGCACGC TGGCCGCCGA CATCATGAAC
CCCACCCCGC TGACCACCGG CCCCGACGTG CTGGCGGCCG AGGCCCTGCG CCTGATGAAC
GCGCGGGCGC GGCCGATCAC CAGCCTGTTC GTGGTGGACG CGGCCGGGCT CCCGGTCGGC
ATCCTGCATA TCCACGACCT TCTGCGGGCG GGCGTCGCAT GA
 
Protein sequence
MTQPAFHDDA GAPADEARIQ SDITAACRVL ARESDGLARL AAALRAPAVS DEPEGITPLG 
TAFARAVDAF STLAGRVIVT GIGKSGHVGR KIQSTLASTG TPSVFVHPSE ASHGDLGMIQ
RGDAVLALSN SGETAELADI VAHARRYGLL LAAITAAPDS TLARAADIAL IVPKAPEACP
MGLAPTTSTT MQMALGDALA VVLLERRNFS ATDFGVFHPG GRLGTRLRRV SDLMHRGAAM
PLGTPDIAMR QVIMEMTRKA FGCIGVVSPD GRLRGLITDG DLRRALDRDL DSTLAADIMN
PTPLTTGPDV LAAEALRLMN ARARPITSLF VVDAAGLPVG ILHIHDLLRA GVA