Gene Gdia_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0644 
Symbol 
ID6974041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp731755 
End bp733437 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content67% 
IMG OID643390175 
ProductCapsule polysaccharide biosynthesis protein 
Protein accessionYP_002275051 
Protein GI209542822 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.91013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATAA CGGGTGATAC AACCGCCTTG TTGCGTCCTC CACCCTTTCA CCGGGTCATG 
CCGGTTCTGG CCGTGAAGCA GGATGGTGCT ACGTCGGAAC GGGTCTGCGG TTCCGGGCGG
GAACTGGACG ATCTGGCCGC GCGGATCGTG CGGCAGCGCG TCGGCGGATG TTTCTGGGGG
CGGCCTGCCG GAGCACGGCG GCCGGTCGTC GTATATGGCC CGGGATGGGG GGCCGGTTCC
CGTCCAATTC GTTCGATCCT GCGGAATGTC TTCGCCGCGC ATGCCGCGCA GGACGTGGTT
CTGGTCTGCG GGCGGCGCCA GGGCGCACTG GCCGCATGGG GCCGCCGCCA TGGGCTGGCG
GTAGTCCCGG CCGATACCGA TCCGCACGAT CTTCTTGACG GTGCGACCTG TGTCTATGCC
CACCCGACAA CCGATCTGGC CCGGCTGGCC GTTCTGCGCG GCCATGCGGT GAAGGGGATT
CCGGATGGGA TGAGCCTGGC CGGAATCATG GCCGACATTG CGAATCGCAC TGATTATGTC
GATCCGTTTA CCGGTGCGCC GATGGCATGC GCCGACGCGA TCACGCTTCT GGCGGACTGG
CGGCGGACGA TCATGGCCAA TCGCAGGATC GGCGCATGCC TTGGAATGTC GCGATGGAAG
CGGCGCAGGA TTGCCGATTT CCTGGCGACG GCGCCCGGCC ATCCACCCTT TCGCCTGGAC
ATCCGCGCCG GGCAGCGCGG CGCGGTGGCG GTGTGGGCAA CCCGGCAGCC ACCGGGACTG
GAAGATGCGG TGCGCCGGGC GGGTATTCCG TTGTGGCGGG TGGAGGACGG CTTCATCCGA
TCCATCGGCC TGGGCAGCGC GTTGACCCCG CCTTCGTCGA TTACGATCGA CACGCGCGGC
ATGTATTACG ACCCGGCGCA GGAGAGCGAC CTGGAACATA TTCTGGCCAC ATCCCCCTTT
GATGATGTGT TGCGCGCCCG CGCCAGGAAA TTGATGGATG CGTTGGTGCT GCTTGGTATT
TCCAAATATG GCCCCGGTCG GGCCTCTGCC CTGCCTGCAT ACTGGGACGC CGCGGGGCGG
CGCACGATCC TTGTGCCCGG CCAGGTCGCG GACGACCAGT CGGTGCGCCG TGGGGGCGGG
CGGATCGCAG GCAACCTGGA ACTGCTGCAG GCGGTCCGGG AGAGCAACCC GGACGCCTTC
ATCCTCTATC GGCCGCATCC CGATGTCGAA GCGACCCATC GGGTGGGGCA TGTGGACGAC
GCGCTTGTCC TGCGGCTGGC CGATCGGATC GATCGCGGCG GAGCGATCAC CGATACGATC
GCCCGCGTGG ACGAGGTTCA TACCCTGACC TCGCTTGCCG GATTCGAGGC CCTGATGCGG
GGACGGCGCG TCGTGACCTA TGGCGCGCCG TTCTATGCGG GATGGGGGCT GACGGTCGAT
CGGGGGGACG TTCCCCCCCG CCGGACGCGG CGCCTGTCCC TGGAGGAACT GGTCGCGGGG
GTCCTGATCC TGTTTCCGCG TTATCTGGAC CCGGTGACCC GGTTGCCATG CAGCCCGGAA
GTCCTGATCG AGCGCCTGCA GGACGCCCGG GTCTGGCGGC CTACCTGGTG GATGCGGGCG
TTGGCCTTGC AGACAGGCTG GCGCAAGGCT GTCGCGCGCT ATCGTGCGGC GGTGGCGTCA
TGA
 
Protein sequence
MRITGDTTAL LRPPPFHRVM PVLAVKQDGA TSERVCGSGR ELDDLAARIV RQRVGGCFWG 
RPAGARRPVV VYGPGWGAGS RPIRSILRNV FAAHAAQDVV LVCGRRQGAL AAWGRRHGLA
VVPADTDPHD LLDGATCVYA HPTTDLARLA VLRGHAVKGI PDGMSLAGIM ADIANRTDYV
DPFTGAPMAC ADAITLLADW RRTIMANRRI GACLGMSRWK RRRIADFLAT APGHPPFRLD
IRAGQRGAVA VWATRQPPGL EDAVRRAGIP LWRVEDGFIR SIGLGSALTP PSSITIDTRG
MYYDPAQESD LEHILATSPF DDVLRARARK LMDALVLLGI SKYGPGRASA LPAYWDAAGR
RTILVPGQVA DDQSVRRGGG RIAGNLELLQ AVRESNPDAF ILYRPHPDVE ATHRVGHVDD
ALVLRLADRI DRGGAITDTI ARVDEVHTLT SLAGFEALMR GRRVVTYGAP FYAGWGLTVD
RGDVPPRRTR RLSLEELVAG VLILFPRYLD PVTRLPCSPE VLIERLQDAR VWRPTWWMRA
LALQTGWRKA VARYRAAVAS