Gene Gdia_0662 details

Gene Information

Locus tag: Gdia_0662
Symbol:
ID: 6974059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 752961
End bp: 754136
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 58%
IMG OID: 643390192
Product: putative capsule polysaccharide export inner-membrane protein CtrB
Protein accession: YP_002275068
Protein GI: 209542839
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID: [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family
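
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_gene_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check gene coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length (assumptions, not stated
    in the record itself).
    """
    span = end_bp - start_bp + 1   # inclusive span of the gene on the replicon
    codons = gene_length_bp // 3   # number of complete codons, stop included
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_gene_lengths(752961, 754136, 1176, 391)
print(result)
```

With the values of this record, the span is 754136 - 752961 + 1 = 1176 bp, which gives 392 codons, or 391 amino acids once the stop codon is excluded.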


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0904257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0442626
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGGT TTGAAATTCC GAGCGGGACA ATCGCCTCGG CTGGTTCGGC GCTTCTCGCG 
ATTCCGCGGG GCTTTCACCG CGGACGTGGG GTCCGAAAGG CTTGGCCATT TCTGCTGGTG
GTGATGCTGC CGACCTTCAT CGCGGCAATT TACTATTTTT TGATCGCCGC GCCGCAATAT
GTCTCGCAGG CCGAATTCGT CGTGCGGGGG GCTTCATCCC AGCCCATGGG AATGCTGAGC
AGCCTGCTGA CGGGGGAGGG CGGATCGTCG GCCGACGAAG ATGCCTACGT CGTGCAGGAC
TACCTGACGT CGCGGGATGC GGCCAGAACG ATGCTTCGCA CGCAAGGGGG CGCGGCGATG
TTCAATCGCC CCGAAGGCGA TTGGCCTGCC CGGTTCCCCA ATATCTTTAC GGGCGCGACC
TTCGAGCATT TTTATCGCTA TTATAAGCGG CATATCACCG TTGATCTCGA TACATCGACG
TCCATAACGA CCCTTCAGGT CCGTACCTTC CGCGCGCAGG ATTCCCAAGC CGTTGCACAG
GCTCTTCTTG TCGCGGCGGA ACAACTCGTC AATCAGATGA ATGCGCGGAA GCGGGCCAAC
ATGATCGGCA GCGCGGCGAA GGAACTTGCG GAAGCGCAGG ATCAACTGCG GGACGTAGAG
GAGCAGATGG CCGCCTATCG CAATCGGGAG GCACTCTTGG ACCCACTCAA GCAAGCGGCC
CCGATGTTGT CGAATATCAA TGAACTGCAG GTGGCGCTGA CGTCGACGCG GATACAGCTT
GCCCAGGTTC AGACAGAATC GCCGAATAGT CCTTCGATTC CGGTGTATCA GCATCGGATC
GCGGTGCTGG AAGATCAGAT TGCCAGGTCG AACAAGGAGG TTACGGGGTC GAAGACCTCG
CTGGTCCCCA AGATCACGGA TTACGACGCC TTGGTGATTA AACAGGAAAT TGTCGAGAAA
GGGTTGGCTG CGGCGGCGTC CGCCTTGATC AGCGCCAAAG GGCAGGCGGA TCGGCAGCAG
GTCTATCTGG AGGAGATTTC GCAGCCGGAT TTGGCGGATT ATGCCACATA TCCGCAGCGG
ATTGCCGACG TGCTGATTGT CTTTGCGACG TTCTTGATGG TCTACCTGAT GGGTAAGCTG
ATCATTAATG GCGCGCGTGA ACACCAGATC GTGTGA
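
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, not the method used by the database: the helper name `gc_content` is hypothetical, and it assumes the sequence is pasted as plain text in the 10-base blocks shown here, so whitespace is stripped before counting.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the block layout)
    is ignored. This is an illustrative helper, not a database API.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used here only as a small demo.
print(gc_content("ATGGACGGGT"))
```

Passing the full gene sequence as a single multi-line string would give the value for the whole gene.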
 
Protein sequence
MDGFEIPSGT IASAGSALLA IPRGFHRGRG VRKAWPFLLV VMLPTFIAAI YYFLIAAPQY 
VSQAEFVVRG ASSQPMGMLS SLLTGEGGSS ADEDAYVVQD YLTSRDAART MLRTQGGAAM
FNRPEGDWPA RFPNIFTGAT FEHFYRYYKR HITVDLDTST SITTLQVRTF RAQDSQAVAQ
ALLVAAEQLV NQMNARKRAN MIGSAAKELA EAQDQLRDVE EQMAAYRNRE ALLDPLKQAA
PMLSNINELQ VALTSTRIQL AQVQTESPNS PSIPVYQHRI AVLEDQIARS NKEVTGSKTS
LVPKITDYDA LVIKQEIVEK GLAAAASALI SAKGQADRQQ VYLEEISQPD LADYATYPQR
IADVLIVFAT FLMVYLMGKL IINGAREHQI V
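
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce it without external libraries. It is an assumption-laden illustration: the names `CODON_TABLE` and `translate` are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Amino-acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace from the 10-base block layout is ignored. Start-codon
    handling specific to table 11 is not modelled here.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGACGGGT TTGAAATTCC GAGCGGGACA"))  # MDGFEIPSGT
```

The output matches the first ten residues of the protein sequence, and translating the last line of the gene sequence the same way yields its final residues, ending at the TGA stop codon.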