Gene Gdia_1526 details


Gene Information

Locus tag: Gdia_1526
Symbol:
ID: 6974936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1701560
End bp: 1703263
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 70%
IMG OID: 643391057
Product: type II secretion system protein E
Protein accession: YP_002275920
Protein GI: 209543691
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
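
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, though the numbers are consistent with that reading) and that the final codon is a stop codon that contributes no residue. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1701560
end_bp = 1703263
gene_length_bp = 1704
protein_length_aa = 567

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence of N bases holds N // 3 codons; the last one is assumed
# to be the stop codon, which is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1


def record_is_consistent():
    """Return True if the listed lengths agree with the coordinates."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, expected_protein_length, record_is_consistent())
```

Under those assumptions the span is 1704 bp and the expected protein length is 567 aa, matching the listed values.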


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.30964
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGACG CGGGATTGGA CGCTGACTCG CCGATCGACG CACTGGCGCG TTTGCTGGTC 
GAGCGGGACC GGTGCGACCC GCGCGCCATC GACCGCGCGC GCCGCGCGGC GGACCAGAAT
GGCGGGCGCC TGGACCGGAT TCTGCTGCAA CTGGGGCTGG TGTCGGAACG CGACATGGCC
TTGAGCTACG CGGAATTCCT GGACATGCCC CTGGCGTCCG CTGATCTCTA CCCCCGGGAA
CCCGTCCTGA CGCAGTATCT CGGCGCCCGC TTCCTGCGTG ACATCCATGC GGTGCCGCTG
GGCGTGGACG ACGGGACGGT CACGCTCGCC CTGCGCGATC CGCTGGACGG CTTCGCCGCC
TCGTCGATCG CGGCGGCCAC GGGGCTGCGC GTGTCCTGCC GGGTGGCCGT GCCGATCGAA
CTGGAAGCCG CCCTCGACCG CCTGTACCCG TCCGGGGGCG ATGACGGCGT CCAGCCCGAC
GAGGACACGG CACCGCTGGA AGACGATGCC GAGCGGCTGA AGGACCTGGC GTCCGAGGCC
CCGGTGATCC GCCTGGTCAA CCAGATCGTC AGCCGCGCCG TGGAAACCCA TGCCTCGGAC
ATTCATATCG AACCGTTCGA GGACCGGCTG CGCGTGCGCT ATCGCTATGA CGGCGTGCTG
CACGAGGTCG AAAGTCCCCC CGCGCACCTG ATCCCGGCGA TCATTTCGCG CATCAAGATC
ATGGCCCGGC TGGACATCGC GGAACGGCGC CTGCCGCAGG ACGGCCGGAT CAAGCTGGCC
GTGCGCGGGC ACGACATCGA TTTTCGCGTC TCCACGATCC CCTCGCTGCA TGGCGAGACG
GTGGTGCTGC GCGTGCTGGA CCGTTCCAGC GTGCGGTTCG ACTACGCGAC GCTGGGCCTG
CCGCCGGCCA TCGTCGCACG CCTGCGCGGC CTGCTGGCGC TGCCGAACGG CATCGTGCTG
GTGACCGGGC CGACCGGGTC GGGCAAGACG ACCACGCTCT ATACCGGGCT GGCGGACCTG
AACGCGGTGA CGCGCAAGGT GGTGACGATC GAGGACCCGA TCGAATACCA GCTTGGCGGC
ATCAACCAGG TGCAGGTCCG GCCGCAGATC GGACTGACCT TCGCCGCCCT GCTGCGCGCG
ATCCTGCGCC AGGACCCCGA CGTCATCATG GTCGGTGAAA TCCGCGACAT CGAAACCGCC
CAGATCGCGG TGCAGGCCGC CCTGACCGGC CACCTGGTGC TCTCGACCCT GCATACCAAC
TCCGCCGCCG CCGCCATCAT CCGCCTGCGC GACATGGGGG TGGAGGATTA CCTGCTGACG
GCCGTGCTGC GCGGCGTGGT GGCGCAGCGG CTGGTCCGGC GGCTGTGCGG CCAGTGCCGG
ACGCCCTACA CCCCGCCCCG GGAACTGGTC GATCGCTTCG ACCTGGAAAC GCTCGCAGGC
GGCGGGCCCG TCACGCTGTT CCATCCCGTG GGTTGCGCCG CCTGCCGGAA CACCGGCTAT
AGCGGCCGGC AGGCCATCGC CGAGCTTCTG GAACCGGACG AGGCGGTGGA ACGCCTGATC
TTCGCGCGCA GCGACCACCT GGCGATCGAA CGCGCGGCGG TCGAGGCCGG AATGGTCCCG
ATGTTCACCT CCGGCCTGGT CGCGGCCCTG AAGGGCGAGA CGACGATCGA GGAAGTGACC
CGCAGCGTCC GGGCGGGAAC GTGA
 
Protein sequence
MMDAGLDADS PIDALARLLV ERDRCDPRAI DRARRAADQN GGRLDRILLQ LGLVSERDMA 
LSYAEFLDMP LASADLYPRE PVLTQYLGAR FLRDIHAVPL GVDDGTVTLA LRDPLDGFAA
SSIAAATGLR VSCRVAVPIE LEAALDRLYP SGGDDGVQPD EDTAPLEDDA ERLKDLASEA
PVIRLVNQIV SRAVETHASD IHIEPFEDRL RVRYRYDGVL HEVESPPAHL IPAIISRIKI
MARLDIAERR LPQDGRIKLA VRGHDIDFRV STIPSLHGET VVLRVLDRSS VRFDYATLGL
PPAIVARLRG LLALPNGIVL VTGPTGSGKT TTLYTGLADL NAVTRKVVTI EDPIEYQLGG
INQVQVRPQI GLTFAALLRA ILRQDPDVIM VGEIRDIETA QIAVQAALTG HLVLSTLHTN
SAAAAIIRLR DMGVEDYLLT AVLRGVVAQR LVRRLCGQCR TPYTPPRELV DRFDLETLAG
GGPVTLFHPV GCAACRNTGY SGRQAIAELL EPDEAVERLI FARSDHLAIE RAAVEAGMVP
MFTSGLVAAL KGETTIEEVT RSVRAGT
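
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence. This is a minimal sketch, not the method the database used: it strips the spaces that separate the 10-base blocks, then translates codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as start codons), so the standard codon layout is used here. The names `clean` and `translate` are hypothetical helpers.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGATGGACG CGGGATTGGA CGCTGACTCG CCGATCGACG CACTGGCGCG TTTGCTGGTC"
peptide = translate(clean(first_row))
print(peptide)
```

The 60 bases of the first row yield 20 residues, which match the first two blocks of the protein sequence (MMDAGLDADS PIDALARLLV).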