Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0664 |
Symbol | |
ID | 6974061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 755825 |
End bp | 756985 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643390194 |
Product | polysaccharide export protein |
Protein accession | YP_002275070 |
Protein GI | 209542841 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0130788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0340972 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC GCTACGCATC AGTTGCTCGT GTCCTGGCCA TCTCCGCCTT GCTGGCCGGA TGCAGTACGT TGCCTAGCAG CGGGCCGGTC AGTTCACAGA TCCTGCAAGC CGCAAAAGAC CCGAAACTGA ACCCGATCGG TTTCAGCATA GTCCCGTTCA CGCCGAAGAC ACTCGACGTG CTTCAAAACG AAACGCCTCC GCTTCTTTCC ACCCTGGAGA AAGAAGGGCC GGGTCAGGGC GAACACGGGG CCATCGGCCC CGGCGACGTT CTTGCAATCT CCGTCTTCGA GATTGGCAGC AGCCTGTTTT CGGGCGGCGG CCTGACAGGA GGAAGGGCGG CGGCATCAGG GGCAGCCAGT ATTGAAGCCC TCCCCCCCGT CGAGGTCGAT GATCAGGGCT ATATTCCCTT TCCCTATATC GGCCGCGTTT TCGTCGCGGG AGAAACGCCG ACCCAGCTTG CCAAGACGAT CGAGGACCAA CTGGCCGCGA AGTCGCAGAA TCCCCAGGTC ATTGTGCGGA TCATGACCGA CCTGCATAAC TCCATCATCG TATCGGGCGA CATTCTTCAC CCTGGACGTC AGATGCTCAC CCTCGCCCAT GAGGGGCTTC TGGACATCAT CGCCATGGGA GGTGGCCCGA GCCACTCGTC CGAGGACAGC GTGGTACTCC TGACCCGCCA TGGGGTGACG GGGAGCATTC CCCTGCGTAC GCTCGAAACC CATCCGGAGC AGAACATCCC GCTTATGCCG GGCGACCGGG TCCAGGTTAT CTACCTGCCC CGGACCTATA CGGTTTTCGG CGCAACACGC GTAATGCAAA CGCCGTTCAA TACACCGGTA CTCACACTTG ACCAGGCCAT CGCGCGCATT GGCGGACCGG CTGACGATCG TGCCGACGCC AACGCAATCT ACCTGTTCCG CTATGAAAGC GACGAGGTGG CACAGAAGCT CGGGCTGACC CCGAAACCCG GTGGCACGCC AATTATCTAC AATATCGATT TGATGAACCC GACGAACTAC TTTCTTTCCC AGAAATTCGT CATGAAAGAC AAGGATCTGA TCTTTGTCTC GAATGCGAAG GTCAACAAGC TGTACAAGTT CCTGACCTTG ATTGGCGCCG TGACCAGCCC GGCTATTACC GCTGCCTACG TGGCGAGGTA G
|
Protein sequence | MIDRYASVAR VLAISALLAG CSTLPSSGPV SSQILQAAKD PKLNPIGFSI VPFTPKTLDV LQNETPPLLS TLEKEGPGQG EHGAIGPGDV LAISVFEIGS SLFSGGGLTG GRAAASGAAS IEALPPVEVD DQGYIPFPYI GRVFVAGETP TQLAKTIEDQ LAAKSQNPQV IVRIMTDLHN SIIVSGDILH PGRQMLTLAH EGLLDIIAMG GGPSHSSEDS VVLLTRHGVT GSIPLRTLET HPEQNIPLMP GDRVQVIYLP RTYTVFGATR VMQTPFNTPV LTLDQAIARI GGPADDRADA NAIYLFRYES DEVAQKLGLT PKPGGTPIIY NIDLMNPTNY FLSQKFVMKD KDLIFVSNAK VNKLYKFLTL IGAVTSPAIT AAYVAR
|
| |