Gene Gdia_0664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0664 
Symbol 
ID6974061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp755825 
End bp756985 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content59% 
IMG OID643390194 
Productpolysaccharide export protein 
Protein accessionYP_002275070 
Protein GI209542841 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0130788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0340972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATC GCTACGCATC AGTTGCTCGT GTCCTGGCCA TCTCCGCCTT GCTGGCCGGA 
TGCAGTACGT TGCCTAGCAG CGGGCCGGTC AGTTCACAGA TCCTGCAAGC CGCAAAAGAC
CCGAAACTGA ACCCGATCGG TTTCAGCATA GTCCCGTTCA CGCCGAAGAC ACTCGACGTG
CTTCAAAACG AAACGCCTCC GCTTCTTTCC ACCCTGGAGA AAGAAGGGCC GGGTCAGGGC
GAACACGGGG CCATCGGCCC CGGCGACGTT CTTGCAATCT CCGTCTTCGA GATTGGCAGC
AGCCTGTTTT CGGGCGGCGG CCTGACAGGA GGAAGGGCGG CGGCATCAGG GGCAGCCAGT
ATTGAAGCCC TCCCCCCCGT CGAGGTCGAT GATCAGGGCT ATATTCCCTT TCCCTATATC
GGCCGCGTTT TCGTCGCGGG AGAAACGCCG ACCCAGCTTG CCAAGACGAT CGAGGACCAA
CTGGCCGCGA AGTCGCAGAA TCCCCAGGTC ATTGTGCGGA TCATGACCGA CCTGCATAAC
TCCATCATCG TATCGGGCGA CATTCTTCAC CCTGGACGTC AGATGCTCAC CCTCGCCCAT
GAGGGGCTTC TGGACATCAT CGCCATGGGA GGTGGCCCGA GCCACTCGTC CGAGGACAGC
GTGGTACTCC TGACCCGCCA TGGGGTGACG GGGAGCATTC CCCTGCGTAC GCTCGAAACC
CATCCGGAGC AGAACATCCC GCTTATGCCG GGCGACCGGG TCCAGGTTAT CTACCTGCCC
CGGACCTATA CGGTTTTCGG CGCAACACGC GTAATGCAAA CGCCGTTCAA TACACCGGTA
CTCACACTTG ACCAGGCCAT CGCGCGCATT GGCGGACCGG CTGACGATCG TGCCGACGCC
AACGCAATCT ACCTGTTCCG CTATGAAAGC GACGAGGTGG CACAGAAGCT CGGGCTGACC
CCGAAACCCG GTGGCACGCC AATTATCTAC AATATCGATT TGATGAACCC GACGAACTAC
TTTCTTTCCC AGAAATTCGT CATGAAAGAC AAGGATCTGA TCTTTGTCTC GAATGCGAAG
GTCAACAAGC TGTACAAGTT CCTGACCTTG ATTGGCGCCG TGACCAGCCC GGCTATTACC
GCTGCCTACG TGGCGAGGTA G
 
Protein sequence
MIDRYASVAR VLAISALLAG CSTLPSSGPV SSQILQAAKD PKLNPIGFSI VPFTPKTLDV 
LQNETPPLLS TLEKEGPGQG EHGAIGPGDV LAISVFEIGS SLFSGGGLTG GRAAASGAAS
IEALPPVEVD DQGYIPFPYI GRVFVAGETP TQLAKTIEDQ LAAKSQNPQV IVRIMTDLHN
SIIVSGDILH PGRQMLTLAH EGLLDIIAMG GGPSHSSEDS VVLLTRHGVT GSIPLRTLET
HPEQNIPLMP GDRVQVIYLP RTYTVFGATR VMQTPFNTPV LTLDQAIARI GGPADDRADA
NAIYLFRYES DEVAQKLGLT PKPGGTPIIY NIDLMNPTNY FLSQKFVMKD KDLIFVSNAK
VNKLYKFLTL IGAVTSPAIT AAYVAR