Gene Gdia_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3044 
Symbol 
ID6976478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3331338 
End bp3333692 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content64% 
IMG OID643392552 
ProductOrganic solvent tolerance protein 
Protein accessionYP_002277389 
Protein GI209545160 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1452] Organic solvent tolerance protein OstA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.328846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCGA TCCTGATGTC CCGTCGCCTG TTTCGCCCTG CCCGGGGCCA GCGTTCGCAG 
GCGCGTGGCA CATTGCTGGC TGCCGGCCTG CTGGGCAGCA CGTTCCTGGG CGCGGTCATC
CAGGTTCACC GCGCCCATGC GCAGGTCCAG GCCAAGCCGA TCAAGATCAC GACCGGCAGC
CCCATGTCGC AGTCGGCCCC CGTCACGTTC CAGGCGGACA CCGTCACCTA CGACAACGCG
CGGGGCATCG TGACCTGGAC CGGCAACGTC CAGATCTGGC AGGAAGACCA CGTCCTGCGC
GCCGATACGG TGACGTACGA CCGCAATACC GGCATCGCCG CCGCCCACGG CCACGTCGCG
ATGGTGGAAC CCGACGGCAC GGTCATCTTC AGCGACTACG CCGAACTCAG CAACGGCATG
CGCGACGGCA TCATGACCCG CCTGCATATG CTGATGACCG AAAACGCCAA GCTGGCGGCC
AACGGCATGC GGCGCACCGG GGCCAAGGTC AACGACATGG CGCGCGCCGT CTACACGGCC
TGCAACATCT GCGCCCAGCA TCCCGAACGG CCGCCCTTCT GGCAGTTGCG GGCCTATGAC
GCGATCGACG ATACCGAACA CAAGCGCATC GACTTCCGTG ACGCCTATCT GGACATGTTC
GGCATTCCGG TCATGTACAT GCCCGCCTTT TCGATGTCCG ACCCGTCGGT CAAGCGGCAG
AGCGGCTTCC TGACGCCGGG GATCACGCCG CATGACCGCT ATCTGGGCGC CTATGTCACC
ATCCCCTATT ACTGGGTGAT CGACAAACAG TCCGACATGA CGATCCAGGG TCTGCTGTCC
ACCCGGACGG GCCCCCAGGT CAGCACCCAG TACCGCAACG CGCTGAATTT CGGCACGCTG
AACATCCTGG CCGGCCTGGC CTACGACACC AACCGCCAGG GGTCGTACGT GAACACCTTC
GGCAACACCA CGGGCACGTC CGACGAGCAC GGCATACAGG GCTACATCTT CGCCCAGGCG
CAGGCGTCGA TCACGCGCGC CTGGCGCGCG GGGGCGAACA TCAACCTGGC CACGTCGGCC
GACTACATGC GCGACTATCG CGTGTCCGGC TACGGCCAGG AAATGCTGAC CTCGAACGTC
TATCTCGAAG GGTTCGGAAC CGGGTCCTAT TCGCGCGTCG ACGCCCAGGC CTACCAGGGC
CTGAACCAGG GCGTCATCCG CGACAACGAA CTGCCGTGGG TGCTGCCGCG CTACACCTAC
AGCTATTTCG GCCAGCCCGA CGCGTGGGGC GGCCGGCTGG CGGTGGATAC CACCGATTTC
TACGTCTATC GCGCATCCGG CGTCTCGGAC CAGCGCGGGC AGCTCTCGCT GAACTGGGAC
CGGCCGTTCC GCAACCATCT GGGCCAGCTC TGGAAGCTGA CCCTGCACCT GGAATCGGCC
ATCCACCGGG CGACCAGCCT GAACCAGCAG CCGATCTACG CCCCGACCAC GACCCGCCAG
CAGATCACGG GCCAGGTCCT GCCGACGGTC GCGCTGAAGA TGAACTGGCC GTTCCTGCGG
GGCTTCATGA ACGGCAAGGG CACCCAGATC CTAGAACCCA TCGCCCAGGT CATCGCGGCG
CCCAACACCG GCAACAGCCG CAACAGCAAC CTGCCGAACG AAGACAGCCT GTCCTACGAA
TTCACCGATT CGACGCTGTT CGCGATCAAC CGCTACCCGG GGACCGACCG GCTGGACGGC
GGCCTGCGCG GCAATTTCGG CGTGCATGGC AACTGGACGT GGAACGGGCA TGAAATCGAT
ACGCTGGTCG GCGAAAGCCT CCAGGAGCAT ATCGACCACA ACCGCATTCC CTATTCCGGG
CTGAACCACC ATTTTTCCGA CGTGGTGGCG CGCGCGCGCT TCGCGCCGAA CCAGTATATC
GACTTCACCG GCCGCACCCG GATCGACCCG TATGCCGGCC GGATCGATTT CGGCGACGCG
CTGGTCAGCA CCGGGGTAAA GCATTTCCAC CTGACCAGCG GCTATGTCTA CGAACCGGTG
ACGCCGTATT ATTATTATGC CACCAACATC CAGACGGCGA GCCCCAACGC GGCCTATTAT
GTGCGCACGA ACGAGGTGAC GGTCGGTGCC AACACCAACT GGCAGAACTA CAGCCTGTCG
GCCTTCGTCC GCCGCAGCCT GTCGCGCCAG CAATTCGTCA GCATGGGCGG CAACGCCGGC
TACAACAATG ACTGTTTCGG CTTCAACCTG ATGTATATCA AGCAATATAC GTCGATCGGC
GGCCAACAGC GCAATTCGAC CATCATGTTC ACCCTGACCT TCAAGACCAT CGGTGCCTTT
GGTATTCGTG GCTGA
 
Protein sequence
MMPILMSRRL FRPARGQRSQ ARGTLLAAGL LGSTFLGAVI QVHRAHAQVQ AKPIKITTGS 
PMSQSAPVTF QADTVTYDNA RGIVTWTGNV QIWQEDHVLR ADTVTYDRNT GIAAAHGHVA
MVEPDGTVIF SDYAELSNGM RDGIMTRLHM LMTENAKLAA NGMRRTGAKV NDMARAVYTA
CNICAQHPER PPFWQLRAYD AIDDTEHKRI DFRDAYLDMF GIPVMYMPAF SMSDPSVKRQ
SGFLTPGITP HDRYLGAYVT IPYYWVIDKQ SDMTIQGLLS TRTGPQVSTQ YRNALNFGTL
NILAGLAYDT NRQGSYVNTF GNTTGTSDEH GIQGYIFAQA QASITRAWRA GANINLATSA
DYMRDYRVSG YGQEMLTSNV YLEGFGTGSY SRVDAQAYQG LNQGVIRDNE LPWVLPRYTY
SYFGQPDAWG GRLAVDTTDF YVYRASGVSD QRGQLSLNWD RPFRNHLGQL WKLTLHLESA
IHRATSLNQQ PIYAPTTTRQ QITGQVLPTV ALKMNWPFLR GFMNGKGTQI LEPIAQVIAA
PNTGNSRNSN LPNEDSLSYE FTDSTLFAIN RYPGTDRLDG GLRGNFGVHG NWTWNGHEID
TLVGESLQEH IDHNRIPYSG LNHHFSDVVA RARFAPNQYI DFTGRTRIDP YAGRIDFGDA
LVSTGVKHFH LTSGYVYEPV TPYYYYATNI QTASPNAAYY VRTNEVTVGA NTNWQNYSLS
AFVRRSLSRQ QFVSMGGNAG YNNDCFGFNL MYIKQYTSIG GQQRNSTIMF TLTFKTIGAF
GIRG