Gene Gdia_3036 details


Gene Information

Locus tag: Gdia_3036
Symbol:
ID: 6976470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3323540
End bp: 3325492
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 65%
IMG OID: 643392544
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_002277381
Protein GI: 209545152
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
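
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed gene length, not something the record states) and that the last codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3323540, 3325492)
print(length)                     # 1953
print(protein_length_aa(length))  # 650
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.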


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.672133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0730738
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAA AGACAGCCGC GGGTTCGGAA GCGACTGCTG GCGATCAGGA CAACGACACC 
ACTCTGCTGG ATACCCAATC CGGAGCCGTC AAGAGACTGA TCGCACGGGG CAAGGAACGG
GGGTACATCA CCTTCGACGA GCTGAACGCC GTCCTGCCCC AGGATCAGAT GTCGTCGGAG
CAGATCGAGG ATGTGATGGC GGTCCTGTCC GAAATGGGCA TCCAGGTCGT CGAGAACGAG
GATAACGACG ACAGCGAGGC CAATCGCGAG GAGAAGGCCG AGGAGGCCGA CACCGAGGGC
GAGGAAGCCG GGGGTGCCGC GGGCAACGTC GATACCGAGA GCCTGGGCCG TACCGACGAC
CCGGTGCGCA TGTACCTGCG CGAGATGGGG TCCGTCGAAC TGCTGTCGCG CGAGGGCGAA
ATCGCCATCG CCAAGCGGAT CGAGGCCGGC CGCGACGAGA TGATCGGCGG CCTGTGCGAA
AGCCCGCTGA CCTTCCGCGC CATCATTTCG TGGCACGAGC GCCTGAAGGC GGGCGAGATG
CTGCTGCGCG ACATCGTGGA CCTGGAGGCC ATGCAGTCCG GCGGCGCCGA AGCCGAGGCC
GGCGCCGAGG GCGGCGAGCA GGAAGACGGC AGCTTCGACG CGGCCCCCGA GAGCGAGGAC
GGCGAGGAAG GCGACAGCGC CGGCCTGTCG CTGTCCGCGC TGGAAGAAAA GCTGAAGCCC
GAGATCCTGG CGCAGTTCGA GGAAATCGAG GAACTGTATT CCCGGCTGCA GAAGCTGCAG
TCGAAGCGGC TGGAGACCCT GACCTCGGGC GCCGAGATGT CGGACAAGTC CGAGAAATCG
TACGAGAAGC TGCGCGAGGA ACTGGTGGGC AAGGTGCAGC AGGTCCACCT GCACAACACC
CGCATCGAGG TGCTGGTGCA GCACCTGAAG GAAATCTTCC AGCGGCTGAA CGGGCTGGAA
GGGCGCATGC TGCGCCTGGC CGAGAGCACC AAGGTCTCGC GCGAGGACTT CCTGATCAAG
TATCGCGGCA GCGAGCTGGA CCCGGGCTGG ATGGACATGG TGTCCGCCCT GCCGGGCAAG
GCGTGGAAGA ATTTCGTCGC CAAGCATTCG GCGTCGGTGC TGGACCTGCG CGGCCAGGTC
GCATCCCTGT CGCAGGAAAC CGGCCTGCCG GTCGGCGAAT TCCGCCGCGT CTACGCCACC
GTGTCGCGCG GCGAGCGCGA TTCGGCCCGC GCGAAGAAGG AGATGATCGA GGCGAACCTG
CGCCTGGTGA TCTCGATCGC CAAGAAATAT ACCAATCGCG GGTTGCAGTT CCTGGACCTG
ATCCAGGAGG GCAATATCGG CCTGATGAAG GCGGTGGATA AGTTCGAATA TCGCCGGGGC
TACAAGTTCT CGACCTATGC CACGTGGTGG ATCCGCCAGG CGATCACCCG GTCGATCGCC
GACCAGGCCC GCACGATCCG CATCCCGGTC CATATGATCG AGACCATCAA CAAGCTGGTC
CGCACGTCGC GCCAGATGCT GCATGAGATC GGACGCGAGC CCGCGCCCGA GGAACTGGCC
GAAAAGCTGG GCATGCCGCT GGAGAAGGTG CGCAAGGTCC TGAAGATCGC CAAGGAACCG
ATCTCGCTGG AAACGCCGAT CGGTGACGAG GAAGACAGCC ACCTGGGCGA TTTCATCGAG
GACAAGACGG CGGTCATCCC GCTGGACGCC GCGATCCAGA CCAACCTGCG CGAAGCCACG
ACGCGGGTCC TGTCCTCGCT GACCCCGCGT GAGGAACGCG TGCTGCGCAT GCGCTTCGGC
ATCGGCATGA ACACCGACCA CACCCTGGAA GAGGTGGGCC AGCAGTTCAA CGTGACGCGC
GAGCGCATCC GCCAGATCGA GGCGAAGGCG TTGCGCAAGC TGAAGCACCC GAGCCGCAGC
CGCAAGCTGC GCTCGTTCCT GGACGACAAC TGA
 
Protein sequence
MATKTAAGSE ATAGDQDNDT TLLDTQSGAV KRLIARGKER GYITFDELNA VLPQDQMSSE 
QIEDVMAVLS EMGIQVVENE DNDDSEANRE EKAEEADTEG EEAGGAAGNV DTESLGRTDD
PVRMYLREMG SVELLSREGE IAIAKRIEAG RDEMIGGLCE SPLTFRAIIS WHERLKAGEM
LLRDIVDLEA MQSGGAEAEA GAEGGEQEDG SFDAAPESED GEEGDSAGLS LSALEEKLKP
EILAQFEEIE ELYSRLQKLQ SKRLETLTSG AEMSDKSEKS YEKLREELVG KVQQVHLHNT
RIEVLVQHLK EIFQRLNGLE GRMLRLAEST KVSREDFLIK YRGSELDPGW MDMVSALPGK
AWKNFVAKHS ASVLDLRGQV ASLSQETGLP VGEFRRVYAT VSRGERDSAR AKKEMIEANL
RLVISIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYRRG YKFSTYATWW IRQAITRSIA
DQARTIRIPV HMIETINKLV RTSRQMLHEI GREPAPEELA EKLGMPLEKV RKVLKIAKEP
ISLETPIGDE EDSHLGDFIE DKTAVIPLDA AIQTNLREAT TRVLSSLTPR EERVLRMRFG
IGMNTDHTLE EVGQQFNVTR ERIRQIEAKA LRKLKHPSRS RKLRSFLDDN
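
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows one way to check that relationship and to compute GC content. It uses the standard codon-to-amino-acid assignments, which table 11 shares; alternative start codons are not handled, which is acceptable here only because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCGACAA AGACAGCCGC GGGTTCGGAA"))  # MATKTAAGSE
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence checks the whole record; `gc_fraction` on the full gene sequence can be compared with the listed GC content.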