Gene Gdia_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0474 
Symbol 
ID6973869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp519355 
End bp520242 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content63% 
IMG OID643390007 
ProductRNA polymerase factor sigma-32 
Protein accessionYP_002274885 
Protein GI209542656 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.455996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00148932 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCTCTT CCGTTCTCAA TGTTGGTCCT GAATCCAACC TGTCAAGATA CCTTCAGGAC 
ATCCGTAAAT TTCCGATGCT GTCGCCCGAG GATGAACAGC GCCTGTCCCG GCGCTGGAAG
GACAAGGGCG ATACCGAGGC CGCGCATGCG CTCGTCACCT CGCACCTGCG CCTGGTCGCC
AAGATCGCCA TGGGCTATCG CGGCTATGGC CTGCCGCTGG GCGAGCTGAT CAGCGAAGGC
AATATCGGGA TGATGCAGGC GGTCCGCCGC TTCGACCCGG ATCGCGGCTT CCGCCTGGCG
ACCTATGCGA TGTGGTGGAT TCGGGCCGCC ATCCAGGAAT ATATCCTGCA TAGCTGGTCG
CTGGTGAAGA TGGGCACCAC CGCCGCGCAG AAGAAGCTGT TCTTCAACCT GCGCCGCCTG
AAGGGCCAGA TGCAGGCCAT CGACGACGGC GACCTGAAGC CCGAGCAGGT GAACAAGATC
GCCGAATCGC TGGGTGTGCC CGAGCAGGAC GTCATCAACA TGAACCGCCG CCTGTCGGCG
CCGGATCACA GCCTGAACGC CCCGCTGCGC GCCGACAGCG AAGGCGAATG GCAGGACTGG
CTGGTCGACG AACACGACAA CCAGGAACAG ACCCTGGCGG AAAACGAGGA ATTCAGCGGA
CGCAAGGCAT TGCTGGACAA TGCCATGAAG ACGCTGAACG ACCGTGAGCG CCACATCCTG
ACCGAACGCC GCCTGAAGGA CGACCCGGCC ACGCTGGAAG AATTGTCGCA CACCTACAAC
ATCTCGCGCG AGCGGGTGCG GCAGATCGAG GTCCGGGCGT TCGAGAAGGT GCAGGCGGCG
ATGAAGGCGG AAGTCGAGGC CCATCGCGAA GCCCACGCCG CAAACTGA
 
Protein sequence
MASSVLNVGP ESNLSRYLQD IRKFPMLSPE DEQRLSRRWK DKGDTEAAHA LVTSHLRLVA 
KIAMGYRGYG LPLGELISEG NIGMMQAVRR FDPDRGFRLA TYAMWWIRAA IQEYILHSWS
LVKMGTTAAQ KKLFFNLRRL KGQMQAIDDG DLKPEQVNKI AESLGVPEQD VINMNRRLSA
PDHSLNAPLR ADSEGEWQDW LVDEHDNQEQ TLAENEEFSG RKALLDNAMK TLNDRERHIL
TERRLKDDPA TLEELSHTYN ISRERVRQIE VRAFEKVQAA MKAEVEAHRE AHAAN