Gene Gdia_0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0338 
Symbol 
ID6973732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp379403 
End bp380779 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content68% 
IMG OID643389870 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_002274749 
Protein GI209542520 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0175077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.141122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCG GCCCCAGGCT CGCGCTCCGG CAGGCCCAGA CCCTCGTCAT GACGCCGCAA 
TTGCGGCAGG CGATCCACCT TCTGCAATTG TCCAACCTCG ACGCCTGTTC GTTCATCGAA
CAGGAACTGG AACGCAATCC GCTGCTGGAG CGCGCGAACC TGGCGGGGCT CGAGGAACTC
CGGCCCAAGG TCGTGGCCCA CGGTGCGCGC CATGACCCTC GCCTGGAACG GGTCCCCGAC
CAGGACTGGG CCGCCACAAC GGGGACCCTG CCCGGCCCGG CCCCCTCGCT GCACGAACGG
CTGGCGCAGC AGGTGCGGCT GTCGTGGCGC GATGCCGGCG ACCGGCTGGT AGGGGCGTAT
CTGATCACCC TGCTGGACGC CGACGGGCGG CTCGTCCCCC CGACGGCGGT CATCGCCCAG
TCCCTGGGCA CCACGTTCGA CCATGTCGAG CGTGTGCGCC AGACCATGAT GCAGTTCGAG
CCGACGGGCC TGTTCGCCCA CGATCTGGGC GAATGCCTGG CCGCCCAGTT GCGCGAACGC
GACCGGCTGG ACCCGGCGAT GGCCGTCCTG CTGCGGCACC TGGACCTGCT GGCCCGGCGC
GACCTGCGCC GGCTGCAGGG TCTGTGCGGC GTCGATGCCG AGGACTTGGC GGACATGATT
GCCGAGGTCC GCACCCTGTC GCCCCGCCCG GCCCGGGATT TCGGCGATCC GGCGGCCTGT
TCCGTCATCC CGGACGTCCT GTTGCAGCCC GCGCCGGACG GGGACTGGAT GCTGGAACTC
AACCCCGAGA CCATGCCGAG CGTGGTGGTG AATTCCGCAC TGAGCACCCG GGTGGCGCTG
CGGGCCCGGC GCGAGGACAG GCCGTTCCTG AACGAGCGCC TGGCCAGCGC CAACTGGCTG
GTGCGCGCCC TGCAGCAGCG GGCCGACACG ATCATCCGCG TCGCAACCGC GATCATGCGC
CGGCAACGCG GCTTCCTGGA TCACGGGATC GGCCACCTGC GCCCGCTGGT GCTGCGCGAT
ATCGCCGAAA CGGTCGGGCT GCACGAAAGC ACCGTCAGCC GGGTGACGGC GAACAAGTAC
ATCGCCACCC CGCGCGGCCT GTTCGAGCTG AAATATTTCT TCACCACCGC GATCCCGGGC
CGCGCGGGCG GCGCCGACCA CAGCGCCGAG GCCATCCGCC ATCGGATCAA GACCTTGATT
TCACAGGAAT CTGATGGAAA AATCCTGTCC GACGATGCAA TCGTGACCCG GTTACGCAAG
GAAGGAATAG ACATTTCTCG CCGAACCGTG GCCAAATACC GGGATGCGTT GAGGATACCC
AACTCGGCGC AGAGAAAACG GGACAAGGCC CTCATCGGCC TGAAACCGGA AGCTTGA
 
Protein sequence
MTIGPRLALR QAQTLVMTPQ LRQAIHLLQL SNLDACSFIE QELERNPLLE RANLAGLEEL 
RPKVVAHGAR HDPRLERVPD QDWAATTGTL PGPAPSLHER LAQQVRLSWR DAGDRLVGAY
LITLLDADGR LVPPTAVIAQ SLGTTFDHVE RVRQTMMQFE PTGLFAHDLG ECLAAQLRER
DRLDPAMAVL LRHLDLLARR DLRRLQGLCG VDAEDLADMI AEVRTLSPRP ARDFGDPAAC
SVIPDVLLQP APDGDWMLEL NPETMPSVVV NSALSTRVAL RARREDRPFL NERLASANWL
VRALQQRADT IIRVATAIMR RQRGFLDHGI GHLRPLVLRD IAETVGLHES TVSRVTANKY
IATPRGLFEL KYFFTTAIPG RAGGADHSAE AIRHRIKTLI SQESDGKILS DDAIVTRLRK
EGIDISRRTV AKYRDALRIP NSAQRKRDKA LIGLKPEA