Gene MCA2994 details


Gene Information

Locus tag: MCA2994
Symbol: rpoD
ID: 3103408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 3166790
End bp: 3168589
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 59%
IMG OID: 637172119
Product: RNA polymerase sigma-70 factor
Protein accession: YP_115383
Protein GI: 53802939
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
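The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed values are consistent with that convention) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3166790, 3168589)
print(length)                  # 1800
print(protein_length(length))  # 599
```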


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCCAG AACAGCAGCA GCACCAGCTT AAGGAACTTA TTGCTAAGGG CAAGATGCAG 
GGCTACCTCA CCTTTTCCGA GGTCAACGAT CATCTCCCCA GCGACATCGT CGATCCCGAG
CAGATGGAAG ACATCATCTG CATGATCAAC GACATGGGAA TCGAAGTCCA CGAGGATGTC
CCCGACAACG AAGACCTGAT CCAGGACACG GCGGTGGTGG CCGAATCCGA CGAGGATGAA
GCCGCGGAAG TCGCAGCGGC GCTCGCTTCG TCGGTCGATA CCGAACTTGG GCGCACCACC
GATCCCGTGC GGATGTACAT GCGCGAAATG GGAACCGTGG AGCTGCTGAC CCGCGAAGGC
GAGCTGAGCA TCGCCAAGCG CATCGAAGAG GGGCTCAACA TCGCCGCCAA CGAACTAGTA
CGGTTCCCGG CAGCCATGGA ACACTTCCTC AAAGCCTTCG AGACCGTCGA CAACGGCGAA
CTCAGACTCT CCGATCTGAT CTCCGATTTC TTCGATCCCG CCGCGGCAGA CACGGTGGCA
ACCGAAGACG ACGCCGAAGC GGAAATCATC GATGAAGGCG ATACCGGCCC CGACCCCGAG
CTGGTAAAAG AAAAATTGGC GGCTTTCCGC AAGCTGCACA AGGCGGCCGT CACCGCCCAC
CACAAGTACG GCTACGGCCA CGAGAAGACG CAGAAGTCGT TCGATGCCCT GGCCACGGCC
TTCATGGAAT TCAAGAAGAC CCCGCAGTTC TTCAAGGAAC TGACCGATAC GCTGCGCGAC
ACCGTGGCGA AAATCCGCGA GCAGGAACGC GCGCTCACGG ACATCTGCGT GACCAAAGGG
AAGATGCCCA AGGCGGAGTT CATAGAGTCG TTCAGCGGCA GTGAAACCGA CGAGGGATGG
CTCGACCGGG TATGCGATCC GGCGAAACCC TATGCGGCTG CGATCAAGGC CCATGCGGAG
GAAATTCGCC GGATTCAGGC CAAACTGGCT CAAATCGAGA AGGAGAACCG GCTCGACATC
GCGCTCATCA AAGACATTCA TCGCCGAGTG TCCGCGGGGG AGAACCAGGC ACGCCTGGCC
AAACGCGAAA TGATCGAAGC GAACCTTCGC CTCGTCATCT CCATCGCCAA GAAATACACC
AACCGGGGAC TGCAGTTCCT GGACCTGATC CAGGAAGGCA ACATCGGCCT GATGAAGGCG
GTGGACAAGT TCGAGTACCG GCGCGGCTAC AAGTTTTCGA CTTACGCAAC CTGGTGGATC
CGCCAGGCCA TCACACGGTC CATCGCCGAC CAGGCGCGCA CGATCCGTAT TCCGGTACAC
ATGATCGAGA CGATCAACAA GCTCAACCGC ATTTCGCGGC AAATGCTGCA GGAAATGGGG
CGCGAACCGA CGCCAGATGA ACTCGCCGCA CGCATGGACA TGCCGGAAGA CAAGGTGCGC
AAGGTGATGA AAATCGCCAA GGAACCCATT TCCATGGAGA CCCCCATCGG CGACGACGAC
GATTCGCACC TGGGGGACTT CATCGAGGAC TCCCGCGTCC TGTCACCGGT CGAGCATGCC
ACCGTGGCCA GCCTGCGCGA AACCACCCAA CAGGTGCTTG CCGGACTGAC CGCCCGGGAG
GCGAAGGTAC TCCGGATGCG TTTTGGCATC GACATGAACA CCGACCACAC GCTGGAAGAG
GTCGGAAAGC AGTTTGACGT GACGCGTGAA CGCATCCGCC AGATTGAAGC CAAAGCCCTG
CGCAAATTGC GCCACCCGTC ACGGTCCGAA CAGCTGCGCA GCTTTCTGGA CATCGAATGA
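The GC content reported in the gene information can in principle be recomputed from the gene sequence. The sketch below shows one way to do this in Python, assuming the sequence is pasted in the 10-base grouped layout used here; `clean_sequence` and `gc_percent` are illustrative names, and only the first sequence line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to check
# the value reported for the whole gene.
sample = "ATGACGCCAG AACAGCAGCA GCACCAGCTT AAGGAACTTA TTGCTAAGGG CAAGATGCAG"
print(round(gc_percent(sample), 1))
```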
 
Protein sequence
MTPEQQQHQL KELIAKGKMQ GYLTFSEVND HLPSDIVDPE QMEDIICMIN DMGIEVHEDV 
PDNEDLIQDT AVVAESDEDE AAEVAAALAS SVDTELGRTT DPVRMYMREM GTVELLTREG
ELSIAKRIEE GLNIAANELV RFPAAMEHFL KAFETVDNGE LRLSDLISDF FDPAAADTVA
TEDDAEAEII DEGDTGPDPE LVKEKLAAFR KLHKAAVTAH HKYGYGHEKT QKSFDALATA
FMEFKKTPQF FKELTDTLRD TVAKIREQER ALTDICVTKG KMPKAEFIES FSGSETDEGW
LDRVCDPAKP YAAAIKAHAE EIRRIQAKLA QIEKENRLDI ALIKDIHRRV SAGENQARLA
KREMIEANLR LVISIAKKYT NRGLQFLDLI QEGNIGLMKA VDKFEYRRGY KFSTYATWWI
RQAITRSIAD QARTIRIPVH MIETINKLNR ISRQMLQEMG REPTPDELAA RMDMPEDKVR
KVMKIAKEPI SMETPIGDDD DSHLGDFIED SRVLSPVEHA TVASLRETTQ QVLAGLTARE
AKVLRMRFGI DMNTDHTLEE VGKQFDVTRE RIRQIEAKAL RKLRHPSRSE QLRSFLDIE
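
The protein sequence is the translation of the gene sequence under translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the start codons it permits, so a standard codon table is enough for a consistency check. The Python sketch below translates the first line of the gene sequence and compares it with the start of the protein sequence; the helper names are illustrative, and the full blocks can be pasted in the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(raw.split()).upper()


def translate(raw_dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(raw_dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the matching start of the protein.
gene_start = "ATGACGCCAG AACAGCAGCA GCACCAGCTT AAGGAACTTA TTGCTAAGGG CAAGATGCAG"
protein_start = "MTPEQQQHQL KELIAKGKMQ"
print(translate(gene_start) == clean(protein_start))  # True
```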