Gene MCA0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0741 
SymbolrpoN 
ID3103530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp780492 
End bp781955 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content60% 
IMG OID637169945 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_113244 
Protein GI53805053 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.111429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAT CACTGCAACT TCGGCTGGGG CAGCAACTGG CCATGACCCC CCAGCTGCAG 
CAGGCCATCA AGCTGCTGCA GATGTCCACG CTCGAATTGC AACAGGAAAT CCAGCAGGCC
CTGGATTCCA ACATGATGCT GGAAATCACC GACGAGGAAG AGCCCGTCCT GAACGCTGTG
ACCGAGGAGC CGCCGCTTGC CGCTGCCGAA CCCGCCTACG CCGAGCTCCC CGATCTCGAC
ACCCAGACCA CCATCCCCGA CGAACTGCCG GTCGACTCCT CCTGGGAGGA CGTTTTCGAC
GGCATGCACA ATTACACGCC CAGCAACGCG GCAGAGCCGG AGAACGAAGA TTTCCTGGGG
CAGCGGGGCA AGGGACAAAG TCTCCAGGAT TACCTGCTCT GGCAGATGGA GCTCACCCCC
TTCACCGAGC GCGACCATGC GATCGCAACC GCCATCATCG ATGCGGTGGA CGATGACGGC
TATCTGGATG CCACCGTGGA AGAGATCACC CAGGGGCTGA GCTCGCAACT CGAAAACCTC
GAACAGGACG AAGTCCGGGC GGTTCTGCAC CGCATCCAAA ACTTCGATCC ACCCGGCATC
GCTGCGGAAA ACCCGGCCGA CTGCCTGCGC ATCCAACTGC AGCAGATGCC CGAGAACACC
CCCTACCGCG CCCAGGCCCT CGAGCTGGTC CGCCATCACG TCGACCTGCT CGCGAAGAAG
GATCTCGTCA GGCTCAAGAA AGCGCTGGAG CTCGACGACG ATGAGCTGGC CGAAGTGATC
CGTCTGGTCC GGTCCCTCGA CCCGAAACCG GGCCGCGCGG TGGAACCGGA CGACTACCAG
TACATCATCC CGGACGTCTT CGTCTATCGG CAGGGAACCG AATGGGCCGT CGCCCTCAAC
CCTGAAATCG CTCCCAGACT GCGCGTCAAC CCCTATTACA GCGGCCTGAT TCGGCGAGCG
GACAGCAGCT CCGACAACGT GACCATGCGC AATCATCTGC AGGAAGCGCG CTGGTTCATC
AAGAGCCTGC AGAGCCGCAA CGAAACCCTG CTCAAAGTGG CGCGCGCCAT CGTAGACCGC
CAGCGCGAGT TTCTCGAGAT TGGTGAAACC GCGATGAAAC CCCTGGTGCT GCGTGACATT
GCCGAAGAAG TCTCCATGCA CGAATCGACG ATTTCGCGGG TAACGACCCA GAAATACATG
CACACGCCCA ATGGCATCTA CGAATTCAAG TATTTCTTTT CGAGCCACGT GTCCACGGAT
TCCGGCGGCG AATGCTCGGC CACTGCAATC AAGGCGTTCC TCAAGGAAAT CGTGAGCAAG
GAAGACGCGA CCCGTCCCTT GAGCGACCAT GCCATCGCCG GCATGCTGAA AGACAAGGGC
ATCAACGTTG CGCGGCGAAC CATCGCCAAA TACCGTGAAG CGATGGGCAT TCCACCGTCC
AACGAAAGAA AGCAGTTGTT CTAA
 
Protein sequence
MKQSLQLRLG QQLAMTPQLQ QAIKLLQMST LELQQEIQQA LDSNMMLEIT DEEEPVLNAV 
TEEPPLAAAE PAYAELPDLD TQTTIPDELP VDSSWEDVFD GMHNYTPSNA AEPENEDFLG
QRGKGQSLQD YLLWQMELTP FTERDHAIAT AIIDAVDDDG YLDATVEEIT QGLSSQLENL
EQDEVRAVLH RIQNFDPPGI AAENPADCLR IQLQQMPENT PYRAQALELV RHHVDLLAKK
DLVRLKKALE LDDDELAEVI RLVRSLDPKP GRAVEPDDYQ YIIPDVFVYR QGTEWAVALN
PEIAPRLRVN PYYSGLIRRA DSSSDNVTMR NHLQEARWFI KSLQSRNETL LKVARAIVDR
QREFLEIGET AMKPLVLRDI AEEVSMHEST ISRVTTQKYM HTPNGIYEFK YFFSSHVSTD
SGGECSATAI KAFLKEIVSK EDATRPLSDH AIAGMLKDKG INVARRTIAK YREAMGIPPS
NERKQLF