Gene MCA1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1102 
Symbol 
ID3103939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1157365 
End bp1158564 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID637170287 
Productsigma-54 dependent transcriptional regulator 
Protein accessionYP_113572 
Protein GI53804564 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCTACT GGAGTTCGGC TGCCGAGCAG CTGACGGGCC ACACGGGTGA ATTTGCGACC 
GGCGAATCCA TCGCCCGGCT GCTGCCCGGC TATGGCCCCC CTCGAGCTGA CGACGGCAGA
GGCGAGGACG AAATCCGGGT CGAGCGGGCG GATGGCTCGG CGCTGCTGTT GCGACGCCGG
TACCATCCCC TCCGGGACCG GGAAGGGGCC GCCCGGGGCC GCCTGGAAGT ACTCACCGCG
ATCGAAACCA TCGGTACCCC TCTGGCGGAA GCCGATGTCG AGATCTTGCA TGGACTGCTC
ACGCGCGAAC CGGCGATGAA ACAGGTGTTC CGTCTCATCC GCAATGTCGC GGAAACCGAG
GCCACGGTAT TGGTCCGCGG CGAATCGGGC ACCGGCAAAG AACTGGTCGC CCGCGCCGTG
CATGCCGAGA GTCACCGCGC CAGGGAGCCG TTTCTCGCGG TCAATTGTGC CGCGCTGGCA
CCCTCGCTCC TGGAAAGCGA ACTGTTCGGG CATGTCCGCG GCGCGTTTAC CGGCGCCGTG
CGCAGCCATG CCGGGCTGTT CCAGCGCGCC GACGGCGGCA CCTTGTTCCT CGATGAAGTC
GCCGAACTGC CGCTTGAGCT GCAGGCCAAG CTGCTGCGCG TGCTTCAGGA ACGAAGCTTC
GTTCCGGTCG GTGGCGACCG CATGCTCAGC GTGGATGTAA GGATCGTGGC AGCGACCCAT
CGCTCGCTGC GGGATGCGGT ACGCGAGGGC CGCTTCCGCG AGGATCTGAT GTACCGTCTG
CGGGTAGTCC CCCTCTTCCT GCCACCCTTG CGCGAGCGCC GCCGCGACAT CGGCCTGCTG
CTCTGGCATT TCATCGAACG GCACAATGCC CGCGGCCTGC GCCGCATCGT CCGCATCGAC
CCCGATGCCA TGCGCCGGCT GCTCGACCAT GCCTGGCCGG GTAATGTGCG CGAATTGCAG
AACGTGGTGG AATACGCCTT CGCCGTGGGG CGGGGGGAGG TGCTGGAACT CGATGATCTG
CCGCCCGAAT TCCGCGACGA GCGCGCCGGG CCGCAAACGT CCATGGAAGC CGTGCCGGAT
GAGGCCGATC GCATCCGCAC GGCGCTGCGG CAATCCGGCG GGCGCATCGA CCAGGCCGCC
CGGCTGCTGC AGATGAGCCG GGCGACGCTA TGGCGCAAAA GGAAGAAGCT GGGCGTGTGA
 
Protein sequence
MSYWSSAAEQ LTGHTGEFAT GESIARLLPG YGPPRADDGR GEDEIRVERA DGSALLLRRR 
YHPLRDREGA ARGRLEVLTA IETIGTPLAE ADVEILHGLL TREPAMKQVF RLIRNVAETE
ATVLVRGESG TGKELVARAV HAESHRAREP FLAVNCAALA PSLLESELFG HVRGAFTGAV
RSHAGLFQRA DGGTLFLDEV AELPLELQAK LLRVLQERSF VPVGGDRMLS VDVRIVAATH
RSLRDAVREG RFREDLMYRL RVVPLFLPPL RERRRDIGLL LWHFIERHNA RGLRRIVRID
PDAMRRLLDH AWPGNVRELQ NVVEYAFAVG RGEVLELDDL PPEFRDERAG PQTSMEAVPD
EADRIRTALR QSGGRIDQAA RLLQMSRATL WRKRKKLGV