Gene MCA3010 details


Gene Information

Locus tag: MCA3010
Symbol:
ID: 3104552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 3192837
End bp: 3194108
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 66%
IMG OID: 637172136
Product: hypothetical protein
Protein accession: YP_115398
Protein GI: 53802901
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
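The coordinate and length fields above are consistent with one another, and the relationship can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length only matches under that convention) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3192837, 3194108)
print(length, protein_length(length))  # prints "1272 423"
```

The result agrees with the Gene Length (1272 bp) and Protein Length (423 aa) fields of the record.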


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGGCA CACGTGCGGA ACTGATCGAG AAGATTCGCC TGGGCGAGGA CAGCTTCCTG 
GAACTGGAGG AGGTCAAGTT CGCCGGTGGC AAGCTGCGCG GCCCTGCGCA GGAGGATCTG
GCGGACGAAC TGGCCGCCTT TGCCAACAGC GCCGGCGGGG TGCTGCTGCT CGGCGTGGAA
GACCGCGCCC GCGAGGTGCT GGGCATTCCG CTGGAGCATC TGGATGCGGT GGAAGCGCGG
GTGCGGCAGG CCTGCGAGGA TTCGGTGAAA CCGCCTTTGG CGCCGGTCAT CGAGCGCATG
ACGCTGCCCG ATTCGGCGGG GGCGGAGCAG CCGGTGCTGC GGGTGGAAGT GGCGCGCAGT
CTGTTCGTGC ACCAGAGCCA GGGAGGGTAC TTCCATCGCG TGGGTTCCTC GAAACGCCCC
ATGCCCCCGG ATCATCTGGC GCGTCTGTTC CAGCAGCGCA GCCAGTCGCG GCTGATCCGC
TTCGACGAGA CCCCCGTGTC GCGCGCCACG CTCACCGATC TGGACGAGGT GCTGTGGCGG
CGTTTCGCGC CGGCCCAAAG CGCTGATGCA CCGGAAGTAC TGCTCGGCAA GCTGGCCATG
GCAGCCCAAG ACGAGCAGGA TATTTGGCGG CCGACCGTGG CGGGCCTGCT GATGGCCAGC
CGCGAACCGC ACCGCTATCT GCCCGGTGCC TTGATTCAGG CCGTCGCCTA TGCGGGCACC
ACCGTCGTTC CCGAGGGAGA ACTGGTCTAT CAGCGCGATG CTCAGGACAT CACCGGGCCG
CTGGATGAAC AGGTTCGTAT GGCCTGCGCC TTCGTGCGCA AGAACATGCA GGTGGCCGCG
TTCAAGCGTG CCGACGGCGG ACGGGTGGAC ATGCCCCAGT ACGACATGAC CGCCGTGTTC
GAGGCGGTGG TCAACGCGGT GGCGCACCGC GACTATTCGA TGGCAGGAGC CAAGGTACGA
CTGCGGTTGT TTGCCGATCG GCTGGAGCTG TATTCGCCGG GCATGCTGCC CAACACCATG
ACCCCAGAGA GCCTGCCATT CCGGCAGGCC GCCCGCAATG AAGCCCTGAC CAGCCTCTTG
GCGCGCTGCC CGGTGGGGGA TGACGAACTG GCGCTGTATC GCAGCCACCT CATGGACAAG
CGTGGGGAAG GTGTGTCGAT CATCCTCGCG CGCAGCGAAG CTCTCTCGGG TAAGCGCCCC
GAATACCAGA TGAACGATGA CAGCGAGCTG GTTTTGACGA TTTATGCCGC TCGCACGGAG
CAAGAAGCAT GA
 
Protein sequence
MLGTRAELIE KIRLGEDSFL ELEEVKFAGG KLRGPAQEDL ADELAAFANS AGGVLLLGVE 
DRAREVLGIP LEHLDAVEAR VRQACEDSVK PPLAPVIERM TLPDSAGAEQ PVLRVEVARS
LFVHQSQGGY FHRVGSSKRP MPPDHLARLF QQRSQSRLIR FDETPVSRAT LTDLDEVLWR
RFAPAQSADA PEVLLGKLAM AAQDEQDIWR PTVAGLLMAS REPHRYLPGA LIQAVAYAGT
TVVPEGELVY QRDAQDITGP LDEQVRMACA FVRKNMQVAA FKRADGGRVD MPQYDMTAVF
EAVVNAVAHR DYSMAGAKVR LRLFADRLEL YSPGMLPNTM TPESLPFRQA ARNEALTSLL
ARCPVGDDEL ALYRSHLMDK RGEGVSIILA RSEALSGKRP EYQMNDDSEL VLTIYAARTE
QEA
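
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a rough sketch of both steps in plain Python, not the tool that produced this record. It assumes the sequence is pasted in with the spaces and line breaks shown above, which `clean` strips. The codon-to-amino-acid assignments in table 11 are the same as in the standard code; the table differs in which codons may act as start codons. This gene begins with ATG, so the sketch makes no special case for alternative starts. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCTGGGCA CACGTGCGGA ACTGATCGAG"))  # prints "MLGTRAELIE"
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison against the listed protein sequence and GC content field.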