Gene MCA2997 details


Gene Information

Locus tag: MCA2997
Symbol:
ID: 3103962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 3170363
End bp: 3171721
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 63%
IMG OID: 637172123
Product: aminotransferase, class I
Protein accession: YP_115385
Protein GI: 53802930
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
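
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the function and constant names are illustrative and not part of the record.

```python
# Values copied from the record above; all names here are illustrative.
START_BP = 3170363
END_BP = 3171721


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1359 452
```

Both results agree with the Gene Length (1359 bp) and Protein Length (452 aa) fields listed in the record.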


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.845231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCAGA GCGGCGCCGC AGGCGCGTTT GCTCCCGCGC TGCAAGCCAC GATGTTGTAT 
AGTCGGAAGC TCGCGAACGC TGATGGGCCG ACGGCTTCCG CTCTCGGCCG GGCCTTGGGC
GAACCCGCCT CGAATTTTTT CAAAACCATC CAGCCGAACC GGCTCGAGCA ACCACACCGG
ATTTTCATGA CACCTCGACT TTCCCGCCGA ACCGAACGCC TTACCAGTTC CCTCATCCGT
GACATCCTGC AGATCACTCA GCGGCCCGGG GTCATCTCTT TCGCCGGCGG CTTGCCGGCG
GAAGAGATGA TGCCGGAACT GGATTTCGGC GCCTGTGCCG CAGACTCCCG CCAATACGGT
CCCAGTGAAG GCGAGCCGGT GTTGCGGGAT TTGATTGCCC GAGGGCTCTC TGGCCTCGGG
CTTCGCTGTC AGACCGAACA GGTTCTGGTG ACGACGGGCT CCCAGCAGGG TATCGACCTG
GTCGGCAAGC TTTTCATCGA CGAAGGAACG CCGGTGTTGC TGGAATCGCC GACGTACCTC
GCCGCGCTCC AATGTTTCCG GGTCTATGGC GCGGAGTTTC ACGGCCTGCC CTTGCAGGTC
GGGGGCATCG ATCCGGACGC ACTGAAAGCG GCCATCGTCC GCCACAGACC CGCTTTCGTG
TATCTCATCC CCAGCTTCCA GAACCCGTCG GGATGCTGTT ACGCCGATGC GGCACGCCGC
GCCGTCGCGG CGGTGCTCGA TGAGACCGGT ACCCCTTTGG TGGAGGACGA CCCCTACCGG
GATTTGGTCT ATACGTCATG CGACCGGACG CCGGTCTGCG CTTATCTCGA AAGGGCGCCC
TGGGTCTATC TGGGCAGCTT TTCCAAAATA ACGGCGCCGG GACTGCGCGT CGGCTACCTC
GCATCGTCTC CCGGTCTGTT CCCGTGGCTC GTCCGCCTCA AGCAATCGAG CGACCTTCAC
ACCGGCCGCA CCGGTCAGGC CTGGCTGGCG CGCTTCCTCT CTTCCGGCGA TTTCGGGAAG
CATCTGGCGC ACATGAACGG CGTCTATGCC GGGCGGCGGG ATACGATGCA GGCTGCCCTG
GAGCGGCATT TCAGCGGCCT GGCGGAATGG TCGGCACCGG CCGGTGGACT GTTTTTCTGG
TTGCGGCTGG TAGGGAACAT CGACACTCTG GCTGCACTCA AGGTGGCATT GGGCCGCGAT
GTGGCATTCA TGCCGGGAGA ACCGTTCTTC CCGGTCGCGG ATCAGCGCTA TCCGGCTTTG
CGGTTAAACT TCAGTCATGC TACGCCGGAA AAGATCGAGA GGGGCATCGG CCTCCTGTCG
GAGGTGCTGA GCGAATGCGC CGCTCCTGCC GCCGGTTGA
 
Protein sequence
MSQSGAAGAF APALQATMLY SRKLANADGP TASALGRALG EPASNFFKTI QPNRLEQPHR 
IFMTPRLSRR TERLTSSLIR DILQITQRPG VISFAGGLPA EEMMPELDFG ACAADSRQYG
PSEGEPVLRD LIARGLSGLG LRCQTEQVLV TTGSQQGIDL VGKLFIDEGT PVLLESPTYL
AALQCFRVYG AEFHGLPLQV GGIDPDALKA AIVRHRPAFV YLIPSFQNPS GCCYADAARR
AVAAVLDETG TPLVEDDPYR DLVYTSCDRT PVCAYLERAP WVYLGSFSKI TAPGLRVGYL
ASSPGLFPWL VRLKQSSDLH TGRTGQAWLA RFLSSGDFGK HLAHMNGVYA GRRDTMQAAL
ERHFSGLAEW SAPAGGLFFW LRLVGNIDTL AALKVALGRD VAFMPGEPFF PVADQRYPAL
RLNFSHATPE KIERGIGLLS EVLSECAAPA AG
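
The relationship between the gene sequence and the protein sequence above, and the meaning of the GC content field, can be illustrated with a short sketch. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in which start codons are permitted), and that the sequences are given in blocks of 10 separated by whitespace as shown here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGTCGCAGA GCGGCGCCGC AGGCGCGTTT"
print(translate(first_bases))  # MSQSGAAGAF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be checked in the same way.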