Gene MCA2099 details


Gene Information

Locus tag: MCA2099
Symbol: pepA
ID: 3104100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2257728
End bp: 2259221
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 64%
IMG OID: 637171253
Product: aminopeptidase A/I
Protein accession: YP_114529
Protein GI: 53803849
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
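
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2257728, 2259221)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both values match the Gene Length and Protein Length fields of this record.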


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.41187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTATT CGACAAGAAC TGACACGCTG GAGCGGCTTT CGACCGATTG TCTGATCGTG 
GGCGTGTTCC AGAAGCGTAA ACTCGCACCC ACGGCCGAGG CGCTGGACGC GCTGTTCGAC
GGGCTGCTGG CCAGGCTGCT CAAGCGCGAT GACGTGGAAG GCAAGGCCGG CGATACCCTG
CTGGTCAACC ATGTGCCGGG CGGCCGGATC GACCGGGTGC TGCTGGTCGG GCTGGGGAAA
CGTGAGGAGC TGAACGTCGC CGCCTATCGT AAAAGTCTGG CGGCGGCCTT CAAGGTGTTG
AAGGAATCCG GCGCCAAGCA TGCGGTGTCG GCTCTGCACG AGGTGGAGGT TGGCCAGCGG
GGGGCGGACT GGAAGATCCG TCAGGCCATC GAGCTGCTGG AAAGCGGGCT CTACCGCTTC
CAGGAGATGA AGGGGGCGTC AGCCGAAGAC CATCCGCCCC GCCTGTCCAG GCTGCAATTC
CTGTTGGCCT CCGATCAGGA TGCGGCCCCG GTCGAAACGG GAATCCGGGA GGGTCAGGCG
ATCGCCCACG GCATGACCCT GGCCCGCAAC CTGGGCAATC TTCCGGGGAA CGTCTGCACA
CCCGCTTACC TCGCCGAACA GGCGCTGAAG CTCGGCAAGG AATACAAGAA GCTGAAGGTT
TCGGTGCTGG AAGAGAGCGA CATGGAGGAA CTGGGTATGG GAGCCTTGCT GTCGGTGGCG
CGCGGCAGCC GCCAGCCGGC CAAGCTGATC GTCCTGGAAT ACCGTGGTGC CGCCGGCAAG
GCCAAGCCTT ATGTCCTCAT CGGCAAGGGT CTGACCTTCG ATGCGGGAGG CATTTCCCTG
AAGCCTGCCG CCAACATGGA CGAGATGAAA TACGACATGT GCGGGGGCGC CGGCGTCATC
GGCGCGATCC AGGCGGTGGC GGAGATGGGG CTGCCGTTGA ACGTGGTCGG TCTGGTGCCG
GCTTCCGAGA ACCTGCCGGA CGGCAATGCC AACAAGCCCG GCGACATCGT CAGGAGCATG
GCCGGCATCA CCATCGAGAT CCTCAATACC GACGCGGAAG GGCGCCTCAT CCTGTGTGAC
GCGCTGACCT ATGCCAAGCG TTTCGACCCC GTGGCGGTGA TCGACGTGGC GACCCTGACC
GGGGCTTGTA TCGTGGCGCT GGGGCGTCAT CCCAGCGGCC TGATGGGCAA TGACGACGCA
TTGTGCGAGC AGTTGACCCG GGCCGGCGAA ACCACCTGGG ACCGGGTCTG GCGCATGCCG
ATCTGGGACG ATTACCAGGA ACAGCTCAAG TCCAATTTCG CCGATGTCGC CAACATCGGT
GGGCCGGATG GCGGCAGCAT CACCGCCGCC TGCTTCCTTT CGCGGTTCGC CAAAGACTTC
AAATGGGCGC ATCTCGACAT CGCGGGGACG GCCTGGAAAA CGGGAGCCGA CAAGGGCGCT
ACCGGCCGTC CGGTGCCGCT CCTGGTGCAA TACCTCATCG ACCGGGCGGC ATGA
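
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGGAGTATT CGACAAGAAC TGACACGCTG GAGCGGCTTT CGACCGATTG TCTGATCGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the value in the record.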
 
Protein sequence
MEYSTRTDTL ERLSTDCLIV GVFQKRKLAP TAEALDALFD GLLARLLKRD DVEGKAGDTL 
LVNHVPGGRI DRVLLVGLGK REELNVAAYR KSLAAAFKVL KESGAKHAVS ALHEVEVGQR
GADWKIRQAI ELLESGLYRF QEMKGASAED HPPRLSRLQF LLASDQDAAP VETGIREGQA
IAHGMTLARN LGNLPGNVCT PAYLAEQALK LGKEYKKLKV SVLEESDMEE LGMGALLSVA
RGSRQPAKLI VLEYRGAAGK AKPYVLIGKG LTFDAGGISL KPAANMDEMK YDMCGGAGVI
GAIQAVAEMG LPLNVVGLVP ASENLPDGNA NKPGDIVRSM AGITIEILNT DAEGRLILCD
ALTYAKRFDP VAVIDVATLT GACIVALGRH PSGLMGNDDA LCEQLTRAGE TTWDRVWRMP
IWDDYQEQLK SNFADVANIG GPDGGSITAA CFLSRFAKDF KWAHLDIAGT AWKTGADKGA
TGRPVPLLVQ YLIDRAA
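
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs only in which codons may act as start codons). It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The `translate` helper is illustrative and is not a database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAGTATT CGACAAGAAC TGACACGCTG"))  # MEYSTRTDTL
```

The output matches the first block of the protein sequence listed above.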