Gene MCA1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1889 
Symbol 
ID3104013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2031004 
End bp2032113 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content61% 
IMG OID637171046 
Producthypothetical protein 
Protein accessionYP_114324 
Protein GI53803792 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCA AACCACCGAC GACCGCCCCG AACCCCGGCG CCCCGACACC CGCACCGGGC 
GAAATCCCGT TCCTGCTCTA CACCGCCCAG GATGAAAGCG TCAAAGTGCG TGTGCTGGTG
CAGGCCGAAA CGGTCTGGCT CACACAGCGC CAGATGGCCG AACTATTCGA CAAGGACGTG
CGCACCATCA ACGAGCACAT CCGCAACATC TATGAAGAAG GCGAACTCAC CGAAGCGGCA
ACTATCCGGA ATTTCCGGAT AGTTCAGACG GAGGGGGCGC GGCAGGTGAC GCGCGAGGTT
GCCCATTACA ACCTGGACGT GATCATCTCG GTCGGCTACC GGGTCAAGTC TCATCGCGGT
ACCCAGTTCC GCATCTGGGC CACCGGCGTG CTCAAGGAGT ACATCAAAAA AGGCTTCGTC
CTCGACGACG AGCGCTTGAA GCAGGGTAAG CAGGTATTCG GCGAAGACTA CTTCCGCGAG
CTTTTGGAGC GGGTGCGCTC CATCCGCGCC AGCGAGCGGC GCATCTGGCA GCAGATCACC
GACATCTTCG CCGAGTGCAG CATCGATTAC GACCCGAAAA GCGAAATTAC CCAGGACTTT
TTCGCCACGG TGCAGAACAA GTTCCACTAC GCCATCACCG GGCAGACCGC CGCCGAGATC
ATCCACGCCA AGGCCGACCG CGCCGCGCCC AACATGGGGC TCACCACTTG GAAAAATGCC
CCTTCCGGGC GCATCCTGCC CTCGGATGTG ACCGTCGCCA AGAATTACCT CGACGAGCCC
GAGATCAAGC GCCTGGAACG CAGCGTCTCG GGCTTTTTCG ACTACATCGA AAACCTGCTC
GAAAACCGGC GTCTGTTCAA CATGGCCGAG TTCGTCGCCG CCGTGGACAA GTTCCTCGCC
TTCAACGAAT ACCGCGTGCT CGAAGGCCGC GGGCGGGTGA GCAAAAAGCA GGCGGACGAG
AAGGCGCTGG CCGAATACGC CGAGTTCAAC AAGACGCAGC GGATCGAGTC GGATTTTGAT
CGGTTTGTGA GGGAGCGCTA TGCCGAGTTC GACGCGCGGC GGCGAGAGAT GGAGCGCGCT
TTGGAAGGCA AGGGGGGCAA AGATGCGTGA
 
Protein sequence
MSRKPPTTAP NPGAPTPAPG EIPFLLYTAQ DESVKVRVLV QAETVWLTQR QMAELFDKDV 
RTINEHIRNI YEEGELTEAA TIRNFRIVQT EGARQVTREV AHYNLDVIIS VGYRVKSHRG
TQFRIWATGV LKEYIKKGFV LDDERLKQGK QVFGEDYFRE LLERVRSIRA SERRIWQQIT
DIFAECSIDY DPKSEITQDF FATVQNKFHY AITGQTAAEI IHAKADRAAP NMGLTTWKNA
PSGRILPSDV TVAKNYLDEP EIKRLERSVS GFFDYIENLL ENRRLFNMAE FVAAVDKFLA
FNEYRVLEGR GRVSKKQADE KALAEYAEFN KTQRIESDFD RFVRERYAEF DARRREMERA
LEGKGGKDA