Gene MCA2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2101 
Symbol 
ID3102802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2260437 
End bp2261507 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content63% 
IMG OID637171255 
Producthypothetical protein 
Protein accessionYP_114531 
Protein GI53803847 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGC TGAACCGCTA CATCGGCTGG GAAGTCATCA AGGGGGCGGC ATTCGCTGCC 
CTGGTCCTCC TCGCCCTCCT GAATTTCTTC ACCTTCGCCG ACGAACTGCG CGACCTGGGT
GAGGGCAACT ACGGGCTGGG CAGCATCTTC CTCTATCTGA CGCTGACCTC GCCCCACAGC
CTGTACGAAC TCATCCCTTC CGGCGCACTG ATCGGCGGCC TGGTGGTGCT CGGCAACATG
GCAAACAACC ACGAATTGGT GGCGATGCAG GCCGCCGGCG TTTCCCGGGG TCGCATCGTC
TGGGCGGTCC TGCGGGCGGG CATCGTGATC TCGCTGATAT CGGTCGTCAT CAGCGAATAC
GTCATTCCGC CGGCGGAACG GGCCGCCCAG ATGCTCAAGG CCACCGCAAC CCGCCAACAG
GTCGCCTCCC AGACCAAGTA CGGAGTCTGG ATCCGGGACG GTAACGTTTA CGTCAACGTC
CGGGAAATCG AGAACCAGGA ACGCCTGGGC GACATCCACA TCTTCGAAAT ATCGCCGGAC
GGCCGCCCGG CCTTGGCCAT GCATGCCGCG CGCGCCAGTT TCGACCGCGG CATCTGGAAA
CTCGAGGACA TCGGCCTCAC CCGCTTCGAC CCCGCGGGGA ACGCCGCCAT CGCCGAACAC
AAGGAACAGG AGGATTGGTC CTCCGTCCTA TCCCCGGACA TGCTCGACGT GTTCATCGTC
CGCCCGGAAA ACCTGTCGGC ACAGGACCTC GCGAAGTACA TGGCCTATCA GACCGAAAAC
GCGCAGAAAT CGCTGGCCGT GGAGCAAGCC TTCTGGGGAC GCATGGTCAA CCCGCTCATC
ACGCTGGCCA TGCTCCTACT GGCCATCCCT TTCGTGTTCA ACGCCCGCCG TGACGTCAGC
AGCGGGCAAC GGATCGTGAT CGGCGTCACG ATCGGCCTCG GCTTTTACCT GACCAACAGA
ATGGTGTCCC ATCTGGGACT GGTCTACGAA GTGAATGCCC CACTGACGAT GGTAACACCT
CCCCTGGTCG TCCTCTTCGC CGCCCTCGCC GCCTTCAGAC GCCGCCCCTA G
 
Protein sequence
MNTLNRYIGW EVIKGAAFAA LVLLALLNFF TFADELRDLG EGNYGLGSIF LYLTLTSPHS 
LYELIPSGAL IGGLVVLGNM ANNHELVAMQ AAGVSRGRIV WAVLRAGIVI SLISVVISEY
VIPPAERAAQ MLKATATRQQ VASQTKYGVW IRDGNVYVNV REIENQERLG DIHIFEISPD
GRPALAMHAA RASFDRGIWK LEDIGLTRFD PAGNAAIAEH KEQEDWSSVL SPDMLDVFIV
RPENLSAQDL AKYMAYQTEN AQKSLAVEQA FWGRMVNPLI TLAMLLLAIP FVFNARRDVS
SGQRIVIGVT IGLGFYLTNR MVSHLGLVYE VNAPLTMVTP PLVVLFAALA AFRRRP