Gene MCA1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1539 
Symbol 
ID3104919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1640804 
End bp1642597 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content69% 
IMG OID637170712 
Producthypothetical protein 
Protein accessionYP_113994 
Protein GI53804162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGTC TGCGCATCTG CCGTTCCCCG GCCTTCGCCT ATCCGCTTCC GCCCTCGGGC 
CCGACGGATG CGGCATGGCA CGCCCGGCTG TGCGAGGCGC TGCGGCGCCA TCTCTCGCCC
GCGGCCGCGG CCCTCCTGGC CGAGCCCGTG CCGAGCGAGG ATGGCGCATG GATCGAGTGG
TACACCTCGC TGGCGGGCCA GCCCATTCCA CTGACTTCTC TGTCCGGCGA GGCCTCACGG
CGTGCGCGCA ACCTGCTGGA GGACCGGCTC CAGGCCATGG TGGCGCTGGC AGAGCGGCTG
GCGGCCGCCG ATCCGGAACT GGCTGATGCG ATGCGCCGGG CGGCGAGTTT CCCGGACCAG
TCCGCGATCT ACGTGGTCGA CGGCCAACCG GTAATGACCT TCTGGGGCTA CGGTACCGCT
CCATGGCCCG AGCAGGCCGC TTCTTCGCCT TCAGGCCGGT CGTCGGCGAA GCTTTTCCGC
CGGCCGTGGC CGGTTCTGGC CGGCTTGATT TTGGCCGCCG GTCTGGGATG GGCCGGTTTT
CTCTTCGACC TGTGGCGCTG GCCACCCTGG GGGCCAGATC ATGCCGCCAT GCTCGCGGCC
GAACAGGAGA CCGGTCACGA TCTGCGGCGT GTCCTGCTCG CACGCCAGTC CGATCTGGCG
CGGGCCCTGG GGCGATGCGC GCTCGAGGCA CAGCGTGATG CTCTGGCGCG GGAGGGAGAG
CTGTTGGCGG GAAAACTGGA AGCGCTGACG GCCGAGCTGG CTGCAGGTCT GGATACCTGC
CAGGATGAGG CGGCGCTCGA CGCTTTGTCG AAGGAACGCG ACGCATTGAG CGGCCGGCTC
GCCGGTCTGC AGCGGGACCT GGAAGCCGCG CTGGAAAAAT GCGGCGGCGC CAGACTGAGC
GGCGAACGGG CCAAGTCGGC CAGGCTGCGC GCCAAGCTGA CCGCTCTGCA AGGCCAGCTC
GCCGTCAAAC TGAACCGCTG CGGCAAGAAG CCGCCCGGCG ACACTGCCGA AACGGGCTCG
GTGCCTGCCG TCTCGCCCAT TCCGGAACCC TCGCCCAGGC CGGCCCAGGC GGCGGCGAGT
CTGCCGCCCT GTCCCGGCGA GCGTCCGCCG GAGGATGCGC CGGACGTCGC GATCGTGCTG
GACGCTTCCG GCAGCATGCG CATTCCGTCT GTTCTGGACG GCAACGGCGC CCAGCTCATC
GCCCGCTTCG AGCGCTGCAT GATGGACAGC GGACTGCTCG CCCCGGTGGT CTGTGCCGAC
CTCATCGGCG CCTACGAATC GATCATCCAG GGCGGCCGCG GGCCGACCCG GCTGATGGCC
GCCCAGAAGG CCGTCAACAA CGTGGTCGGC GGTCTGCCCG GCGACGTCGA CGTCGGCCTG
GTGGTCCTCG AGGACTGCCC GCAGGCATCC GATTACGGTA TGTATGACGG CGCCCGCCGA
GGGCAGCTCC TGCAGCGGGT CAACGGCCTG ATTCCGCGCA AGGGCACGCC GCTGGGCAAC
GGCATCGTGC AGGCGGCGAA CAAGGTGGAT GGTGTGCGGG CGCCGGCGGT CATGGTCGTC
GTGTCCGACG GCAAGGACAG TTGTAACGCG GATCCCTGCG CCATCGCCGC GGCGGTCAAA
GCCGCCAAAC CCAAACTCAA GATCAACGTA GTGGACATCG TCGGTGACGG CGCCGTCGGC
TGCATCGCGA AGGCCACCGG TGGCGAGGTG CTCACCCCCC GATCGGGCAT GAGCCTGGAC
CAGATGGTCC GCCAGGCCGC CAAGGATGCC GAGAAGCCGG AACATTGCAA ATGA
 
Protein sequence
MSRLRICRSP AFAYPLPPSG PTDAAWHARL CEALRRHLSP AAAALLAEPV PSEDGAWIEW 
YTSLAGQPIP LTSLSGEASR RARNLLEDRL QAMVALAERL AAADPELADA MRRAASFPDQ
SAIYVVDGQP VMTFWGYGTA PWPEQAASSP SGRSSAKLFR RPWPVLAGLI LAAGLGWAGF
LFDLWRWPPW GPDHAAMLAA EQETGHDLRR VLLARQSDLA RALGRCALEA QRDALAREGE
LLAGKLEALT AELAAGLDTC QDEAALDALS KERDALSGRL AGLQRDLEAA LEKCGGARLS
GERAKSARLR AKLTALQGQL AVKLNRCGKK PPGDTAETGS VPAVSPIPEP SPRPAQAAAS
LPPCPGERPP EDAPDVAIVL DASGSMRIPS VLDGNGAQLI ARFERCMMDS GLLAPVVCAD
LIGAYESIIQ GGRGPTRLMA AQKAVNNVVG GLPGDVDVGL VVLEDCPQAS DYGMYDGARR
GQLLQRVNGL IPRKGTPLGN GIVQAANKVD GVRAPAVMVV VSDGKDSCNA DPCAIAAAVK
AAKPKLKINV VDIVGDGAVG CIAKATGGEV LTPRSGMSLD QMVRQAAKDA EKPEHCK