Gene MCA0339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0339 
Symbol 
ID3104742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp338735 
End bp340015 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID637169559 
Producthypothetical protein 
Protein accessionYP_112871 
Protein GI53802521 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.709733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGCC TGCTGCGCGA GGAGCTGAGC CAGCTCAGGG CAGAGATCGG CCGTGCATCC 
CGCGACAGCC GCGAGGAGAT CGGGGGGGCG ATCGGCCGGT TCCAGATGGC GCAGCAGGAT
CAGGCCGAAA GGATGCGGCT GCTGATGGAC GAAAAGCTCG TGCGGGTCCA GGCCGATGCC
CGGGATGGAC GGGAGGAGAT GAGCCTGGGA CTGAAGCGCT TCGGCGAAGT GCAGAAGGAA
CAGCTCGACA GCGTCGGCCT GCTGCTGAAA AACCAGCTCG AAACCCTCGC CGGGACGAAC
GAGAAGAGCA TCGAACGTCT GCGCCTCACC ATTGAGGAAA GGCTGCAGGC GCTGCAGCAC
GACAACGGCC AGAAGCTGGA GCAGATGCGC CAGACCGTCG ACGAGAAGCT CCACAATACC
CTGGAGCAGC GGCTGGGCGA GTCGTTCAAG CTGGTCAGCG AGCGCCTGGA GCAGGTACAC
AAGGGCTTGG GGGAGATGCA GTCTCTCGCC TCGGGGGTGG GGGACCTGAA GCGGGTGCTC
ACCAATGTCA AGACCCGCGG TACTTGGGGC GAGGTGCAGC TCGACGCCCT GCTGGAACAG
ATATTGACTC CGGAGCAGTA CGGGAAGAAC GTGCCGACCC GACCCAACGG GGCGGAACGG
GTCGAGTTCG CGATCCGGCT GCCTGGACGG GATTCGGGCG ATGCGCCGGT GTGGCTGCCG
ATCGACGCCA AATTCCCCAT CGAGGATTAC CAGAAGTTGC TGGATGCGCA GGACGGCGCG
GATCACGCCG GCATCGAAGC GAGCGGCAAG GCGCTGGAAC AACGCCTGAA GAACGAGGCC
CGGACGATCC GCGACAAATA CGTGGAGCCG CCGCACACCA CCGACTTCGC CATCCTCTAC
CTGCCGATCG AGGGGCTGTA CGCCGAGGCC CTGCGCCGGC CCGGGCTGGC CGAAACCCTG
CAGCGCGACT ACCGGGTGAC CCTGGCCGGC CCGACCACCC TGGCCGCCAT GCTGAACAGC
CTGCAGATGG GTTTCCGGAC CCTGGCGATC GAACGGCGCT CGTCGGAGGT ATGGACCCTC
CTGGGAGCGG TCAAGACCGA ATTCACCAAA TTCGGCGATG CCCTGGCCTA TACCCGCAAG
AAGCTGGAGG AGGCCACCAG CTCCATCGAC AAGGCTGAAA CCCGCACCCG CGTGCTGACC
CGCAAGCTCA AGGAGGTCGA GGCCATCCCG GCGCAGGAGG CGGGGTGGTT GCTGCCCTCG
ACGGCGGAGG CGGACGACTG A
 
Protein sequence
MERLLREELS QLRAEIGRAS RDSREEIGGA IGRFQMAQQD QAERMRLLMD EKLVRVQADA 
RDGREEMSLG LKRFGEVQKE QLDSVGLLLK NQLETLAGTN EKSIERLRLT IEERLQALQH
DNGQKLEQMR QTVDEKLHNT LEQRLGESFK LVSERLEQVH KGLGEMQSLA SGVGDLKRVL
TNVKTRGTWG EVQLDALLEQ ILTPEQYGKN VPTRPNGAER VEFAIRLPGR DSGDAPVWLP
IDAKFPIEDY QKLLDAQDGA DHAGIEASGK ALEQRLKNEA RTIRDKYVEP PHTTDFAILY
LPIEGLYAEA LRRPGLAETL QRDYRVTLAG PTTLAAMLNS LQMGFRTLAI ERRSSEVWTL
LGAVKTEFTK FGDALAYTRK KLEEATSSID KAETRTRVLT RKLKEVEAIP AQEAGWLLPS
TAEADD