Gene MCA1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1110 
SymbolmdoD 
ID3103343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1164796 
End bp1166376 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content64% 
IMG OID637170295 
Productglucan biosynthesis protein D 
Protein accessionYP_113580 
Protein GI53804541 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA GCCGCCGCCA TTTCCTCCAA CTCGCCGTCG CAGTGAATGC GCTTTCACTG 
TTCAAAGCCG GCATGGTTAA TGCCGAACCC GGAATCGGAC TGAAATTCGG CCCCCCCTCC
CCCTTCTCGT TCGAGACGCT CAAGACGTTG GCGCGGGAAC GGGCCGGCCG GGAATACGCC
CCGCCGCCCC AACCCGATCC AGCGATCGTC AAGCAGATCG ACTACGACGC CCACGGCAAA
CTGCAGTACC GCAAGGAGGC CGCGCTCTGG GCCGAAAGCG GCGGCAGCTA CCCGATTTCC
TTCCAGCACG TGGGCATGTT CTTCCCCAAA ACGGTGATCA TGAACGTCGT CGAGAAAGGC
ACGGCGCGGG AAATCCTGTA CGATCCCCGT CTCTTCACCA TGGGACCGGA TCACGTGGCC
AAGGGTCTTC CGGCCAGCCC TTCGGCCTTC GCCGGGTTCT GGGTACACGA GAGCCGCAAG
GGTTCCGACT GGAAGACGCG GGAACCCTGG GTCACTTTCC TGGGAGCCTC GTACTTTCGC
GCTATCGGCG AACTGGGCCA GGTCGGCCTG TCGGCACGCG GCATCGCGCT GAATCCCGGC
ACTTCCAACC CGGAGGAATT TCCGGATTTC GTTTCCTTCT GGTTCGAACC CGCCGCGAAA
ACCGACGATC CGGTGACCGT TTACGCCCTG CTCGATGGCC CCAGCCTAAC CGGCGCCTAT
CGCTTCTTGC TGCGGCGGAC CCGGGGCGTG GTCATGGAGA TCGAAGCTGC GCTGTTCCTG
CGCAAGGACA TCGAACGTCT GGGGATCGCG CCGCTGACGT CCATGTACTG GTTCTCCGAA
ACCGCCAAAC CCACCGGCGT GGACTGGCGG CCGGAAGTGC ACGACTCCGA CGGACTGGCG
CTGTGGACCG GCGTCGGTGA ACACATCTGG CGGCCGATCA ACAATCCGTC GCACATCATG
GTATCGAGCT TTGCCGACAA GTCACCGAAA GGCTTTGGGC TCAGCCAGCG CGACCGGGTG
CTCGACCACT ACCAGGACGG CGTGCGCTAC CACTTGCGTC CCTCGGCCTG GGTGGAACCG
CTCGGTGACT GGGGCGAGGG TGCGGTGCAG CTCACTGAAA TCCCCACCGA CGACGAAATC
CACGACAACA TCGTGGCAAT GTGGGTGCCG AAGGAGCCCG CCACGGCGGG TAAGACCTAT
GACCTGCGCT ACCGCATCCA TTGGCTGGCG GACGAAGCAT TCCCCAGCCC GCTGGCGCGC
TGCGTGGCAA CCCGTCTCGG CAACGGCGGC CAGCCCGGCA AACCGCGCCC CAAGGGCGTG
CGCAAATTCA TGGTCGAATT CCTGGGCAAG CCGCTGGCGA AGCTTCCTTA CGGCGAGAAA
CCGGAACCCG TGATTACCGC GACCCGGGGC GAATTGTCGC GAATCGAGAT CGAAGCCGTG
CCGGACGACG TCCCGGGCCA TTGGCGTACC CATTTCGACC TGGCGGTGAC GGGGCAGGAT
CCGGTCGAAA TCCGCTGCTA TCTGCGCCAC AAAGACGAGG TGATGTCCGA GACCTGGCTA
TACCAGTACC ACCCGTTCTG A
 
Protein sequence
MSLSRRHFLQ LAVAVNALSL FKAGMVNAEP GIGLKFGPPS PFSFETLKTL ARERAGREYA 
PPPQPDPAIV KQIDYDAHGK LQYRKEAALW AESGGSYPIS FQHVGMFFPK TVIMNVVEKG
TAREILYDPR LFTMGPDHVA KGLPASPSAF AGFWVHESRK GSDWKTREPW VTFLGASYFR
AIGELGQVGL SARGIALNPG TSNPEEFPDF VSFWFEPAAK TDDPVTVYAL LDGPSLTGAY
RFLLRRTRGV VMEIEAALFL RKDIERLGIA PLTSMYWFSE TAKPTGVDWR PEVHDSDGLA
LWTGVGEHIW RPINNPSHIM VSSFADKSPK GFGLSQRDRV LDHYQDGVRY HLRPSAWVEP
LGDWGEGAVQ LTEIPTDDEI HDNIVAMWVP KEPATAGKTY DLRYRIHWLA DEAFPSPLAR
CVATRLGNGG QPGKPRPKGV RKFMVEFLGK PLAKLPYGEK PEPVITATRG ELSRIEIEAV
PDDVPGHWRT HFDLAVTGQD PVEIRCYLRH KDEVMSETWL YQYHPF