Gene MCA0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0143 
SymbolmdoD 
ID3103503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp148339 
End bp149970 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content65% 
IMG OID637169368 
Productglucan biosynthesis protein D 
Protein accessionYP_112682 
Protein GI53802574 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCCT CGCCATTGCT TTCTATTTTC TTCCGTAGGG CTGAGTTACT CATGATAGGA 
AAGACGAGTC GACGCGATTT CATGCGCCTG GGCGCGAAGC TCGGGCTGGC CGCTCCCTTC
CTGGCGGGCG TGGGGCGGAG CTGGGCGGAA CCGAAAGGGC TCAAGTTCAG CCCATCCCTT
CCCTTCAGCT ACGAGATGTT GGTCAAGCGG GCGGAGGCGC TCGCCTCGCG CCCCTACTCG
CCGCCGCCGG CCGTCTCCGA GGTGGTGCGC AAACTGGATT ATGAGGCTTG GGGGCAGATC
CGGTTCCGGA CCGAGGACGC CTTGTTCGCC GAAGGGCCGT CCATTTATCC GGTCACGTTT
TTCCACCTCG GCCAGTTTTT CCAGAAACCC GTCAAGATTC ACGTCGTAGA GGATGGAAAA
GCGCGCCAGA TCTACTACAG CGCCGAGTAT TTCGACATGC CGGGTGACAG CCCGGCCCGG
CAAATGCCGG AACGATCGGG CTTCGCCGGT TTCCGTCTGC AAGAGGCCCG CACCCGCTCG
GACTGGCGCA CGCAGGATTG GATCGCCTAC CTGGGGGCGT CCTACTTCCG GGCGATCGGC
GCGCTCAACC AATACGGTCT GTCGGCGCGC GGCATCGTCA TCGACGCGGC CGAGCCGACG
CCTGAGGAAT TCCCCGATTT CACCGAGTTT TACATCGAGG GCGCCGCGGC GGAAACCGAT
CCGGTCATCA TCTGCGCGCT GCTGGACGGC CCCAGCGTCA CCGGAGCGTA CCGCTTCCTC
ACTTGGCGGA AAGAGGGGGT GGTGCAGGAG GTCGAAGCGG CGGTGTTCCT GCGCCGGAAT
GTCAAGCGGC TCGGTCTCGC GCCGTTGACC TCGATGTATT GGTTCAGCGA GTCGGAGAAG
CGGAGGTTGG AGGACTGGCG GCCCGAGGTC CACGATTCCG ACGGCCTGGC CATTTGGACC
GGTACGGGGG AGCGCATCTG GCGACCGCTG ATCAATCAGC CGTACGCGGT GACATCGAGT
TTCGTCGACC ACGATCCCAA GGGATTCGGT CTGCTCCAGC GCGACCGGGT GTTCGAAAAT
TATCTCGACG GCGTGAATTA CGAGCGGCGA CCCAGCCTGT GGGTCGAGCC GCTCGGCGGC
TGGGGAGAAG GGGCGGTGCA ACTGGTCGAG CTGCCCACGG ATGACGAGAT CCACGACAAC
ATCGGTGTTT ACTGGCGGCC GGCGGCGGCG CCCAAGGCCG GTTCATCTTA CCGCTTACGG
TATCGCTTGC ACTGGCAGGC CGACGAACCT TATCCGGCCG CCGTCGCCCG CTGCGTCGCG
ACCCGCATCG GTCGGGGCGG TCAGCCCGGC AAGCCCCGGC CTCGCGGAGT CTACAAGTTT
GCGGTGGAAT TTGCCGGCGC GGCGCTGGAG CCGTTGTGGG GCGACACCGT GAAGGCGGCG
CCGGTCGTCA CGGCTTCCAG CGGGGCGATT CAGGGGGCTT TCATCGAGCC GATCCCTCAT
ACCCGCCGTT GGCGGGCGAT TTTCGACCTG ATCCCCGACG GTACCGCTCC GGTCGAACTG
CGCCTCTACA TCCAGGGCAA CGGCGATGCC CTCACCGAAA CCTGGCTGTA CCAGTTCCGG
CCGCCGGCCT GA
 
Protein sequence
MSSSPLLSIF FRRAELLMIG KTSRRDFMRL GAKLGLAAPF LAGVGRSWAE PKGLKFSPSL 
PFSYEMLVKR AEALASRPYS PPPAVSEVVR KLDYEAWGQI RFRTEDALFA EGPSIYPVTF
FHLGQFFQKP VKIHVVEDGK ARQIYYSAEY FDMPGDSPAR QMPERSGFAG FRLQEARTRS
DWRTQDWIAY LGASYFRAIG ALNQYGLSAR GIVIDAAEPT PEEFPDFTEF YIEGAAAETD
PVIICALLDG PSVTGAYRFL TWRKEGVVQE VEAAVFLRRN VKRLGLAPLT SMYWFSESEK
RRLEDWRPEV HDSDGLAIWT GTGERIWRPL INQPYAVTSS FVDHDPKGFG LLQRDRVFEN
YLDGVNYERR PSLWVEPLGG WGEGAVQLVE LPTDDEIHDN IGVYWRPAAA PKAGSSYRLR
YRLHWQADEP YPAAVARCVA TRIGRGGQPG KPRPRGVYKF AVEFAGAALE PLWGDTVKAA
PVVTASSGAI QGAFIEPIPH TRRWRAIFDL IPDGTAPVEL RLYIQGNGDA LTETWLYQFR
PPA