Gene MCA1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1437 
Symbol 
ID3103375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1528353 
End bp1529669 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content64% 
IMG OID637170612 
Productsugar transferase/glycosyl transferase, WecB/TagA/CpsF family protein 
Protein accessionYP_113894 
Protein GI53804210 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1922] Teichoic acid biosynthesis proteins
[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.211028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGTC ACACGGCGAT TCCCGGACCC CTGGGAAAAC GCCCCCTTTG GCACTGGCGT 
CTGTGGCTGC TCCCGCTGCT GACCCTGGTC ACCAGTCTGC TGCCACGGAT CGTCGATTGC
ATCGTGGCGG GAGCGTTGCT GATTCTGCTG GCACCGCTGC TGCTGCTTCG CGCGCTCGTC
GCCAAACTGC GCGCCGGACG GGTCTTCGCC GCCACGGAGA AAGTCGGCCG TTTCCAGGTG
CCGTTCCGGC AGCTCGCCTT TGCCGACGAT GCCCCTGCCC GCGACTTGGC TGTCCTGCTG
AATGTATTGT GGGGAGACAT GGCTTTCGCC GGACCGCGTC CGTTGAGTCC CGAGGAGGCC
GCGGCGGTGC CGGCGTCCCA GAGCATCCGT TTCCGCCTGC GGCCGGGGAT TTTCTCGCCC
TACACGTTGC GATCCAGGAT CGGGATCGCC TATGAGGGCG AAGGGGAGCT GGACCGCGAG
TTCTATTACA CCGAAACGGC GGCGGGCAAT CTCGGCCTGA TGTTGCGCAC TGGCGTCGGC
GGCTTGGTCG CCGGCGATGC CATACGGCCG GCGCCGGACC AGCTGGAGTT CTTCGGTGTC
GCCATCGCCA ATACCACCAT GAGCGAGGCC ATCGACTGGA TCGTCCGCCG CGTCCGTGAA
GACCGTCCTG CCACCATCGC TTTCGTCAAC CCGGACTGCC TGAACATCGC CTATGGCAAC
CCCGAGTACC GGGAGGTCCT GAGCAAGGTC GAACGGGTAC TGCCGGATGG CATCGGCATC
CGCTTCGGTT GCCGTATTCT GGGCACCAGC CTGCGCGCCA ACGTCAACGG CACCGATATG
TTTCCCCGTC TGTGCGAACG TTGTGCGCAT GAAAATCTGT CGCTGTTCCT GCTGGGTGCC
CGGCCGGGTA TAGCCAGGCA GGCCGCCGAA AACATGCTGC AGCGTTATCC GAACCTCAAG
ATCGCCGGCA GCTGCGACGG CTATTTCGCG CCGGACGCCG AAAACGAGAT CATCGAGACC
ATCAACCGCT CCGGAGCGGA CATCCTGCTG GTGGCTTTCG GCGTTCCCAA ACAGGAGCTC
TGGCTTTGGA AACACCGGTC CCGGCTCAAG CCACGCGTTG CCATGGGGGT GGGCGGACTG
TTCGACTTTT ACTCGGGCCG GATTCCCCGG GCTCCGCTTT GGCTGAGGGA AATCGGGCTG
GAATGGGGCT GGCGACTCTT GCAGGAGCCG GGACGGATGT GGCGGCGCTA CATCATCGGC
AACCCCCTTT TCCTCTATCG TGTATGGCGG CAGAAAATTG GAAAGCTCAC CCTCTGA
 
Protein sequence
MARHTAIPGP LGKRPLWHWR LWLLPLLTLV TSLLPRIVDC IVAGALLILL APLLLLRALV 
AKLRAGRVFA ATEKVGRFQV PFRQLAFADD APARDLAVLL NVLWGDMAFA GPRPLSPEEA
AAVPASQSIR FRLRPGIFSP YTLRSRIGIA YEGEGELDRE FYYTETAAGN LGLMLRTGVG
GLVAGDAIRP APDQLEFFGV AIANTTMSEA IDWIVRRVRE DRPATIAFVN PDCLNIAYGN
PEYREVLSKV ERVLPDGIGI RFGCRILGTS LRANVNGTDM FPRLCERCAH ENLSLFLLGA
RPGIARQAAE NMLQRYPNLK IAGSCDGYFA PDAENEIIET INRSGADILL VAFGVPKQEL
WLWKHRSRLK PRVAMGVGGL FDFYSGRIPR APLWLREIGL EWGWRLLQEP GRMWRRYIIG
NPLFLYRVWR QKIGKLTL