Gene MCA2126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2126 
Symbol 
ID3103459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2287047 
End bp2288147 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID637171276 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_114552 
Protein GI53803802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTGC TGCACGTCGA AGGCGGCAGA AACCTCTACG GCGGCGCCCG CCAGGTGCTG 
TACCTGCTGG AAGGGCTCGA GCAGCGCGGG ATCGACAACG TACTGGTCTG CCCGGCCGGC
AGCGAACTCG CCCGGGAGGC CGCCGCCCAT GCCGAGGTGC ATGCCATTCC GATGTCCGGC
GACCTCGATT TCCGCCTCAT CGGCCGGCTT TACCGGATCA TCGGGCGGGT CCGGCCGGAC
CTCGCGCACC TGCACAGCCG GATCGGGGCG GACGTCATGG GCGGCATCGC CGCGCGTCTG
GCCGGCGTGC CGGTGGTTCA TTCCAGGCGT CAGGACAACC CCGAGATGCG CTGGGCCGTC
GCCGTGAAAT ACCGTCTGCA TGACCGGGTG GTCGCGATTT CCGAAGGCAT CGCGCGGGTA
CTCGCCTCGG AAGGTCTGCC GGCGGCGAAA TTGCGCGTCG TGCGCAGCGC CATCGATCCG
GCCCCTTTCC TCCAGCCCGG CGACCGCCCC GGGTTCCGCA CCGAATTTGG CCTGCCCGAG
GACTGCACGG TGATCGGCGT GATCGCCCAG CTCATCGAAC GCAAGGGCCA TCGCTTTCTG
CTCGAAGCCC TGCCCGAACT GACCGGGCGC TATCCGGGCC TGCACGTCCT CCTGTTCGGC
AAGGGCCCGC TGGAATCTTC CCTGATCGAA ACCGTACGCC ACCTCGGCTT GGCGGACCGC
GTCCATTTCG CCGGCTTCCG GGACGATCTG CCGCGCATCC TGCCCTGCCT GGACCTGGTG
GTACATCCGG CCCTGCGCGA AGGCCTGGGC ATCTCACTGC TGCAGGCCGC CGCGGCCGGC
GTCCCCATCG TGGCCTCGCG CGCCGGTGGG ATTCCCGAAG CCGTGCGCGA CGGCGACAAT
GGACTGCTCG TCCCACCGGG CGATGCCGCG GCCCTGGCGG CCGCCATCCG CCGCCTGCTC
GACGATCGGG ACCTGGCGCG GGACATGGGC CAGCGCGGCC GGGCGCTGAT CGGCCGTGAG
TTCTCGGTCG AGGGCATGGT CGAAGGAAAC CTGGCAGTCT ACCGGGAACT GCTGGCGGAG
AAAGGTAGCC CGCTCAGCTG A
 
Protein sequence
MKVLHVEGGR NLYGGARQVL YLLEGLEQRG IDNVLVCPAG SELAREAAAH AEVHAIPMSG 
DLDFRLIGRL YRIIGRVRPD LAHLHSRIGA DVMGGIAARL AGVPVVHSRR QDNPEMRWAV
AVKYRLHDRV VAISEGIARV LASEGLPAAK LRVVRSAIDP APFLQPGDRP GFRTEFGLPE
DCTVIGVIAQ LIERKGHRFL LEALPELTGR YPGLHVLLFG KGPLESSLIE TVRHLGLADR
VHFAGFRDDL PRILPCLDLV VHPALREGLG ISLLQAAAAG VPIVASRAGG IPEAVRDGDN
GLLVPPGDAA ALAAAIRRLL DDRDLARDMG QRGRALIGRE FSVEGMVEGN LAVYRELLAE
KGSPLS