Gene MCA1434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1434 
Symbol 
ID3102768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1525682 
End bp1526872 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content68% 
IMG OID637170609 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_113891 
Protein GI53804213 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATCG ATACTTTCAT CCTGTTACCC GGTGTCGTGC TGAGCGCAGC GGTCCTGCCC 
GGCACTTTCG AGCTGGCCAT GCTGACGCTC GGCGGCGTTT TGCCCCGCCG GAAAACCGCC
GCCATCCGGG ACGCGGCTCC GCTGCGCTTC TGCATCGTGA CTCCGGCCCA CGACGAGGCC
GATGGGATCG CGGCCTGCCT GCGTAGCATC CAGGGCGCCG AGACGGGGCA TCACAGCGTG
ACGATCGTGG TCGTCGCCGA CAACTGCAGC GACGATACGG CCGCCCGTGC CGAAGCCGCC
GGCGCACGGG TGCTGGTCCG TGAAGACCCT GAACGGCGCG GGAAGGGGTA TGCGCTCGAT
CATGCCTTCT CGATCCTGCT GAAGGAGGAT CACGACGTGT TCGTGGTCGT CGATGCCGAC
ACCCGGGTGG AGCCGAATTT TCTCGGCGAA CTCGCGGTGC TGTTCCAGGC CGGCGCCGAC
GCGGCGCAGA CGCGTTACCG CGTCTCCAAC CCGGAACAGT CGGTCCGCGC CCGGTTGATG
CACGTCGCCT GGCTGGCATT CAATGTATTG CGTCCGCGCG GGCGGGACTA TTGGGGCTGG
TCCGCGGGCA TACTCGGCAG CGGCTTCGGG CTGCACCGCC GGACGCTGGA GAGCGTTCCC
TTTGACGCCG GTTCCATCGC CGAGGACCTG GAGTATCACA TCCGCCTGGT GCGAGCGGGC
AGGCGCGTGC GCTTCTGCGA CGGTACGACC GTGTGGTCAC CCATGCCAGC CACGGCTGCA
GCGGCTTCCA GCCAGCGTGC GCGCTGGGAG GGGGGCAGGT TCCGGATGAT GCGCGAACAG
ATCCCCCCGC TCATCCGGCA GGTGGCAGGA GGCCGCTGGC CGCTGCTGGA ACCGTTGCTG
GACCTGCTGC TGCTGCCGCT GGCCTATCAT GTGATCCTGC TGGCGCTGCT GCTGGCCTGG
CCCTGGCCAC CAGGGAGGAT AGGGGCCGCC GCCGGTATGG TCATCGTGGG TTTGCACATC
GCTGCAGCCC TGGCCGTGGG ACGGGCCGGC TGGCGGGACT GGGGAGCACT GGCCGCAGCG
CCTTTCTATG TCGTCTGGAA ACTCACGTTG GGCAAACGCC TGCTCTCGTC AGCGGGCCGG
GATGCCGCCT GGGTACGCAC CGAAAGGACG AAATCCGATG AATCCGCCTG A
 
Protein sequence
MPIDTFILLP GVVLSAAVLP GTFELAMLTL GGVLPRRKTA AIRDAAPLRF CIVTPAHDEA 
DGIAACLRSI QGAETGHHSV TIVVVADNCS DDTAARAEAA GARVLVREDP ERRGKGYALD
HAFSILLKED HDVFVVVDAD TRVEPNFLGE LAVLFQAGAD AAQTRYRVSN PEQSVRARLM
HVAWLAFNVL RPRGRDYWGW SAGILGSGFG LHRRTLESVP FDAGSIAEDL EYHIRLVRAG
RRVRFCDGTT VWSPMPATAA AASSQRARWE GGRFRMMREQ IPPLIRQVAG GRWPLLEPLL
DLLLLPLAYH VILLALLLAW PWPPGRIGAA AGMVIVGLHI AAALAVGRAG WRDWGALAAA
PFYVVWKLTL GKRLLSSAGR DAAWVRTERT KSDESA