Gene MCA1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1017 
Symbol 
ID3103139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1068621 
End bp1069607 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content67% 
IMG OID637170202 
Productnucleoside diphosphate sugar epimerase family protein 
Protein accessionYP_113493 
Protein GI53804856 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.582194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAC TGGTCACCGG CGCCACCGGC CATCTCGGCG CCAATCTGGT TCGGGCGCTG 
CTGGCCCGGG GCGAGAAGGT GCGCGCCTTC ATCCGCCGAC AAAGTGACGT CGCGGCGTTG
GACGGCTTGG CGGTCGAACG GGCCTACGGC GATCTGCGCG ACCGCCGCTC GATCCGGGAC
GCGCTGGAAG GCGTGGAGCG GTTGTACCAC ACCGCGGCCT TCGTCAGTAT CCGCGACGGT
GACCGCCAGG AGCTGTTCGA CGTCAACGTG GTCGGCACTC GCATGCTGAT GCAGGAGGCG
CGGCGGGCCG GCGTGCGTCG GGTGGTGCAT ACCAGCTCCT TCGGCGCGGT CGGCATCAAC
CCCCAAGGCG CATCGAACGA ACACTGGACA GTCAGCCCGT TCGAACCGGG CACCGACTAC
GAACGGACCA AGGCCGTGTC GGAACACGAC GTGATCCTCG AAGCCGTGCG CGGCCTCGAC
GTGACCATCG TCAACCCGGC CGCGATCGTC GGTCCGTGGG ATTTCCGGCC CAGCCTGGTC
GGCCGTACCA TCCTCGACTT CGCCCATGGC CGGATGAGGG CGTTCGTTCC CGGTGCCTTC
GACTTCGTCC CGATGCGCGA CGTGGTGGCT GTGGAACTGC TGGCCATGGA CAAAGGCATC
CGCGGTGAGC GCTATCTCGT CACCGGCGAG CACTGCACCA TCGGTCAGAT ACTGCAATGG
CTGGAGGAGC TGACCGGGCA TCCGCGTCCG AGGCTCGCGA TCCCGCCGCG CCTCATGCAG
GGCATCGCAC TGCTGAAGGA CCCGCTGGAA CGCCGTTTTT TCCCCCGCCG GACGCCACGC
TTCAACTACC ACTCCATCCG CCTGCTCAAC TCGGGCAAGC GCGGCGATTC CTCACGGAGC
CGGCGCGAAC TGGGCCTGGT CCCGACTTCC ACCCGGGCGG CTTTCGCCGA CGCCGTGGCC
TGGTTCAGGG AGAGGGGGAT GATCTGA
 
Protein sequence
MTTLVTGATG HLGANLVRAL LARGEKVRAF IRRQSDVAAL DGLAVERAYG DLRDRRSIRD 
ALEGVERLYH TAAFVSIRDG DRQELFDVNV VGTRMLMQEA RRAGVRRVVH TSSFGAVGIN
PQGASNEHWT VSPFEPGTDY ERTKAVSEHD VILEAVRGLD VTIVNPAAIV GPWDFRPSLV
GRTILDFAHG RMRAFVPGAF DFVPMRDVVA VELLAMDKGI RGERYLVTGE HCTIGQILQW
LEELTGHPRP RLAIPPRLMQ GIALLKDPLE RRFFPRRTPR FNYHSIRLLN SGKRGDSSRS
RRELGLVPTS TRAAFADAVA WFRERGMI