Gene MCA1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1301 
Symbol 
ID3104928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1388055 
End bp1389365 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content70% 
IMG OID637170479 
ProductYjeF-related protein 
Protein accessionYP_113763 
Protein GI53804634 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCCGTGG TGTGCGGTCC CGGCAACAAC GGCGGCGACG GCTATGTGAT CGCCCGGCTG 
GCGCTGGCGG CGGGCTTCGA CGTTCGCGCC TATCCTGTCG GCCCGGTCGA GCGGCTGCGG
GGAGATGGCG CCGCCGCGTT CGCGGAGTAC CGCAACGCCG ATGGGCCGCT GCTGAACTTC
ATTCCACCCG GCTTCGAAGG CGCCGAGATC CTGGTCGACG CCCTGCTCGG CACCGGCCTG
GACCGGGACG TGACCGACGA ATACGCCGCC GTCATCGATG CCGTCAACGA TTTTCCCGGT
AAGGTGGTCG CCGTGGACAT CCCCTCCGGA CTCAACGCCG ACACCGGGGC GGTGATGGGA
AACGCGGTCC GTGCCGACCT GACGGTCAGC TTCATCGGTC TCAAGCAGGG GCTGTTCACC
GGTGCCGGCC CAGCGCACTG CGGCGAGATC GTCTTCGACG ACCTGGACAC GCCGCCGGAA
ATCCGTCTGG CCCAAACGCC TTCCTCTCGC TTGCTGCGGA GCAATGACTT CACCCTCCCG
CCCCGGCGCC GGGATGCCCA CAAAGGCCAC TACGGGCATG TATTGGTGAT CGGCGGCGAA
TGCGGCTACA GCGGGGCGGC ACGGATGGCG GCCGAGGCGG CGGCGCGTAC AGGGGCGGGA
CTGGTCAGCA TCGCCACGCG GACATCCCAC GCGCCCCTCC TGAACGTGGG CCGGCCGGAG
CTGATGGTGC ACGGTGCCGA ATCCGGCGGC GAGCTCGGGC CATTGCTGCA GCGCGCATCG
GTGCTGGCAC TCGGTCCGGG GCTTGGGCAG GGCGAATGGG CGAAGGCGTT GTTCGACGCA
GCCCTCGACT GCGGCAAACC GGCGGTGATC GACGCCGATG CGCTGAATCT CCTGGCGAAG
CTGCCGCGCC GATGCGACCA CTGGATACTG ACGCCGCATC CCGGCGAGGC CGCCCGCCTG
CTCGGGGTAG CCGTTGCGGA TGTCCAGCGC GATCGCTTCG CCGCCGTGTC CGCCTTGCAG
CGGCGTTACG GCGGCGTCGC CGTGCTCAAG GGAGCGGGTA CGCTGATCGC CGGACCGGAT
GGTGTCCCGC ACGTCGCCCG CTGGGGCAAT CCCGGCATGG CCAGCGGCGG CATGGGCGAC
GTGCTCACCG GCGTGATCGC AGGACTGCGG GCGCAGCACG TCCCCCCGTT CGAATCCGCC
TGTCTGGGGG TTCGTATACA TGGCCAGGCG GGCGACCTGG CCGCCTCGGC CGGCGAGCGG
GGTCTCCTCG CCGGCGACCT CATCGATGCC TTGCGCGCCT GTATCAACTG A
 
Protein sequence
MSVVCGPGNN GGDGYVIARL ALAAGFDVRA YPVGPVERLR GDGAAAFAEY RNADGPLLNF 
IPPGFEGAEI LVDALLGTGL DRDVTDEYAA VIDAVNDFPG KVVAVDIPSG LNADTGAVMG
NAVRADLTVS FIGLKQGLFT GAGPAHCGEI VFDDLDTPPE IRLAQTPSSR LLRSNDFTLP
PRRRDAHKGH YGHVLVIGGE CGYSGAARMA AEAAARTGAG LVSIATRTSH APLLNVGRPE
LMVHGAESGG ELGPLLQRAS VLALGPGLGQ GEWAKALFDA ALDCGKPAVI DADALNLLAK
LPRRCDHWIL TPHPGEAARL LGVAVADVQR DRFAAVSALQ RRYGGVAVLK GAGTLIAGPD
GVPHVARWGN PGMASGGMGD VLTGVIAGLR AQHVPPFESA CLGVRIHGQA GDLAASAGER
GLLAGDLIDA LRACIN