Gene Emin_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1463 
Symbol 
ID6263756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1558051 
End bp1559298 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content44% 
IMG OID642611948 
Productglycine hydroxymethyltransferase 
Protein accessionYP_001876348 
Protein GI187251866 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTCAA ATTTGCAGAA AACAGACAAG GCAGTTTTTG ACGCGGTTGA AAAGGAACTA 
GGCCGCCAGC GCACTAAGCT TGAATTAATA GCAAGTGAAA ACTTTACATC TTTAAGCGTT
ATGGAAGCGC AGGGGAGTAT TCTTACAAAC AAATACGCCG AAGGCTACCC GGGTAAACGT
TATTACGGCG GCTGCGAATT TGTAGACATG GTTGAAACTT TAGCGATAGA AAGAGCTAAA
CAAATTTTCG GCGCGGAGCA CGCCAATGTG CAGCCGCATT CGGGCGCGCA GGCAAATATG
GCCGCTTATT TGGCTCTTAT TAATCCAGGT GACACCGTGC TTGGCTTGAA TCTTTCCCAC
GGCGGGCATT TAACTCACGG GCATCCGATG AACTTTTCAG GCAAATATTT TAAAATCGTT
CCCATGAATG TACGTAAAGA GGATGAGCAA ATTGATTATG AGGAAGCCGC CAAACTGGCC
CTTGAACACA AACCCAAAGT TATTATGGCG GGCGCTTCAA ATTATTCAAG AATTTTTGAT
TGGAAAAAAC TGCGCGAAAT CGCGGACAGT GTGGACGCAT ACCTTATTTG TGACGTGGCG
CATTACGCAG GGCTTATAGC CGCGGGCGTT TATTCAAACC CCGTGCCTTA CGCGGACATT
GTTACAACAA CAACACATAA AACCCTGCGC GGCCCCAGAG GCGGTTTAAT TTTATGTAAA
GAAAAACATG CCAAAGCGGT TAACTCATCA GTTTTTCCCG GCCAGCAGGG CGGGCCGCTA
ATGCATGTTA TCGCGGCCAA AGCGGTTTGC TTCGGCGAAG CGTTAAAACC GGAATTTAAA
GAATACCAAA CCCAAGTTGT TAAAAACGCC AAAGAACTTT CGACCCAATT ACAAAAACTG
GGGTACCGCA TAGTTTCCGG CGGTACGGAT TGCCATGTTT TATGTGTTGA TTTAACTTCT
AAATCAATGA CTGGTAAAGC TGCTGAGGAA GCTTTAGATA AAGCGGGCAT AACCACAAAT
AAAAATACCA TTCCTTACGA TACGCAAAAA CCGTTTATTA CAAGCGGTGT AAGGCTTGGC
ACTCCCGCTG TTACAACAAG GGGCATGAAA GAAGCTGAAA TGGCGGCTAT AGCCTCCTTT
ATTGACAATG TTTTAAATAA CGCGGATAAT GAAGCTAAAC TTGCGGAAAT TTCTAAAGAG
GTTACAGCTT TTTTAGGTAA ATTCCTGTTA TATACAGAAT TAAACTAA
 
Protein sequence
MYSNLQKTDK AVFDAVEKEL GRQRTKLELI ASENFTSLSV MEAQGSILTN KYAEGYPGKR 
YYGGCEFVDM VETLAIERAK QIFGAEHANV QPHSGAQANM AAYLALINPG DTVLGLNLSH
GGHLTHGHPM NFSGKYFKIV PMNVRKEDEQ IDYEEAAKLA LEHKPKVIMA GASNYSRIFD
WKKLREIADS VDAYLICDVA HYAGLIAAGV YSNPVPYADI VTTTTHKTLR GPRGGLILCK
EKHAKAVNSS VFPGQQGGPL MHVIAAKAVC FGEALKPEFK EYQTQVVKNA KELSTQLQKL
GYRIVSGGTD CHVLCVDLTS KSMTGKAAEE ALDKAGITTN KNTIPYDTQK PFITSGVRLG
TPAVTTRGMK EAEMAAIASF IDNVLNNADN EAKLAEISKE VTAFLGKFLL YTELN