Gene Hmuk_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0266 
Symbol 
ID8409764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp263921 
End bp265009 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content70% 
IMG OID645018591 
Productfolate-binding protein YgfZ 
Protein accessionYP_003176110 
Protein GI257386337 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR03317] folate-binding protein YgfZ 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0098053 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGTCA TCGAGGGCGT TCACGAGGAT CGCGGCGCGA CGTTCCGGAC GGTGGGCGGG 
AACCGCGTCG TCGCCAACTA CGGTCGCCCC GAACGAGTCC ACCGGGCGGT CCGGCAGGTC
GTCGGCGTGA TCGAGATGGG GTACGGCGTC GTCACAGTCA CCGGCGACGA CCGGATCGAC
TTCGTCGACA ACGCCGTCTC GAACCGCGTC CCGACCGCCG ACGGCGACGG GGTCTACTCG
CTGCTGCTCG ACCCGCAGGG CCACGTCGAG ACCGAACTCT ACGTCTACAA CGCGGGCGAG
CGACTGCTGC TTTTCGTCCC GCCGGCCCGC GCCGACCCCC TCGTCGAGGA CTGGCGCGAG
AAGACGTTCA TCCAGGACGT GACGATCGCC GACGCGACCG ACGAGTTCGC GGTCTTCGGG
GTCCACGGCC CCAAGGCGAC GGAGAAGATC GCGAGCGTGC TCAACAAAAC CGCGACGCCC
GAGACGCCGC TCTCGTTCGT CCGCGGGTCG ATGGTCGACG CCGGTGTGAC CGTCGTTCGC
AGCGACGGGC TGGTCGGCGA AGAGGGGTTC GAGGTCGTCT GCAGCGCCGA CGTGGCCCGC
GACGTGTACG ACACCCTGGA GAACCGCGGT CTCAACGCCG CTCCCTTCGG CTACGACACG
TGGGACGCGC TGACCCTGGA GGCGGGGACG CCGCTGTTCG ACACCGAGAT CGAGGGCCAG
ATTCCGAACG TCGTCGGCCT CGCGAACGGC GTCGACTTCG AGAAGGGCTG TTTCGTCGGC
CAGGAGGTCG TCTCGCGTGT CCACAACCGC GGCCGGCCCT CGAAGCGACT CGTCGGACTC
ACCTGTGGTG CGGTCCCCGA GTCGGGGGCC GCGGTCTTCG TCGACGACGC CAGCGTCGGC
GCGGTGACGC GAGCCGTCGA GAGCCCGACC CGCGAGGAAC CGATCGCCCT CGCACGCGTC
GACTACGAAC TACCCGACGG GACGCCGTCG GTCCGGGTCG ACGGCGGCGA AGTGGACGCC
GAGCTGGCCG CCCTGCCCTT CGTCACCGGC TCGGACGAGT CCGCGCGACT CCCGCGGTAC
GAGAGATAG
 
Protein sequence
MTVIEGVHED RGATFRTVGG NRVVANYGRP ERVHRAVRQV VGVIEMGYGV VTVTGDDRID 
FVDNAVSNRV PTADGDGVYS LLLDPQGHVE TELYVYNAGE RLLLFVPPAR ADPLVEDWRE
KTFIQDVTIA DATDEFAVFG VHGPKATEKI ASVLNKTATP ETPLSFVRGS MVDAGVTVVR
SDGLVGEEGF EVVCSADVAR DVYDTLENRG LNAAPFGYDT WDALTLEAGT PLFDTEIEGQ
IPNVVGLANG VDFEKGCFVG QEVVSRVHNR GRPSKRLVGL TCGAVPESGA AVFVDDASVG
AVTRAVESPT REEPIALARV DYELPDGTPS VRVDGGEVDA ELAALPFVTG SDESARLPRY
ER