Gene Hlac_0332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0332 
Symbol 
ID7399722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp355679 
End bp356839 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID643707394 
Productfolate-binding protein YgfZ 
Protein accessionYP_002565006 
Protein GI222478769 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR03317] folate-binding protein YgfZ 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.97102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0439096 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTCG TCTCCGACAC CCACGGTGCC CACGGCGCGG TGTACCGCGA CCGCGGTGGC 
CGCCGGGTGG TGGATCACTA CAGGAAGCCC GAGCGCGTCG GCAAGGCGGT CCGCAACGTC
GTCGGCGCAA TCGAGATGGG GTACGGCGTG CTCGCGATCA CGGGCGAGGA CCGCGTCGAG
TTCATCGACA ACGCCGTCTC CAACCGAATT CCGGAGGCAG ACGGTCAGGG CGTGTACGCA
CTCCTGCTCG ATCCCCAGGG CGGCATCGAG ACGGACATGT ACGTGTACAA CGCCGACGAG
CGCCTCCTCG TCTTCCTCCC GCCCGAGCGC ACCGAGGCGG TCGCCGAGGA CTGGGCGAGC
AAGGTGTTCA TTCAGGACGT GACGATCGAC GACATCTCCG ACGAGCTCGG CGTCTTCGGA
GTCCACGGCC CCAAGTCGAC CGAGAAGGTC GCCTCGGTAC TCGGCGGACC GGGCGCACCC
GAGAAACCGC TCTCGTTCGT CCGCGGATCG ATGGTCGACG CCGGCGTCAC CGTGATCGCG
AGCGATGCGC CACTCGGCGA GGAGGGATAC GAGGTCGTCT GCGCCGCCGA GGACGCAGAA
GAGGTGCTCG ACACCCTGCT CAACCGGGGC CTCAACGCGG CCCCGTTCGG CTACCGGACG
TGGGACGCGC TCTCGCTCGA AGCCGGCACG CCCCTCTTCG AGTACGAGCT TGAAGGAACG
GTGCCGAACG TCCTCGGACT CCGCAACGCC TTGGACTTCG AGAAGGGGTG TTACGTCGGT
CAGGAGGTCG TCTCCCGCGT TGAGAATCAG GGACGGCCGA GCCGGCGTCT CATCGGACTC
GACCTCGACG GGCTTGCCGA CGCGACCGCC GACATCGACG GCGACGCCGA CCCGGAGGGG
TACGACGAGA TCCTGCCGTC TCCCGGCGCG GCCGTGTTCG ACGGCGACGA GGCGGTCGGC
GAGGTGACCC GCGCGGCGGT CGGACCGGCC GCCGGCGACC CGATCGCGTT GGCGTTCGCC
CGGTTCGACG CCGACCTCGT CGATCCCACC GTGCGCGTCG ACGGCGAAGA AGTCGCGGCG
ACGCGCTCCG ACCTCCCGTT CCCGTCCGTC GACGGGAGCG CGCAGTCCGC GCGGCTGCCG
ACGTATCCGA GCGACGAGTA G
 
Protein sequence
MTLVSDTHGA HGAVYRDRGG RRVVDHYRKP ERVGKAVRNV VGAIEMGYGV LAITGEDRVE 
FIDNAVSNRI PEADGQGVYA LLLDPQGGIE TDMYVYNADE RLLVFLPPER TEAVAEDWAS
KVFIQDVTID DISDELGVFG VHGPKSTEKV ASVLGGPGAP EKPLSFVRGS MVDAGVTVIA
SDAPLGEEGY EVVCAAEDAE EVLDTLLNRG LNAAPFGYRT WDALSLEAGT PLFEYELEGT
VPNVLGLRNA LDFEKGCYVG QEVVSRVENQ GRPSRRLIGL DLDGLADATA DIDGDADPEG
YDEILPSPGA AVFDGDEAVG EVTRAAVGPA AGDPIALAFA RFDADLVDPT VRVDGEEVAA
TRSDLPFPSV DGSAQSARLP TYPSDE