Gene Hlac_1237 details

Gene Information

Locus tag: Hlac_1237
Symbol:
ID: 7399505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1247901
End bp: 1249625
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 71%
IMG OID: 643708301
Product: formate-tetrahydrofolate ligase FTHFS
Protein accession: YP_002565899
Protein GI: 222479662
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
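
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1247901
end_bp = 1249625

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1725 0 574
```

Under these assumptions the result matches the listed gene length of 1725 bp and protein length of 574 aa.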


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.474316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00654848
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTGCATG CAGATCCGGC AATCGACGGA GCCGATACGG CCGACGCACC GGAGTCCGAC 
CTCGCCGTCG CCCGCGCCGC GACTCCCCGT CCCATCGAGG AGGTCGCGGC CGACCTCGGG
CTGGCCCCCG ACGAGATCGA ACCCCGCGGC GACGGGGTCG CGAAGCTCAC GCAGTCGGCG
GTTCGCTCGG CGACTGCGAG CGAACCCGAT GGGACGACCG TGCTCGTCAC GGGGATGACC
CCGACGCCGA AGGGCGAGGG GAAGACCGTG ACGACGGTGG GGCTCGGGCA GGCGCTCGCG
GGACTGGGAG AGTCGACGGC TGTCGCGGTC CGAGAGCCCT CGCTCGGCCC CGTCTTCGGG
ATCAAGGGCG GCGCCGCGGG CGGCGGGTAC TCGCAGGTGC TTCCGATGGA GTCGATCAAC
CTCCACTTCA CCGGCGACAT CCACGCGCTC ACGGCCGCGC ACAACCTGCT TTCCGCGGCG
CTCGACAACC ACCTCCATCA GGGGAATGAG GAGGGTGTCG ACGTGCGCCG CGTCGACTGG
CCGCGCGCGC TCGACGTCAA CGACCGCGCG CTCCGCGAGA CCGTCGTGGG ACTCGGCGGT
CCCGCGCGCG GCGTGCCGCG GGAAGACGAG TTCGTCATCA CCGCCGCCTC GGAGCTGATG
GCCGTCCTCG GGCTCGCTGA GGACCTGTCG GACCTAAAGA CGCAGATCGG ACGGATCGTC
CTCGCCGAAG ACGCCGACGG CGATCCGGTC ACACCCGACG ACCTCGGCGT CACGGGGGCC
GCGGCGGCGC TCCTCCGCGA CGCGTTCCGC CCGAACCTCG TCCAGACTAT CGAAGGCGTT
CCCGCCTTGG TCCACGGCGG TCCGTTCGCC AATATCGCGC ACGGGACCAA CACGCTCGTG
GCCGACCGCG TCGGTGCCTC GCTGGCCGAC TACCTCGTCA CCGAGGCCGG CTTCGGGGCG
GACCTCGGTG CCGAGAAGTT CGCGCACATC GTCGCGCGCG AGGGGATCGT CCCGGACGTA
GCGGTCGTCG TCGCGACGGT CCGCGGCGCG AAGCGCCACG GACTGGAGAT GTGGCCGGCC
GACTTCGACG CGCTGGCGAA GACTGACCCC GAGGCGGTGC GAGCCGGCGT CGACAACGTG
ACACGGCACG TCGAGATCGT GGAATCGCTC GGGATCCCCG CGGTCGTCGG TATCAACGTC
TTCCCCGACG ACGCGGAGTC GGAGCTTGCG GCCCTCGAGT CGACGCTGAC CGACGCGGGG
ATTCCCGTCG CGCGCTCGAC CGCCTACCGC GACGGCGGCG AGGGGGCGAT GCCGCTCGCG
GAGCTGGTCC GCGAACGTGC CGGCACCGGC GAATTCGCGC CGCTGTACGA CCTCGACGCA
CCGCTCCGCG AGAAGGTCGA GACCGTCGCC CGCGAGGTGT ACGGCGCCGA CGGCGTCGAG
TACGTCGACG GCGCCGACGA GGACATCGAC CGCGTCGAGG CGTGGGGGTA CGGCGACCTC
CCCGTCTGCG TCTCGAAGAC GCCGTACTCC TTCTCGGACG ACGCCTCGCT GACGGGCGTT
CCGGAGGGGT GGACGCTCAC CGTCCGGGAG GTGTCGCCGT CGGCGGGCGC CGGCTTCGTC
GTCGTCAAGA CTGCGGACGT GATGACGATG CCGGGGCTCC CGGCCGAGCC GGCCGCCGAA
GAAATTGACG TGGACGCAGA CGGGAACCTG AGCGGGCTGT TCTGA
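
The GC content figure in the record is presumably the fraction of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the 10-base grouping with spaces is assumed to be display formatting that should be stripped before counting. Only the first line of the sequence is used here for brevity, so its value need not equal the figure for the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTGCATG CAGATCCGGC AATCGACGGA GCCGATACGG CCGACGCACC GGAGTCCGAC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` would give the value for the whole gene.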
 
Protein sequence
MVHADPAIDG ADTADAPESD LAVARAATPR PIEEVAADLG LAPDEIEPRG DGVAKLTQSA 
VRSATASEPD GTTVLVTGMT PTPKGEGKTV TTVGLGQALA GLGESTAVAV REPSLGPVFG
IKGGAAGGGY SQVLPMESIN LHFTGDIHAL TAAHNLLSAA LDNHLHQGNE EGVDVRRVDW
PRALDVNDRA LRETVVGLGG PARGVPREDE FVITAASELM AVLGLAEDLS DLKTQIGRIV
LAEDADGDPV TPDDLGVTGA AAALLRDAFR PNLVQTIEGV PALVHGGPFA NIAHGTNTLV
ADRVGASLAD YLVTEAGFGA DLGAEKFAHI VAREGIVPDV AVVVATVRGA KRHGLEMWPA
DFDALAKTDP EAVRAGVDNV TRHVEIVESL GIPAVVGINV FPDDAESELA ALESTLTDAG
IPVARSTAYR DGGEGAMPLA ELVRERAGTG EFAPLYDLDA PLREKVETVA REVYGADGVE
YVDGADEDID RVEAWGYGDL PVCVSKTPYS FSDDASLTGV PEGWTLTVRE VSPSAGAGFV
VVKTADVMTM PGLPAEPAAE EIDVDADGNL SGLF
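
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below builds a codon table from the NCBI-style base order `TCAG` and the amino-acid string used by translation table 11 (which assigns the same amino acids as the standard code and differs only in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative. The sketch does not handle alternative start codons, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# NCBI-style layout: codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGTGCATG CAGATCCGGC AATCGACGGA"))  # MVHADPAIDG
```

The output matches the first ten residues of the protein sequence listed above.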