Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1237 |
Symbol | |
ID | 7399505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1247901 |
End bp | 1249625 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708301 |
Product | formate-tetrahydrofolate ligase FTHFS |
Protein accession | YP_002565899 |
Protein GI | 222479662 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.474316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00654848 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCATG CAGATCCGGC AATCGACGGA GCCGATACGG CCGACGCACC GGAGTCCGAC CTCGCCGTCG CCCGCGCCGC GACTCCCCGT CCCATCGAGG AGGTCGCGGC CGACCTCGGG CTGGCCCCCG ACGAGATCGA ACCCCGCGGC GACGGGGTCG CGAAGCTCAC GCAGTCGGCG GTTCGCTCGG CGACTGCGAG CGAACCCGAT GGGACGACCG TGCTCGTCAC GGGGATGACC CCGACGCCGA AGGGCGAGGG GAAGACCGTG ACGACGGTGG GGCTCGGGCA GGCGCTCGCG GGACTGGGAG AGTCGACGGC TGTCGCGGTC CGAGAGCCCT CGCTCGGCCC CGTCTTCGGG ATCAAGGGCG GCGCCGCGGG CGGCGGGTAC TCGCAGGTGC TTCCGATGGA GTCGATCAAC CTCCACTTCA CCGGCGACAT CCACGCGCTC ACGGCCGCGC ACAACCTGCT TTCCGCGGCG CTCGACAACC ACCTCCATCA GGGGAATGAG GAGGGTGTCG ACGTGCGCCG CGTCGACTGG CCGCGCGCGC TCGACGTCAA CGACCGCGCG CTCCGCGAGA CCGTCGTGGG ACTCGGCGGT CCCGCGCGCG GCGTGCCGCG GGAAGACGAG TTCGTCATCA CCGCCGCCTC GGAGCTGATG GCCGTCCTCG GGCTCGCTGA GGACCTGTCG GACCTAAAGA CGCAGATCGG ACGGATCGTC CTCGCCGAAG ACGCCGACGG CGATCCGGTC ACACCCGACG ACCTCGGCGT CACGGGGGCC GCGGCGGCGC TCCTCCGCGA CGCGTTCCGC CCGAACCTCG TCCAGACTAT CGAAGGCGTT CCCGCCTTGG TCCACGGCGG TCCGTTCGCC AATATCGCGC ACGGGACCAA CACGCTCGTG GCCGACCGCG TCGGTGCCTC GCTGGCCGAC TACCTCGTCA CCGAGGCCGG CTTCGGGGCG GACCTCGGTG CCGAGAAGTT CGCGCACATC GTCGCGCGCG AGGGGATCGT CCCGGACGTA GCGGTCGTCG TCGCGACGGT CCGCGGCGCG AAGCGCCACG GACTGGAGAT GTGGCCGGCC GACTTCGACG CGCTGGCGAA GACTGACCCC GAGGCGGTGC GAGCCGGCGT CGACAACGTG ACACGGCACG TCGAGATCGT GGAATCGCTC GGGATCCCCG CGGTCGTCGG TATCAACGTC TTCCCCGACG ACGCGGAGTC GGAGCTTGCG GCCCTCGAGT CGACGCTGAC CGACGCGGGG ATTCCCGTCG CGCGCTCGAC CGCCTACCGC GACGGCGGCG AGGGGGCGAT GCCGCTCGCG GAGCTGGTCC GCGAACGTGC CGGCACCGGC GAATTCGCGC CGCTGTACGA CCTCGACGCA CCGCTCCGCG AGAAGGTCGA GACCGTCGCC CGCGAGGTGT ACGGCGCCGA CGGCGTCGAG TACGTCGACG GCGCCGACGA GGACATCGAC CGCGTCGAGG CGTGGGGGTA CGGCGACCTC CCCGTCTGCG TCTCGAAGAC GCCGTACTCC TTCTCGGACG ACGCCTCGCT GACGGGCGTT CCGGAGGGGT GGACGCTCAC CGTCCGGGAG GTGTCGCCGT CGGCGGGCGC CGGCTTCGTC GTCGTCAAGA CTGCGGACGT GATGACGATG CCGGGGCTCC CGGCCGAGCC GGCCGCCGAA GAAATTGACG TGGACGCAGA CGGGAACCTG AGCGGGCTGT TCTGA
|
Protein sequence | MVHADPAIDG ADTADAPESD LAVARAATPR PIEEVAADLG LAPDEIEPRG DGVAKLTQSA VRSATASEPD GTTVLVTGMT PTPKGEGKTV TTVGLGQALA GLGESTAVAV REPSLGPVFG IKGGAAGGGY SQVLPMESIN LHFTGDIHAL TAAHNLLSAA LDNHLHQGNE EGVDVRRVDW PRALDVNDRA LRETVVGLGG PARGVPREDE FVITAASELM AVLGLAEDLS DLKTQIGRIV LAEDADGDPV TPDDLGVTGA AAALLRDAFR PNLVQTIEGV PALVHGGPFA NIAHGTNTLV ADRVGASLAD YLVTEAGFGA DLGAEKFAHI VAREGIVPDV AVVVATVRGA KRHGLEMWPA DFDALAKTDP EAVRAGVDNV TRHVEIVESL GIPAVVGINV FPDDAESELA ALESTLTDAG IPVARSTAYR DGGEGAMPLA ELVRERAGTG EFAPLYDLDA PLREKVETVA REVYGADGVE YVDGADEDID RVEAWGYGDL PVCVSKTPYS FSDDASLTGV PEGWTLTVRE VSPSAGAGFV VVKTADVMTM PGLPAEPAAE EIDVDADGNL SGLF
|
| |