Gene Tpet_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1054 
Symbol 
ID5171228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1081015 
End bp1082643 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content50% 
IMG OID640563572 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_001244647 
Protein GI148270187 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACCGA TCAAAGAGAT CGCCGATCAG TTGGGATTGA AAGACGACGT TCTCTATCCT 
TACGGCCACT ACATAGCGAA GATAGACCAC AGATTCTTGA AAAGCCTTGA AAATCGTGAA
GATGGGAAAC TGATCCTCGT AACGGCAGTT ACTCCTACCC CTGCAGGTGA AGGAAAAACT
ACCACCAGCA TAGGACTTTC CATGTCTCTG AACAGAATCG GTAAGAAATC GATAGTTACT
CTGAGAGAGC CTTCCCTCGG CCCCACCCTT GGATTGAAAG GCGGTGCAAC AGGAGGTGGC
AGGTCGAGGG TTCTTCCTTC TGATGAGATA AACCTTCATT TCACCGGAGA CATGCACGCC
GTGGCCTCCG CCCACAATCT GCTGGCGGCC GTTCTGGATT CGCACATAAA ACACGGCAAC
GAGCTCAAGA TAGACATAAC AAGAGTATTT TGGAAGCGAA CCATGGACAT GAACGACCGT
GCTTTGAGAA GCATAGTGAT CGGTCTTGGA GGCTCAGCGA ACGGATTTCC AAGAGAAGAC
AGTTTCATCA TCACGGCCGC TTCCGAAGTG ATGGCCGTTC TTGCTCTGTC CGAGAACATG
AAGGACCTGA AAGAAAGACT TGGAAAAATA ATCGTAGCGC TCAACACTGA CAGGAAAATC
GTCAGGGTCT CTGATCTTGG AATTCAGGGG GCCATGGCCG TTCTTTTGAA AGATGCCATA
AATCCCAACC TTGTTCAGAC AACCGAAGGA ACTCCAGCGC TCATACACTG CGGACCTTTC
GCCAACATCG CGCACGGCAC CAATTCCATC ATAGCGACGA AGATGGCCAT GAAGCTCTCC
GAATACACGG TCACGGAAGC GGGTTTTGGA GCAGACCTCG GTGCCGAAAA ATTCATCGAC
TTTGTTTCCC GTGTCGGTGG TTTTTATCCG AACGCGGCCG TTCTTGTGGC CACAGTACGA
GCACTGAAAT ACCACGGTGG TGCGGACCTC AAAAACATAC ACGAGGAAAA CCTGGAAGCC
CTCAAAGAAG GATTCAAAAA TCTCAGGGTA CACTTGGAAA ACCTGAGGAA ATTCAATCTA
CCCGTCGTGG TAGCGCTGAA CAGGTTCATC ACGGACACAG AAAAGGAAAT AGCCTACGTG
GTGAAAGAGT GCGAAAAACT CGGTGTGAGG GTAGCAGTCA GTGAGGTTTT TGAAAAGGGC
AGCGAAGGTG GTGTCGAACT CGCAAAAGCC GTGACAGAAG CTGTGAAGGA CGTAAAACCG
GTTTATCTCT ACGAAATGAA CGATCCTGTG GAAAAGAAGA TAGAGATTCT CGCAAAGGAG
ATCTACAGAG CGGGAAGAGT GGAGTTTTCC GATACTGCAA AAAATGCCCT CAAATTCATT
AAAAAACACG GTTTTGATGA GCTTCCCGTG ATCGTTGCCA AAACTCCAAA GTCCATTTCC
CACGATCCGT CTCTCAGAGG TGCACCCGAA GGATACACGT TCGTTGTCAG CGACCTCTTC
GTTTCCGCCG GGGCAGGCTT TGTCGTTGCG CTTTCAGGAG ACATAAATCT GATGCCCGGT
CTTCCAGAAA GGCCGAACGC CCTGAACATG GACGTAGACG ACAGCGGTAA CATAGTAGGT
GTTTCGTGA
 
Protein sequence
MKPIKEIADQ LGLKDDVLYP YGHYIAKIDH RFLKSLENRE DGKLILVTAV TPTPAGEGKT 
TTSIGLSMSL NRIGKKSIVT LREPSLGPTL GLKGGATGGG RSRVLPSDEI NLHFTGDMHA
VASAHNLLAA VLDSHIKHGN ELKIDITRVF WKRTMDMNDR ALRSIVIGLG GSANGFPRED
SFIITAASEV MAVLALSENM KDLKERLGKI IVALNTDRKI VRVSDLGIQG AMAVLLKDAI
NPNLVQTTEG TPALIHCGPF ANIAHGTNSI IATKMAMKLS EYTVTEAGFG ADLGAEKFID
FVSRVGGFYP NAAVLVATVR ALKYHGGADL KNIHEENLEA LKEGFKNLRV HLENLRKFNL
PVVVALNRFI TDTEKEIAYV VKECEKLGVR VAVSEVFEKG SEGGVELAKA VTEAVKDVKP
VYLYEMNDPV EKKIEILAKE IYRAGRVEFS DTAKNALKFI KKHGFDELPV IVAKTPKSIS
HDPSLRGAPE GYTFVVSDLF VSAGAGFVVA LSGDINLMPG LPERPNALNM DVDDSGNIVG
VS