Gene Tpet_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1457 
Symbol 
ID5171652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1447496 
End bp1448824 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content44% 
IMG OID640563988 
Productradical SAM domain-containing protein 
Protein accessionYP_001245046 
Protein GI148270586 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCAAAC CTTCAAGATA CAATGTGATT TTCAGAGATG GCGACTACGT TGTGTTTGTG 
AACTTTCTAA CCAGAGCGAT CGCTCGTCTG GAAAAAAGCA AAGCAGAAAT TGCAGAGAAG
ATTCTCAAAG ATCCAGACGA AGAGATCCCT GAAGAATGGC TTTCCATAAA GAAAGATTTG
ATCTACGGTG GGTATATCGT TGATGAGGAC TTCGACGAAA TAGAGCATCT GAAACTTATG
AACAGAATGA CCCGCTATGA TTCTTCCTCG GTTATGGTCA CGATCATACC CACCCTGGCA
TGCAATTTTG ATTGTATCTA CTGCTATGAA TCGAAAACAG GACCTTCGAT GACAGCGAAA
ACGGCAGAAA GGATCGTAGA ATATCTGAAA AGGCTGATTC GAACCAGAAG ATCTATAAGT
GTAGGCTGGT TCGGCGGGGA ACCCCTCCTG TGTTTTGACG TGGTGAAGTT CGTAAACTCA
TCGCTGATAG AAGCATGCAG AGAAAACAAT GTGGATTTTC ACTCCTCCAT GTCAACGAAT
GGATACCTTC TCGACAAAGA AAAAGCGGAA TGGTTTGACA GGCTCGAAAT AAGAAACGTT
CAAATAACGA TCGATGGCCC AGAAGATGTT CACAATAAAT ACAGGCCTTT GAAAGGTGGC
AAGGGAACTT TCGACACCAT TGTGGAAAAT CTGGAGAATC TCTTCAGGGT CACGGAGAAA
CTTCAGGTCA CGTTCAGAAT GAATGTGGGA CCTGACAACT TCCATCGTGT AGAAGAGTTT
CTGAACGTTC TGGAGCGATT TCCGAAAGAC AGAACGAGGG TATATTTCAG GTGGATTTTT
GGATCGAACA GCAGGGAATT CTTTTTCAGG AAGGTCTATG AGATCAGAGA TCGGGAGTCT
CTGAACATTC TCAATTTCTA CGAAAGTGCT GCGAAAAGAG GCTTCAATGT GTTTCTTCCC
GTTCTTGTGC AAAACAGATA CTGTGAGTAC GACTGTGTGT CTTCTGTTGT GATAGGTCCA
CAGGGAGAGT TGTATCCATG CACAGTGAGA GTGGGAAAAG GCATGGAGAT AGGAAGATTG
ACCAACCGAG GACTGGAATA CGACAGGAAA AAATATCTCA GATGGCATTC TTTCGATGCT
TTCGAAAGCG AAGAATGCAT GAGGTGCAAA CTCCTTCCTG TGTGCATGGG AGGATGTAGA
AGTGCCAGGT TCGATGGCAA AACGGGATGC CCTGAAGAGA AAAAGGATCC GGAGAAATTC
GCAAGGGAAT GGTACAGGAT AAAACTCCTT GAAAGGCAGG TCGAAAGACA TGAAGCCTTC
GAGATTTAA
 
Protein sequence
MFKPSRYNVI FRDGDYVVFV NFLTRAIARL EKSKAEIAEK ILKDPDEEIP EEWLSIKKDL 
IYGGYIVDED FDEIEHLKLM NRMTRYDSSS VMVTIIPTLA CNFDCIYCYE SKTGPSMTAK
TAERIVEYLK RLIRTRRSIS VGWFGGEPLL CFDVVKFVNS SLIEACRENN VDFHSSMSTN
GYLLDKEKAE WFDRLEIRNV QITIDGPEDV HNKYRPLKGG KGTFDTIVEN LENLFRVTEK
LQVTFRMNVG PDNFHRVEEF LNVLERFPKD RTRVYFRWIF GSNSREFFFR KVYEIRDRES
LNILNFYESA AKRGFNVFLP VLVQNRYCEY DCVSSVVIGP QGELYPCTVR VGKGMEIGRL
TNRGLEYDRK KYLRWHSFDA FESEECMRCK LLPVCMGGCR SARFDGKTGC PEEKKDPEKF
AREWYRIKLL ERQVERHEAF EI