Gene Tpet_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1412 
Symbol 
ID5170401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1401956 
End bp1403197 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content48% 
IMG OID640563940 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001245001 
Protein GI148270541 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTGTCAA CAAGATTGTA TGAAGACTTT CCCACCCTGA AGATGAAGAT TAACGGAAAA 
AGGATCGTTT ATCTGGACAG CGCAGCCACT ACACTGAAGC CAAAACAGGT CGTGAAAAAG
CTGGAAGAAT TCTACTACAA CAGCTACGCG AACGTTCACA GAGCGGTTCA CACTCTCGCT
GTGCGTGCCA CCACCGAATT CGAGGAAACC AGGGAGGAGT TCGCAGAGTT TTTAGGTGCT
TCTCCAGAGG AGATAATCTT CACCTCGGGC ACCACCATGG CCATAAATCT CGCTGTGGTG
TCCCTTCTGA GGAGTGGCAT ATTGAAAGAG GACGATCTGG TTCTGGTTTC CCTTCTGGAA
CACCACGCGA ACTTTGTCCC CTGGCTCAGA CTTTCGAAAC TCCATGGGTA CAGGATCGAC
TTCATCAAGC CCTCGGGAAG ATTCGGAACA CTCGAGATGG ATGACCTGTT GAAACACAGA
GACAAAAACC CGAAAGTCGT TGCCATAACA GGCCTTTCCA ACGTGACCGG TCAAAGAATA
CCCGTTGAAG AGCTGAGAAA CGTCTTTCCT AATTCCATCA TTCTCCTGGA TGGTGCTCAA
CTGATACCTC ATGAACCGGT GAAACCGAAG GAGATAGGAG TAGATTTTCT TGCCTTTTCA
CTCCACAAGA TGCTCGGTCC CACCGGTGTG GGTGTGCTTT ACGGAAAGAG AGAGCTGATG
GAACAGATGG AACCGTTTCT CTACGGCGGA GAGATGATCG ACAGAGTGTC CCTCGAAGAC
GTCACATTCA ACGAATTACC TTACAAGTTC GAAGCGGGAA CTCCAAACAT AGCGGATATA
GTCGCATCCA GGGAAGCCTT GAGGTATTTG AAAGAAACGG GTTTCGATTC AGTTCATGAA
AAGATAGAAC AGCTCACTCA GATGGCCTTG AAAGAGATGA TGAAAATCGG CGGAGTAGAA
ATCTACGGCC CACTCGACGA AAGACAGCAC GGGATAGTCT CGTTCAATGT AAAGGGAATA
CACCCGCACG ATGTGGCTCA CATTCTCGAT CAGGAGTTTG GAATAGCCGT AAGGAGTGGG
CACCACTGCG CACAGCCTCT GATGAGTATT TTGAAAGAAC AGTCGCAAAT CGATTTTCCA
AACTCCACCT GCAGAGCCAG TTTTTACATC TACAACACGG AAGAAGATGT TAAAATTCTC
GTTGAAGGTG TGAAAAAAGT AAAGGAGTGG TTTTCAAGAT GA
 
Protein sequence
MLSTRLYEDF PTLKMKINGK RIVYLDSAAT TLKPKQVVKK LEEFYYNSYA NVHRAVHTLA 
VRATTEFEET REEFAEFLGA SPEEIIFTSG TTMAINLAVV SLLRSGILKE DDLVLVSLLE
HHANFVPWLR LSKLHGYRID FIKPSGRFGT LEMDDLLKHR DKNPKVVAIT GLSNVTGQRI
PVEELRNVFP NSIILLDGAQ LIPHEPVKPK EIGVDFLAFS LHKMLGPTGV GVLYGKRELM
EQMEPFLYGG EMIDRVSLED VTFNELPYKF EAGTPNIADI VASREALRYL KETGFDSVHE
KIEQLTQMAL KEMMKIGGVE IYGPLDERQH GIVSFNVKGI HPHDVAHILD QEFGIAVRSG
HHCAQPLMSI LKEQSQIDFP NSTCRASFYI YNTEEDVKIL VEGVKKVKEW FSR