Gene Tpet_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1019 
Symbol 
ID5171153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1047920 
End bp1049080 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content50% 
IMG OID640563537 
Productamidohydrolase 
Protein accessionYP_001244613 
Protein GI148270153 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.18735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTGA AGATTCTCTT TAAAAACGCC ACGGTCTTTC CCATAACTTC CAGACCCTTC 
AAAGGAGATG TGCTTGTCTC GAACGGAAAA GTGGAAAAGT TGGGAGAAAA CATAGAAGAT
CCCGACGCGG AAATCGTTGA TCTGACGGGA AAATTCCTGT TTCCTGGTTT CATCGACGCG
CACTCCCACA TCGGGCTCTT TGAGGAAGGA GTAGGGTACT ATTACAGTGA TGGAAACGAA
GCTACGGATC CCGTCACACC GCATGTGAAA GCACTCGATG GTTTCAATCC GCAGGATCCG
GCCATAGAGC GGGCTCTTGC CGGTGGAGTC ACATCCGTGA TGATCGTTCC TGGAAGCGCC
AACCCGGTGG GTGGACAGGG AAGTGTGATA AAGTTCAGGT CCATAATTGT GGAAGAGTGC
GTTGTGAAGG ATCCCGCAGG TTTGAAGATG GCGTTCGGAG AAAATCCAAA GAGGGTCTAC
GGTGAGAGGA AACAAACTCC TTCAACGAGA ATGGGAACAG CGGGAGTGAT CAGAGACTAC
TTCACTAAAG TGAAGAATTA CATGAAGAAA AAGGAACTCG CCCAGAAAGA AGGAAAAGAA
TTCACCGAAA CCGACCTGAA AATGGAAGTC GGCGAGATGG TCCTCAGAAA GAAGATTCCT
GCCAGAATGC ACGCCCACCG AGCGGACGAC ATCCTCACCG CCATCAGAAT AGCAGAGGAG
TTCGGTTTCA ACCTCGTCAT AGAACACGGA ACGGAAGCGT ACAAGATTTC TAAGGTGCTA
GCGGAGAAAA AGATACCCGT CGTTGTGGGA CCACTCCTCA CCTTCAGAAC AAAGCTGGAA
CTGAAAGATC TGACGATGGA AACTATCGCA AAACTCCTGA AAGATGGGGT TCTTATAGCC
TTGATGTGTG ATCACCCGGT GATTCCTCTC GAGTTTGCAA CCGTTCAGGC GGCAACTGCC
ATGAGGTACG GTGCAAAGGA AGAAGATCTG CTGAAGATCC TGACGGTGAA TCCTGCTAAG
ATCCTCGGCC TCGAAGATAG AATCGGTTCC ATTGAACCTG GAAAGGACGC GGATCTTGTG
GTCTGGAGCG GACATCCGTT CGATATGAAA TCCGTGGTGG AAAGGGTTTA CATAGACGGA
GTGGAAGTCT TCAGAAGATG A
 
Protein sequence
MSVKILFKNA TVFPITSRPF KGDVLVSNGK VEKLGENIED PDAEIVDLTG KFLFPGFIDA 
HSHIGLFEEG VGYYYSDGNE ATDPVTPHVK ALDGFNPQDP AIERALAGGV TSVMIVPGSA
NPVGGQGSVI KFRSIIVEEC VVKDPAGLKM AFGENPKRVY GERKQTPSTR MGTAGVIRDY
FTKVKNYMKK KELAQKEGKE FTETDLKMEV GEMVLRKKIP ARMHAHRADD ILTAIRIAEE
FGFNLVIEHG TEAYKISKVL AEKKIPVVVG PLLTFRTKLE LKDLTMETIA KLLKDGVLIA
LMCDHPVIPL EFATVQAATA MRYGAKEEDL LKILTVNPAK ILGLEDRIGS IEPGKDADLV
VWSGHPFDMK SVVERVYIDG VEVFRR