Gene Tpen_1666 details


Gene Information

Locus tag: Tpen_1666
Symbol:
ID: 4601248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1612873
End bp: 1613778
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 61%
IMG OID: 639774439
Product: dihydrodipicolinate synthase
Protein accession: YP_921064
Protein GI: 119720569
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
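
The coordinate, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch and not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in exactly one stop codon. The GC helper simply counts G and C bases in a sequence pasted in the spaced 10-base blocks used on this page.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record.
    length = gene_length(1612873, 1613778)
    print(length, "bp;", protein_length(length), "aa")
```

With the Start bp and End bp values listed here, this gives 906 bp and 301 aa, in agreement with the Gene Length and Protein Length fields.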


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.458011
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCCA GGTTCTACGG AGTGATATCC CCATTCATCA CGCCGTTCAG GGAGGACCTC 
TCGCTGGACA GGGAGGCGGT CGCCTGGCTC GCCAGGTACC AGGCCGAGAA GGGGGTTCAC
GGGATCTTCC CGAACAGCAC TACCGGGGAG TTCGTGCACC TATCGAGGGA GGAGGCCGTC
GAGGTAACGA GGCTGGTCCT GGAGGCTGTC GGCGGCAAGG TCTGGGTTAT CCCGGGTATC
AGCGCTAACT ACACTGAGGA CTCCGTCGCT CTCGGGAGAA CCTTCAAGGA CTTGGGGGTC
GACGGCGCCG TGGTTACTCC TCCCTACTTC TTCAAGGTGT CCCCGGAGAG GCTGAAGGTC
CACTTCTCGA CTATCCTCGA AAAGGTAGAC CTCCCGATAA TAGTGTACAA CATACCGGCG
ACTACGGGGA TCAACATACC GGTGGGGCTC TACCTGGAGC TCGCGAAGGA GCACAGCAAC
CTGGCGGGCG CCAAGGCTAC CGTCGAGAGC TTCACCTACT TCCGCCAGCT GGTACAGGTA
GTGAAGGCTG AGAGGAAGGA CTTCGCCGTG CTGACAGGGC TCGACGACCT CCTGCTACCG
GTGCTGATGA TGGGAGGCGA CGGCGGGATA ATGGCGCTCG CAAACGCCGC CCCGCAGATA
CACCGCGAGG TCTACGACGC GTACAGATCC GGGGACCTGA AAAGGGCGTT GGAGGCTTGG
CACAAGCTCT TGAGGCTCGT ACGCGTCTAC GACTACGCCA CCTCCTTCCC GACCTCCGTG
AAGACTTTGC TGAAAGTCAT GGGTGCCCCG GTAAAGCCGT ACGCTAGGAC GCCTCTCACC
CCGGAGACGC GGGAAGTGGA GGAAAAGATA GCGCAGATAG CTAGGGAGCT GGGCCTCAAA
ATATAA
 
Protein sequence
MSARFYGVIS PFITPFREDL SLDREAVAWL ARYQAEKGVH GIFPNSTTGE FVHLSREEAV 
EVTRLVLEAV GGKVWVIPGI SANYTEDSVA LGRTFKDLGV DGAVVTPPYF FKVSPERLKV
HFSTILEKVD LPIIVYNIPA TTGINIPVGL YLELAKEHSN LAGAKATVES FTYFRQLVQV
VKAERKDFAV LTGLDDLLLP VLMMGGDGGI MALANAAPQI HREVYDAYRS GDLKRALEAW
HKLLRLVRVY DYATSFPTSV KTLLKVMGAP VKPYARTPLT PETREVEEKI AQIARELGLK
I
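
The protein sequence follows from the gene sequence under translation table 11, where the GTG start codon is read as methionine. The sketch below illustrates that relationship and is not the tool used to produce this record. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical. The codon-to-amino-acid assignments are the standard ones, which table 11 shares, and `START_CODONS` is a deliberately simplified subset of the initiation codons that table 11 allows.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M.

    Whitespace between sequence blocks is ignored, translation stops at the
    first stop codon, and a trailing partial codon is dropped.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("GTGAGCGCCA GGTTCTACGG AGTGATATCC"))
```

Applied to the first 30 bases of the gene sequence, this yields MSARFYGVIS, the first ten residues of the protein sequence.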