Gene Tpen_1250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1250 
Symbol 
ID4600545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1186233 
End bp1188059 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content54% 
IMG OID639774026 
Productnucleotidyl transferase 
Protein accessionYP_920651 
Protein GI119720156 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1213] Predicted sugar nucleotidyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTAG TTATTGTAGC GGCAGGCGAG GCTTCGAGGC TTAGGCCTTA CTCCGAGGAA 
ATGCCGAAAA CACTCATGGA ACTGAGGCCC GGCCTTCCTA TAATCGACTT CATACTGAAA
AGGGTTATGG CTCTCTCTCC TAGGAAGGTG GTGGTTGTTA CGAGAAGGAC GTGGAAGGAT
ATTCTCGCCT CGCATCTCGC CGGCTTAGCC GAAGTCGTTA CAGTAGACCT CGAGGGCGGG
TTTGGGAACC TCTACAGCGT GTACACGGCG CTCACAAGAG TCGGGGACGA GTTCCTGATA
TTGATGTCTG ACCACATATT CGAGGAGGAA GTACTGATGA GGCTTGCAGA ACACGCGAGC
AAAGCGTCGT TTACCGTTTG TCTCGACAGG AATCCCTCCC GCTCCGAGGC TGTGGAAGGT
TTGAAAGTTA TAATCGAAGG AGGACTGGTT AGGGACGCTG GCAAAGATGC GGCGCCCCGG
TACGGGATCG ATACAGGGGC GATAATGGCG CGCGGGAGAG CCAAAGAGTT TATCGAGGAA
GTGATCAGAC AGAAAGGTCC AGGGGCATCT ATTGCCGACG TGCTGAGATT CGCCGCCTCT
AAGGGAGAGG AGGTCGACTA CGTAGACGTT ACGGGGCTTT TATGGAAAGA TATCGACACC
CCCGAGGACC TCGTTAAGGC TAGGAAACTC GTCGAGAAGG TGGAGCTCAG GGAGCTAGCG
AAGAGGACTG GGGATCCCCT CTCGCGAGCT CTTTTGAAAC CCCTAACGAC GAGGCTTTCG
ATGCTCGCCT TAAAGAACTC TCTGCTGAAA CCCGCCATGC TTACTAGCTG GGCTTCGGCG
TTTTTAGCAT CGTGGGCTCT GCTTCTCCCA CAGGTGGAGA GCAACGCGCT TCTAATGTTT
CCCCTATTAG CCGTGCACGT AGTAGCGAGC GAACTTCTAC GCGAGCTAGC AGTGCTAGCA
TCGTGGGACG TCACGGCGAT ATCGCAGGTA GATTTCCTCG CTACACTACC CTTAGCGGCT
TTTGTCTCAA GGCTTGCCGA TACATGGCCT TTGTCTGTGC TTGCGGGGGC TGGGATACTT
TTCACTTCGC TTGCCGGGCT CCCCAGAATG GCAAGCGCGG AGGGTTACTC AGGGTTACTG
GTCTCCAAGC TTCTCGCTAC TATCGTGCTT TCGCTCTCTC TTTTCGTGGG TGGGGCGATC
TACGGGGTAG CCTTTTGCGC ACTAATGCCC TGGTTGCACA TAGCTTTAAG GCTTAAGCCG
CGTAGATCTG GGCTTGCCAG AGGGAAAGAG CCTTCCCATG TACCCATCCC CGCGGTTCGT
GTTGACGAGA TCGCCGAGAA AATCGAGTCC CTTGTAACGA ATGTGGTGAA GCTAGCGGTG
GTTCTAGCTG TGCTTACGAT GCTCTCCCCC CTCTCATCGG ACGTCCTCTT CCAAGTCGAT
GAATATTCGC TACGCGTGGG GACTCTTCTA ACGGTCGCGC AGTTAACCGC CGTTGTGTAC
TATGGCTATA AGATCCTGGA ACCTCTCTTT AGCCTGCTAG ACGCCCTGGC GACAAGACTC
GTCGAGAAAC TCGGGGTAAC AAGGTACACT GCTAGGCGTA TACTGGTAGA GTCCGTGTAC
TTGGTGCTAA TAGTGCTTAC ATGGTTCCTC CTCCCCAGCC TCAAGAACAT ACCCGGTATT
GGCGGTGTGC TCTACAGGGT TCTAACGATA GTTATCGTCG CGCTTTTTGC CCTTATCTCC
TACGACTTGG TGAGGCTTCT CTTCAGGGTA TTCGAGGACA CGCTTAAGCG CATAGTACAC
GCTATAGCCC GTTTTGTCTC CGGGTGA
 
Protein sequence
MDVVIVAAGE ASRLRPYSEE MPKTLMELRP GLPIIDFILK RVMALSPRKV VVVTRRTWKD 
ILASHLAGLA EVVTVDLEGG FGNLYSVYTA LTRVGDEFLI LMSDHIFEEE VLMRLAEHAS
KASFTVCLDR NPSRSEAVEG LKVIIEGGLV RDAGKDAAPR YGIDTGAIMA RGRAKEFIEE
VIRQKGPGAS IADVLRFAAS KGEEVDYVDV TGLLWKDIDT PEDLVKARKL VEKVELRELA
KRTGDPLSRA LLKPLTTRLS MLALKNSLLK PAMLTSWASA FLASWALLLP QVESNALLMF
PLLAVHVVAS ELLRELAVLA SWDVTAISQV DFLATLPLAA FVSRLADTWP LSVLAGAGIL
FTSLAGLPRM ASAEGYSGLL VSKLLATIVL SLSLFVGGAI YGVAFCALMP WLHIALRLKP
RRSGLARGKE PSHVPIPAVR VDEIAEKIES LVTNVVKLAV VLAVLTMLSP LSSDVLFQVD
EYSLRVGTLL TVAQLTAVVY YGYKILEPLF SLLDALATRL VEKLGVTRYT ARRILVESVY
LVLIVLTWFL LPSLKNIPGI GGVLYRVLTI VIVALFALIS YDLVRLLFRV FEDTLKRIVH
AIARFVSG