Gene Tpen_0322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0322 
Symbol 
ID4600973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp288486 
End bp289943 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content58% 
IMG OID639773082 
Producthypothetical protein 
Protein accessionYP_919734 
Protein GI119719239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGGAG AGAACGTGAT GGGGGATGAA GAGGTTGAGA GAAAGGTTAG AGAGGAGACG 
TGGAGGAGGG CGGCTCTAAG CCTCTACGCC ACCAGGACAG AGGATAAGAG GAAAGCCAGG
GGCGGGAGGA AGGGCAGGGT ACACTACCGC GGCCTCTTCG ACACGGTGTC TACGATTAAC
TGGGACTTTA CGCGCTTCGT GGCGCACGCG CTGACAGTCG TGCCGGACGA CGTCTACCCG
AGGTTCGGCA AGCTGTTTGA CTACGATTTG AGGCAGTACC TCTTGCTAGG CGATGACAAC
AGGCCGAGGG AGGGAAGCGC GGTCGTAGAG CTACGCGATA GGCTACAGGC AATCGTCGAT
GCAACAGAAG ACGGTCTTAG AGCCGAGAAG AAGGGTAAAG TCTGGCATGT ATACATACCT
AGAGAGAACT GGCACGTCAT CGCACACAAG CCTACATATG ACTGGATCGT ACACGTCCCA
TTGGAGGGCT TCTGGACAGA GACTAGTTTC CCGGAGGTTC TCGCGAGGAC ATCGCCAGAC
GTGCTTAGAA GCCTGCAGAG AGGGTGGATC CTGACGGATG TGACACCCCC TCATGGGCGC
TACAGCGACA TAAAATTCGG TACTACCCAG AGCTGGCAAC TTCCAAGCAC GCTCGCAACC
TTTCCAAGCG ACCACGTTGA CATAGGTGTC ACGGCGGGCA TTCTCGGTAG CACCGGGCTG
AGCATTAAGT GGGAAGTGAG CATCCACGGC TACGCTGATG AGCTAGGCTG GGCTTCAGGA
CTTGTAGGCG CTACGAAGCG AGCAGAGTTC CGTGCACTCG TCGAGAGGTG CAAGGAGCTC
AACGGCGACT CGGTGGCACT GGCGACCGCT TTCCTAGGGG ACGGCGAGCT TGAATTCTTC
CTAAGGCTTC GATGGCTCTA CTTTAAGGTT GGACAGGAGC ACATCTACTT GCCGGCGGAG
AGCGCTATAG CTAATGCCCG CGCCGCGGTT GAGCGTGCAT GGGAGTACGT AGCATTCGTA
GGCAAGGTCA CTAGGTGCGC GAAGATTCGG CACTGGCTCT ACGTTGCCTA CGGAGCGCCA
GGCAGGAGGG GTAGGAAGCC CGGGCAGAAT GGTCAGCGTC TCGATCTTTA TGCACCGGTG
GCAGGGGCGT TGCTTAACCT CGTGCTCATA GGGCATGGCG ACTACGCCCG TATCTACGCT
AGGATGCCTG TCGATGGCGC GCCGCCAGGC TGGTACGAGA GAGTAGTAGA GGAGGGTTGG
GTCGTAAGAG TCGTAAGGAA CGCCGGTATG CTCTACTACC AGGTTCCCCA GGACTCTCTT
TTCGAGCACG CGGCGGAAGA TGCAACGCTC TGGGAGGCGC TATACCGCTT CGCAGCCGCA
AAGGCGCAGG CTAAGCCAGC CGCGAAGAAG CTAGTCGAAG AACTGCTGAA GATACGCCTA
GCTGGGGCAC AAGAATAA
 
Protein sequence
MVGENVMGDE EVERKVREET WRRAALSLYA TRTEDKRKAR GGRKGRVHYR GLFDTVSTIN 
WDFTRFVAHA LTVVPDDVYP RFGKLFDYDL RQYLLLGDDN RPREGSAVVE LRDRLQAIVD
ATEDGLRAEK KGKVWHVYIP RENWHVIAHK PTYDWIVHVP LEGFWTETSF PEVLARTSPD
VLRSLQRGWI LTDVTPPHGR YSDIKFGTTQ SWQLPSTLAT FPSDHVDIGV TAGILGSTGL
SIKWEVSIHG YADELGWASG LVGATKRAEF RALVERCKEL NGDSVALATA FLGDGELEFF
LRLRWLYFKV GQEHIYLPAE SAIANARAAV ERAWEYVAFV GKVTRCAKIR HWLYVAYGAP
GRRGRKPGQN GQRLDLYAPV AGALLNLVLI GHGDYARIYA RMPVDGAPPG WYERVVEEGW
VVRVVRNAGM LYYQVPQDSL FEHAAEDATL WEALYRFAAA KAQAKPAAKK LVEELLKIRL
AGAQE