Gene Tpen_0497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0497 
Symbol 
ID4601331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp448338 
End bp450419 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content52% 
IMG OID639773264 
ProductMCM family protein 
Protein accessionYP_919907 
Protein GI119719412 
COG category[L] Replication, recombination and repair 
COG ID[COG1241] Predicted ATPase involved in replication control, Cdc46/Mcm family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGTAA CGCAGGCTAT ACCCCTCACA GAGCGGATAA CAGAGTTCCT GAAGAGGTTC 
ACGGTAGACG GCCGCGAGAA GTACAGGGAT GCTATCCGCA GGATGAGCAT TGAGAGGAGC
ATTTCGCTCG TAATAGACTT CGATGATCTT TTGTTATTTG ACAAGGAGCT GGCAGACATA
CTGCTGGAGC GCCCGCACGA TTTCCTGGAC GCGGCGTCCA AGGCTATAAT GGAGGTTCTC
AAGATAGAAA ACCCGGACTA CGCCAAGGAA GTTGGCTACG TGCATGCACG TATACGTAGA
CCACCCGAAA TCGTCCACTT GAAGATAAGG AACATAAGGG CTAGGCATCT GGGACGCCTA
GTAGCAGTAG AGGGTATTGT GACCAAAATA TCGCCGGTAA AGCAGGAGCT CGTGGAAGGA
GTCTTCAAGT GTAAAACCTG CGGCACAGAG CTGACGGTTC CTCAGGGTCC CGAGGGGTTA
ACAAAGCCTA CCACGTGTCC TGTCTGCTCC GAGAACGGGG TCAAGTCGGC AGGCTTCGTG
TTGCTACCCG AGAAGAGCAA GTTCGTAGAC CTACAGAAGT TCGTGCTACA GGAGAAACCC
GAAGAGTTGC CTCCGGGACA GCTACCCAGG TCGATAGAGG TTCTCGTGAG AGAGGACTTG
GTGGATGTAG TTAGGCCTGG CGATAGGGCA ACAGTGGTCG GGTTCCTCAG GATGGAAGAG
GACAAGAAGC TGGTAAAGAA TGCTCCACCA ATATTTCACG CGTACCTCGA AGCGAACTAC
GTTGAGGTCT CCGCGAAGGA AAACCTGGAC GTGGAGATAA CTCCGGAGGA CGAGAAGAAA
ATACTGGAGC TAAGTAGGAG GGAAGATCTA GAGGAGATAA TAATAAACTC CATAGCGCCC
TCGATATACG GGTACAAAGA GATAAAAACT GCTATAGCGC TACTCCTATT TGGAGGAGTT
CCAAAGATTC ACCCTGACGG CATAAGAGTG CGCGGAGATA TTCACATACT GTTGATCGGT
GATCCCGGAA CTGCAAAGAG CCAGCTCCTA AGGTACGTGG CGTCTATCGC TCCGAGAGGA
CTCTATACCT CTGGTAAAGG CGCTTCTGCC GCAGGACTCA CAGCCGCCGT AGTAAAGGAG
AAGAACAGCG GGGAATTCTA CTTGGAGGCG GGAGCCCTTG TTCTAGCGGA TGGAGGCGTA
GCATGCATAG ATGAGTTCGA CAAGATGGAG GCGAAGGATA GGGTAAGTAT CCACGAAGCC
ATGGAGCAAC AAACCGTGAG CATAGCTAAA GCTGGGATCG TAGCAACACT CAACGCTAGG
GCATCCATCC TGGCAGCAGC CAACCCAGCA TTCGGCCGCT ATCTCCCTGG CAGGAATATT
TCAGAGAACA TAGATCTACC CGTCACGATA CTTTCAAGGT TCGACCTAAT ATTCGTGGTT
AGAGATACCC CCAACGCTGA AAGAGACCGG GAACTCGCAC AGTACGTTGT CGACTTTCAC
GGGGAAACTT ACCCCGTATC TTTGGAGAAA GTTCTCGACG CGCAGACGCT CAAAAAGTAC
ATCGCGTACG CCAGACGCCA CGTGCGCCCG AGGCTCTCCC CAGAAGCCAA GAGCAAGATA
GTAGAATACT ATGTCAACAT GCGTAAAAAG AGCGAAGACG CCAGCTCTCC GATAGCTATA
ACGCCAAGGC AACTGGAAGC TCTTATCAGG CTCTCGGAGG CTCACGCTCG CATGCATCTC
CGCGACGTGG TTACTGCTCG TGACGCGGAA GTCGCTATTA GCCTTATGGA ATACTTCCTG
AGAAACGTCG GCATAGACAC GCAAACAATG ACCATCGATA TCGATACTAT AATGACTGGG
CAGCCTAAGT CTCAGCGCGA GAAGCTGATC GCCGTGCTGG ACACGGTTAA AAACCTCGTT
AGACAAAACA ACGGCGAGCC TATAAAGGAG GAGGACCTGT ACTCCGAGTT GGAGAAAAAC
GGCATTGACA GGAACTTCGC TAGGAAAGCT ATAGAGAAGT TGCTTGAACA GGGAGAATTG
ATGCAGCCGT CACCGGGGCG CATTAGCGTT GTTTCTCTCT AG
 
Protein sequence
MVVTQAIPLT ERITEFLKRF TVDGREKYRD AIRRMSIERS ISLVIDFDDL LLFDKELADI 
LLERPHDFLD AASKAIMEVL KIENPDYAKE VGYVHARIRR PPEIVHLKIR NIRARHLGRL
VAVEGIVTKI SPVKQELVEG VFKCKTCGTE LTVPQGPEGL TKPTTCPVCS ENGVKSAGFV
LLPEKSKFVD LQKFVLQEKP EELPPGQLPR SIEVLVREDL VDVVRPGDRA TVVGFLRMEE
DKKLVKNAPP IFHAYLEANY VEVSAKENLD VEITPEDEKK ILELSRREDL EEIIINSIAP
SIYGYKEIKT AIALLLFGGV PKIHPDGIRV RGDIHILLIG DPGTAKSQLL RYVASIAPRG
LYTSGKGASA AGLTAAVVKE KNSGEFYLEA GALVLADGGV ACIDEFDKME AKDRVSIHEA
MEQQTVSIAK AGIVATLNAR ASILAAANPA FGRYLPGRNI SENIDLPVTI LSRFDLIFVV
RDTPNAERDR ELAQYVVDFH GETYPVSLEK VLDAQTLKKY IAYARRHVRP RLSPEAKSKI
VEYYVNMRKK SEDASSPIAI TPRQLEALIR LSEAHARMHL RDVVTARDAE VAISLMEYFL
RNVGIDTQTM TIDIDTIMTG QPKSQREKLI AVLDTVKNLV RQNNGEPIKE EDLYSELEKN
GIDRNFARKA IEKLLEQGEL MQPSPGRISV VSL