Gene Tpen_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0285 
Symbol 
ID4602095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp249254 
End bp251920 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content55% 
IMG OID639773041 
ProductDNA-directed RNA polymerase subunit A' 
Protein accessionYP_919698 
Protein GI119719203 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02390] DNA-directed RNA polymerase subunit A' 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGA GCACGAATCC CAAGATAGTC GCCGGGATAA AGTTTGGGAT ACTCTCTCCA 
GAGATGATCA GGAAAATCGC CGTGCTAAGG ATAGAGACCA GTGAGCTCTA CGACGAGGAA
GGCTTCCCGA TACCTGGCGG GCTCATGGAT AGAAGAATGG GATCCATAGA GCCGGGAGCC
GTCTGCCAGA CATGTGGAAA CAGGTTCACG AACTGCCCAG GGCACTTTGG ATACATAGAG
CTCGCGAGAC CCGTGATACA CCCGAGCTTC GCCCCATACA TAGCTATCCT TTTGAAGGCT
ACCTGCAACA GGTGCGGGAG GCTCAAGTTA CCCGAGGAGA AGATAAACAA GGCTAAGAAG
CGCATGGAGG TGTACTCCGC CAAGTGGCCA AGCCTGAAGA CGAAGTATGC TAACACCCTA
CTTAAAGAGG CGGCTAAGGC TACGGTATGC CCACACTGCG GCGCCCCCCA GTACAAAATC
AGACTGGACA AGCCGTACAC GTTCTACGAG GAGAGAGAGG AAGGGCTCGT AAAGCTTACA
CCGCTCGAGA TATGGGAAAG GCTTTCGCGC ATACCGGAGG GGGACTTAAG GGTTGTAGGG
ATAGACCCCA AGGAGGCTAG ACCCGAGTGG ATGGTGCTAC GCGTCATACC GGTTGTACCG
CCGTCTGTAC GCCCGTCGAT AACCCTGGAG AGCGGCGATA GAAGCGAGGA CGATCTCACG
CACAAGCTGG TCGACATAAT CCGGGTAAAC CAGAGGTTAA AGGAGAACAT AGACGCCGGA
TCTCCGTCGC TAGTCATAGA GGACCTCTGG AACCTACTCC AGTTCCACGT AGCAACGTAC
TTCGACAACG AGCTCCCCGG GATACCCCCC GCTAGACACA GGTCGGGCAG ACCCCTCAGA
ACCCTCGCGC AGCGCCTTAA AGGTAAGGAG GGGAGGTTCA GGGGGAGCCT GGCCGGCAAG
CGTGTCGACT TCTCCTCTAG AACAGTGATC TCTCCTGACC CGAACCTAAG TATAAACGAG
GTGGGCGTTC CCATAGACGT AGCAAAGGTT CTAACGGTCC CGGAGAAAGT TACCCCCTGG
AACATTGAGA AACTGAGGAA ACTCGTCATA AACGGACCAG ACGTATGGCC CGGGGCGAAC
TACATAATCA AGCCGGATGG GTCGAGGATA GACTTGAGAT ACGTCAAGCA CCGCGAGGAG
ATCTCCCAGA CGCTAAAACC CGGCTACATA GTGGAGAGGC ACCTGATGGA CGGCGACGTC
GTACTCTTCA ACAGGCAACC CTCTCTACAC AGGATCTCGA TAATGGCGCA CATAGTCAAG
GTACTTCCGT ACAAAACCTT CAGGCTTAAC CTTCTGGTAA CAATTCCCTA CAACGCGGAC
TTCGATGGAG ACGAGATGAA CCTTCACGTA CCTCAGAGCG AAGAGGCCCG CGCCGAGGCG
CGGGAGCTCA TGTTGGTACA GGAGCACATA ATGACGCCCA GGTACGGGGC TCCCATAATA
GGGGGTCTGC ACGATTACAT CTCCGGTAGC TACCTTCTAA CGAGGAAGGA CGCCTTGCTG
GACAAGAAGA GCGCCTTAAG GCTGCTCTAC ATCGGGAACA ACTGTGACCC GCTGGTGGAG
CCAGCCATAA TTAAGCCCGG ACCCTACTGG ACGGGGAAGC AAATTGTCAG CATGTTCCTC
CCGAAAGGAC TGAACTACGT TGGACGCGCA AGCGTTGCAC CTGCCTCTGG TAAGTGTGAC
GAGGAGTACT GCGAGAACGA CGGCTATGTT TTGATCAAAG ACGGGAAGCT ACTGCTCGGT
GTCTTTGATA AGCAGGCCAT AGGGGCGGAG AAGCACGGTA CGGTTCTGCA CGAGATCGTA
CGGGAGTTTG GAGTCGAAAA AGCGAAAGAA CTCATGGACG GCATGTTCAA GGTATTCATA
GCTTACCTGG ACATGCACGG TTTCACGATG GGGGTTGACA GCGTGGAGAT CCCGAGGGAA
GCCGAGGACG ATATCAGGGA GATACTGCAG GAGGCGGAGA AAAAGGTCGA GGACTTGATC
AGGCAGTACG AGAGCGGCGA GCTGCAGCCC ATGCCTGGGA AGACCCGCAA GGAGACCCTC
GAGGACTTGA TAATGAACGT GCTCGCAGAG GCGAGGACTC GGGCAGGCGA AGTCACGAGT
AAGCACCTAG GTCTGCTCAA CCACGCGGTT ATAATGGCTA AGACAGGTGC GAGAGGAAGC
ATGCTCAACT TGACGCAGAT GGCAGCGGTC GTCGGGCAGC AGTCAGTTAG AGGGAAACGG
ATAGAGAGAG GATACACCGG GCGAGCGTTA CCACACTTCG TTAAAGGCGA CCTCTCGCCG
CTCGCTAAGG GATTCGTTTA CAGCTCATTT CGCAGGGGCT TGTCCCCCGT GGAGTTCTTC
TTCCACGCAA TCTCCGGAAG AGAAGGACTC GTAGACACAG CCGTCAGGAC TGCCCAGTCC
GGGTACATGT ACCGCAGACT TCAGAGCGCA ATGCAGGACT TCTACGTATC GTACGACGGG
ACTGTCAGGA ATAGCGAGGG GATGATAATA CAGTTCAGGT ACGGCGAGGA CAGCGTTGAC
CCTGCGAGGA GCGACCACGG GAAACCCGTA GACGTTGATA AGTTGATAAA GAAGGTCTTG
ACACTAAGGG GTGAGAAGCG TGAGTGA
 
Protein sequence
MSTSTNPKIV AGIKFGILSP EMIRKIAVLR IETSELYDEE GFPIPGGLMD RRMGSIEPGA 
VCQTCGNRFT NCPGHFGYIE LARPVIHPSF APYIAILLKA TCNRCGRLKL PEEKINKAKK
RMEVYSAKWP SLKTKYANTL LKEAAKATVC PHCGAPQYKI RLDKPYTFYE EREEGLVKLT
PLEIWERLSR IPEGDLRVVG IDPKEARPEW MVLRVIPVVP PSVRPSITLE SGDRSEDDLT
HKLVDIIRVN QRLKENIDAG SPSLVIEDLW NLLQFHVATY FDNELPGIPP ARHRSGRPLR
TLAQRLKGKE GRFRGSLAGK RVDFSSRTVI SPDPNLSINE VGVPIDVAKV LTVPEKVTPW
NIEKLRKLVI NGPDVWPGAN YIIKPDGSRI DLRYVKHREE ISQTLKPGYI VERHLMDGDV
VLFNRQPSLH RISIMAHIVK VLPYKTFRLN LLVTIPYNAD FDGDEMNLHV PQSEEARAEA
RELMLVQEHI MTPRYGAPII GGLHDYISGS YLLTRKDALL DKKSALRLLY IGNNCDPLVE
PAIIKPGPYW TGKQIVSMFL PKGLNYVGRA SVAPASGKCD EEYCENDGYV LIKDGKLLLG
VFDKQAIGAE KHGTVLHEIV REFGVEKAKE LMDGMFKVFI AYLDMHGFTM GVDSVEIPRE
AEDDIREILQ EAEKKVEDLI RQYESGELQP MPGKTRKETL EDLIMNVLAE ARTRAGEVTS
KHLGLLNHAV IMAKTGARGS MLNLTQMAAV VGQQSVRGKR IERGYTGRAL PHFVKGDLSP
LAKGFVYSSF RRGLSPVEFF FHAISGREGL VDTAVRTAQS GYMYRRLQSA MQDFYVSYDG
TVRNSEGMII QFRYGEDSVD PARSDHGKPV DVDKLIKKVL TLRGEKRE