Gene Tneu_1735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1735 
Symbol 
ID6164419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1527501 
End bp1529315 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content61% 
IMG OID641668898 
ProductDNA topoisomerase type IA central domain-containing protein 
Protein accessionYP_001795099 
Protein GI171186180 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.718005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000823973 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACCTAA TCGTAGCAGA GAAGAGATCC GTCGCCCAGG CCATAGCCAG ATACCTGGGG 
GGGTCCTACA AGTCAGGGAG GCTTTACGGC GTGCCGTTCT ATTCCTTCAC CTACCTCGGC
AGAGAAGCCG CGGCGCTCGG CCTAAGCGGC CACATCATGG ACTACGACTT CACCGCCAGG
GAGAACGTGT GGACGTGGAT CCCGCCCGAG GAGCTGTTTA GAGCCACCCC AGTCCTCGTC
TTCCGCCCCG AGACCGCCAA CTACGTAAAA GCCCTGAGGT CGCTGGCGAA GAAGGCGGAG
CGGGTGTACC TCGCCCTCGA CGCCGACGTG GAGGGGGAGG CCATCGCGTA CGAGGCAGCC
CTCGTCGTAA AACTCGTCAA TAGGAGGGCG GAGATCTACA GAGTGCGCTT CAACGCGGTC
ACGTATAGAG ATGTGAGGTC GGCCTTCCAG AAGCCCACGA AGCTGGACCT GAGACAGGTG
GAGAAGGTGT TCACGAGGAT GCAGATAGAC CTAACGCTGG GCGCCGTCTT CACCAGGTTC
CTCACCCTCA CCGTCAGAAA CTCGCTTGAG AGGGGGAGGT TCCTCAGCTA CGGCCCCTGT
CAAACGCCGG TCCTCGGCAT CGTGGTCACG AGGGAGCTAC AGCGCAGGAA CTTCAAACCC
GAGAAGTACT ACGTCATAAA GGCGCTGGTG GAGATAGGGG GTCACGCCGT CGAGATGGCG
TCAACCGAGA GGTTTAAGAC GAGGAGGGAG GCCGAGACAG CCGCCGCCTC CGTCAAAAGA
GGCGTGGTGA AAACCGCCGT CTATAGACAG CACGCCGTAC AGCCCCCCGA GCCGCTGGAG
ACAGTTGAGC TGGAGCGCAG AGCCAGCAGG TGGCTCGGCA TAAGCTCGAA AAAGGCCCTA
GACACCGCCG AGGAGCTATA CCGCGCCGGC TACATCTCCT ACCCGCGTAC GGAGACCACC
ATATACCCCT CCACGCTGGA CCTTCGGGAG GTTTTAAAGG AGCTGGCGGG CGGGGAACAC
GGCCCCTATG CCGAGGAGTT GCTAAAAAGG GGCTTCAAAC CCACCAGAGG CGACTCAGAC
GACGGAGCCC ACCCCCCGAT ATACCCCACC AGGGGCGCGA CCCAGGGCGA GATCTACAAG
GTCTTCGGCA AGCTTGGCAG ACAGGCGTGG GCCATATACG ACCTAGTCGT TAGGCACTTC
CTCGCGACCC TCAGCCCGCC AGCTCTCGTA GAAAAGCAGA GGATCGTGGC GACCTTCGGA
GGTGTTGAGC TTGAGGCGGA GGGCCAGAGG ACGATTGAGG AGGGCTACTG GAGGATATAC
CCCTGGGAGC GGCAGAGGGA TAAGCCCCTG CCGAGGGTGG AGGCCGGCGA AGCCGCCAAA
GCCATCCGCG TGGAGGTCGT CGAGCGCGAG ACCGAGCCGC CGCCTCAGAT GTCCGAGTCC
GAGCTACTCG CCCTTATGAA AAAGTACGGA ATTGGGACCG ACGCCACCAT GCAGGACCAC
ATACATACCA ACGTGAAGAG GGGCTACATG CGCCTCCAGA GGGGCAAGTG CATCCCCACT
AAGCTCGGCG AGGCCTTGGC CACGGCGCTG TTTCAGTACG CCCCAGAGCT CATAGAGCCG
ACAGTCCGCG CCAAGATGGA GAAGGCTCTC CAAGACGTCG TAAGGGGGGC GGCGGCGCCA
ACGAGACTAA TCCAGGAGAT AAAGGACGAG TTCGAGAGGT ACTACAAAGC GCTGAAGGAG
CGCAGGCAGG ACCTCAAGAC GACCCTCGAA ACGGCTTTAA AATCCATGTC GGAGGACGGC
AGTGGAGGCG GTTGA
 
Protein sequence
MDLIVAEKRS VAQAIARYLG GSYKSGRLYG VPFYSFTYLG REAAALGLSG HIMDYDFTAR 
ENVWTWIPPE ELFRATPVLV FRPETANYVK ALRSLAKKAE RVYLALDADV EGEAIAYEAA
LVVKLVNRRA EIYRVRFNAV TYRDVRSAFQ KPTKLDLRQV EKVFTRMQID LTLGAVFTRF
LTLTVRNSLE RGRFLSYGPC QTPVLGIVVT RELQRRNFKP EKYYVIKALV EIGGHAVEMA
STERFKTRRE AETAAASVKR GVVKTAVYRQ HAVQPPEPLE TVELERRASR WLGISSKKAL
DTAEELYRAG YISYPRTETT IYPSTLDLRE VLKELAGGEH GPYAEELLKR GFKPTRGDSD
DGAHPPIYPT RGATQGEIYK VFGKLGRQAW AIYDLVVRHF LATLSPPALV EKQRIVATFG
GVELEAEGQR TIEEGYWRIY PWERQRDKPL PRVEAGEAAK AIRVEVVERE TEPPPQMSES
ELLALMKKYG IGTDATMQDH IHTNVKRGYM RLQRGKCIPT KLGEALATAL FQYAPELIEP
TVRAKMEKAL QDVVRGAAAP TRLIQEIKDE FERYYKALKE RRQDLKTTLE TALKSMSEDG
SGGG