Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1053 |
Symbol | |
ID | 5876592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1087901 |
End bp | 1090684 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641541408 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001662688 |
Protein GI | 167039703 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00826345 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAAG ACAAAATTGT AATTAAAGGG GCAAGAGTTC ATAATTTAAA AAATGTAGAT TTGGAGATTC CACGAGATAA ATTAACTGTG ATAACGGGTT TATCAGGTTC TGGTAAGTCT TCTCTTGCCT TTGATACCAT TTATGCTGAA GGTCAGAGAA GATATGTGGA GTCGTTGTCA GCTTATGCAA GGCAATTTTT AGGACAAATG GACAAACCAG ACGTTGATTA TATAGAAGGT CTTTCACCGG CAATATCTAT AGACCAAAAG ACCACAAATA AAAACCCTCG TTCTACGGTA GGAACGATAA CTGAGATTTA CGACTATCTA AGACTTTTAT ATGCAAGGGC TGGGATTCCC CATTGTCCTG TTTGTGGAAA AGAGATTAGC ATGCAAACTA TAGATCAGAT GGTAGACAGA GTCAAAGAAT TGCCAGAAGG TACAAGGATA CAGGTACTGG CTCCTGTTAT TAGAGGAAGA AAAGGGGAAT ATGCTAAGCT TTTAAATGAC ATAAAAAAGA GCGGATATGT CAGAGTGAAA ATTGATGGAG TCATGTACGA TGTAAATGAG GAGATTAAAC TTGACAAAAA TAAAAAGCAT ACTATTGAAG TTGTGGTAGA CAGGGTTATT ATAAAACCGG GAATAGACAT GAGGTTGACA GACTCTATAG AAACAGCTTT AAAATTAGCA GATGGGATAG TTTCTATTGA TGTAATAGAC GGAGAAAGCT TTACACTCTC TGAAAAATAC GCTTGTACAG AGTGCAACAT AAGTATTGAA GAACTTTCCC CGAGAATGTT TTCCTTCAAT AGCCCTTATG GGGTTTGCCC TGTTTGCACT GGATTGGGAG AATTTATGAA GGTAGATCCA GAGCTTTTAA TACAAGACCC TAAAAAATCA TTAGCAAATG GGTTGTTGCC GGGGATTGTT GCTTCACAGG ATAGTTATGC TTATTACAAC ATTTTAAGAT TAATTGAACA TTTTGGATAT ACAGAGAATA CTCCTTATGA AAAGTTTAGT GAAGATTTAA AGAATGTACT GCTTTATGGC AAAGATACAA AAGGTAAGTC CTATGGATTT GAAGGTATAG TAAACAATCT TGAAAGAAGG TACAACAATA CTTCTTCAGA TTTTATAAAA GAAGAGATAG AGAAATATAT GAGACCGGTT ACTTGTCCTG CATGCCATGG GGCTAGATTA AAGCCAGAAG CGCTAGCTGT GACTGTTGGT GGACTTTCCA TAAAAGAAAT GACAGACCTT TCTGTTGGCG AGCTTATAAA ATTTATTGAG GAACTCAAAT TAACAGAAAA ACAAGAGATT ATTGCAAAGC CAATTTTAAA AGAAATAAAG GCAAGGCTTA ATTTTCTTGT GGATGTAGGA CTGGATTATT TGACTCTTTC AAGACCTGCA GCTACTTTAT CAGGTGGAGA AGCACAAAGG ATAAGATTGG CAAGCCAGAT AGGCTCTGGG CTTGTAGGAG TCACATACAT CCTGGACGAG CCGAGCATTG GACTTCATCA AAGAGATAAT GAAAGGCTTA TAAATTCCTT GAAAAAATTA AGAGACCAAG GCAATACTCT TATAGTAGTA GAGCACGATG AGGATACAAT ATATGCGGCA GACTACATTG TGGATGTAGG ACCAGGCGCA GGTGAGCATG GAGGAGAAAT TGTAATTGCG GGTACGATAG AAGATGTGTT AAAATGTGAA AAATCAATTA CAGGCCAGTA TTTAAGTGGT AAAATAAAGA TAGAAGTGCC AAAACAGAGG AGAAAACCTA ATGGAAAAGC TTTAATAGTG AAAGGAGCTA AGGAAAACAA TTTAAAGAAT ATAGATGTGG TTTTCCCCCT CGGAGTATTT ATATGTGTTA CAGGGGTTTC TGGCTCAGGC AAAAGCACCC TTATAAATGA GATACTGTAC AAAGCATTGG CACAGAAGAT TTATAAGTCC AAAGATAAAC CAGGTATGCA CGATGCAATA GAGGGTATCG ATAATATAGA TAAAGTAATA AATATTGACC AGTCTCCTAT AGGCAGGACT CCTCGCTCAA ATCCTGCTAC CTATACAGGA GTTTTCGACT ATATAAGAGA GGTTTTTGCA AATACTCCAG AAGCTAAAAT GAGAGGCTAT AAACCAGGAA GATTTAGTTT TAATGTTAAA GGTGGAAGGT GTGAAGCCTG TGGTGGAGAT GGTATAATTA AAATTGAGAT GAACTTTTTG CCGGATGTGT ATGTCCCTTG TGAAGTCTGC AAAGGGCAAA GGTATAATAG GGAAACATTG GAAGTAAAAT ATAAAGGGAA AAATATTTCG GATGTACTTA ATATGACGGT AGAAGAGGCG TTAGAATTTT TTGAAAATAT ACCCAGGATA AAAAATAAAT TGATGACTTT ATATGATGTG GGTTTAGGAT ATATTAAGCT AGGGCAACCT TCTACTCAGC TTTCAGGAGG AGAAGCACAA AGAGTAAAAT TAGCTACTGA ACTTTCTAAA AGGCCTACAG GTAAAACACT ATATATTTTG GATGAGCCTA CAACTGGATT ACATTTTGCA GATGTGCATA GGCTTCTTGA AGTTTTAAAT AGATTAACTG ATGCGGGCAA TACTGTTATT GTAATTGAAC ACAACTTAGA TATCATAAAA AGTGCAGACT ATATCATTGA CTTAGGACCA GAAGGTGGAG ACAAGGGTGG AAGGGTAATA GCTACAGGGA CACCTGAAGA AGTTGCTGCT AATGAGAATT CTTATACAGG CCATTTTTTG AAAAAAGTCC TTTCTCAAAA ATGA
|
Protein sequence | MAKDKIVIKG ARVHNLKNVD LEIPRDKLTV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS AYARQFLGQM DKPDVDYIEG LSPAISIDQK TTNKNPRSTV GTITEIYDYL RLLYARAGIP HCPVCGKEIS MQTIDQMVDR VKELPEGTRI QVLAPVIRGR KGEYAKLLND IKKSGYVRVK IDGVMYDVNE EIKLDKNKKH TIEVVVDRVI IKPGIDMRLT DSIETALKLA DGIVSIDVID GESFTLSEKY ACTECNISIE ELSPRMFSFN SPYGVCPVCT GLGEFMKVDP ELLIQDPKKS LANGLLPGIV ASQDSYAYYN ILRLIEHFGY TENTPYEKFS EDLKNVLLYG KDTKGKSYGF EGIVNNLERR YNNTSSDFIK EEIEKYMRPV TCPACHGARL KPEALAVTVG GLSIKEMTDL SVGELIKFIE ELKLTEKQEI IAKPILKEIK ARLNFLVDVG LDYLTLSRPA ATLSGGEAQR IRLASQIGSG LVGVTYILDE PSIGLHQRDN ERLINSLKKL RDQGNTLIVV EHDEDTIYAA DYIVDVGPGA GEHGGEIVIA GTIEDVLKCE KSITGQYLSG KIKIEVPKQR RKPNGKALIV KGAKENNLKN IDVVFPLGVF ICVTGVSGSG KSTLINEILY KALAQKIYKS KDKPGMHDAI EGIDNIDKVI NIDQSPIGRT PRSNPATYTG VFDYIREVFA NTPEAKMRGY KPGRFSFNVK GGRCEACGGD GIIKIEMNFL PDVYVPCEVC KGQRYNRETL EVKYKGKNIS DVLNMTVEEA LEFFENIPRI KNKLMTLYDV GLGYIKLGQP STQLSGGEAQ RVKLATELSK RPTGKTLYIL DEPTTGLHFA DVHRLLEVLN RLTDAGNTVI VIEHNLDIIK SADYIIDLGP EGGDKGGRVI ATGTPEEVAA NENSYTGHFL KKVLSQK
|
| |