Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0255 |
Symbol | |
ID | 3833218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 259486 |
End bp | 262401 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828191 |
Product | excinuclease ABC subunit A |
Protein accession | YP_429133 |
Protein GI | 83589124 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGGG ATAAAATCGT CATCAAGGGA GCACGGGCCC ACAACCTGAA AAATATCGAT GTTACCATTC CCCGGGACCA GCTGGTGGTC ATTACCGGCC TGTCCGGCTC GGGCAAGTCG TCCCTGGCCT TTGACACCAT TTATGCCGAG GGCCAGCGGC GTTACGTCGA GTCCCTTTCC TCCTACGCCC GGCAATTCCT GGGGCAGATG GATAAACCCG ATGTCGACGT TATCGAGGGG TTGTCCCCGG CCATCTCCAT TGACCAGAAG ACGGCCAGCC ATAACCCCCG CTCCACCGTG GGGACGGTGA CGGAGATCTA TGACTACCTG CGCCTCCTCT TTGCCCATAT CGGCCGCGCC CATTGTCCCC GTTGCGGCCG GCCCATCACT CCCCAGACGA TCTCCCAGAT GGTGGATCGC CTGCTGACCT ATCCGGAGGG TACCCGTCTC CAGGTCATGG CCCCCATTGT TCGGGGCCGC AAGGGGGAGT ATCGTAACGT TCTGGAAGAG ATTCGCCGGC AGGGTTACGT CCGTGTCCGG GTGGACGGGG AGATTCGGGA AACCAGTGAC AATATCAGCC TGGCCAAGAA TAAAAAGCAT ACCATCGAGG TAATCGTAGA TCGCCTCCAG GTGCGGCCCG GCGTAGCCAG CCGCCTGGCG GAATCCCTGG AAACGGCGCT GAAACTGGCC GACGGCGTTG TCCTGATTGA TATCGTCGGC CAGGAGGAAC TCCTTTTAAG TGAAAAATTT GCCTGCGTGG AGTGTGGCGT CAGCCTGCCG GAGGTGACGC CCCGCCTTTT TTCTTTTAAT AACCCCTACG GGGCCTGTCC GGCCTGCACC GGTCTGGGCG TAACCATGAA GGTAGACCCG GGCCTGGTCA TCCCGGATAA AAGCCTTACC CTGCGGGAAG GGGCCATCGC GCCCTGGAGC CGCGGTAATA ACGGTTACCA GCAGATGCTG GAATGCCTGG CGGACCACTA CGGTTTCAGC CTGGATGTGC CGGTGCGGGA ACTCAAGCCG GAGCACCTCC AGGTAATCCT CTACGGCTCC GGGGAGGAGC GTATTAAATT TCGTTATACC AACCGTTTCG GCGACCGGCG GGCCTATGAG GCTCCCTTCG AGGGGGTTAT TCCCAACCTG GAACGCCGTT ACCAGGAAAC CCAGTCGGAA TGGTCACGGG CGGAAATTGA GAATTATATG AGCCAGCAGC CCTGCCCGGC CTGCCGGGGA GCGCGCCTGA AACCCGAGGC CCTGGCCGTC AAAGTGGGGG GCCTCAATAT CTGCGAACTC GCGGCCCTGG ATGTCCGGGC GGCAGCTGAA TTTTTAAGGA ACCTCAACCT GAGCGAGCGC GAGAAGGTCA TCTCCCGCCA GATTTTAAAG GAGATCCTGG CCCGGCTGCA GTTTTTGCTG GACGTGGGCC TGGATTACCT GACCCTGGAT CGGACGGCGT CTACCCTGTC CGGGGGCGAG GCCCAGCGTA TCCGCCTGGC CACCCAGATT GGCTCCCAGT TGATGGGCGT CCTGTATATC CTGGACGAGC CCAGCATCGG CCTGCACCAG CGGGATAACG AGCGTCTCAT CGCCACCCTG AAGCACCTGC GGGACCTGGG TAATACGGTC ATCGTCGTCG AGCATGATGA GGATACCATG CGTGCCGCCG ATTATATCAT CGACATCGGC CCCGGAGCGG GGGAACAGGG CGGCCGGGTG GTGGCCGCCG GGACGGTCCC GGAGGTTATG GCCAACCCCA ACTCCCTGAC GGGCCAGTAC CTGAGCGGCA GGCGGCGTAT CCCGGTACCG GCAGAGCGGC GCCGGCCGGG GGACAAATGG CTGACCATTA AAGGAGCCAG GGAACACAAC CTGAAGGGTA TCGATGTTAG CTTTCCCCTG GGGCTCTTTA TCGGCGTCAC CGGGGTTTCC GGTTCCGGTA AGAGCACCCT GGTAAACGAG ATCCTCTACC GCGCCCTGGC CCAGCGCTTG AACGGCGCCC GTACCAATCC CGGTGCTTTT GCGGGCCTTA CCGGCACCGA ATACCTGGAC AAGGTAATCG AGGTCGACCA GTCACCCATC GGCCGGACGC CACGCTCCAA CCCGGCCACC TATACCGGCG TTTTTGACGA TATCCGCGCC CTTTTCGCCG CCACCCCCGA GGCCCGGGCC CGGGGCTACA AGCCGGGGCG CTTCAGCTTC AACGTCAAGG GCGGCCGCTG CGAGGCCTGC GGCGGCGACG GCATTATCAA GATCGAGATG CACTTCCTGC CCGATGTGTA TGTGCCCTGC GAGGTCTGCC AGGGCAAACG CTATAACCGG GAGACCCTGG CCGTAAAATA TAAGGGCAAG TCAATCGCCG ATGTCCTGGC CATGACCGTG GACGAAGCGG CGGAGTTTTT CGCCCCCATT CCCCGGCTGC ACCGGCGCCT GACGACCCTC CAGGACGTGG GTTTGGGCTA TATCACCCTC GGCCAGCCGG CTACCACCCT GTCCGGGGGC GAGGCCCAGC GGGTGAAGCT GGCCACGGAG CTGGCCCGGC GCAGTACGGG CCGGACCATG TATATCCTGG ACGAGCCCAC CACGGGTTTG CATATGGCCG ACATCGAGCG GCTGCTGAAC GTCCTGCAGC GTCTGGTGGA CGCCGGGAAT ACAGTGGTGG TCATCGAGCA TAATCTGGAT GTAATTAAAT CGGTAGATTA TATCATCGAT CTGGGGCCCG AAGGCGGTGA GGGCGGCGGC CGGGTGGTGG CCACCGGCAC GCCGGAAGAG GTCTGCCGGG TAGCAGCCTC CTATACCGGC CGCTTCCTGG CTCCTGTACT GGAGCGGGAC CGGGGCCTGC CGGCCCTGGA ACCCCGGACG GCAGGCCTGC CGGAGGAAGA ACCCCGCGGC CGCCGGGAGC TACCCCTGGT AGTTGGGCAG GAGTAG
|
Protein sequence | MARDKIVIKG ARAHNLKNID VTIPRDQLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS SYARQFLGQM DKPDVDVIEG LSPAISIDQK TASHNPRSTV GTVTEIYDYL RLLFAHIGRA HCPRCGRPIT PQTISQMVDR LLTYPEGTRL QVMAPIVRGR KGEYRNVLEE IRRQGYVRVR VDGEIRETSD NISLAKNKKH TIEVIVDRLQ VRPGVASRLA ESLETALKLA DGVVLIDIVG QEELLLSEKF ACVECGVSLP EVTPRLFSFN NPYGACPACT GLGVTMKVDP GLVIPDKSLT LREGAIAPWS RGNNGYQQML ECLADHYGFS LDVPVRELKP EHLQVILYGS GEERIKFRYT NRFGDRRAYE APFEGVIPNL ERRYQETQSE WSRAEIENYM SQQPCPACRG ARLKPEALAV KVGGLNICEL AALDVRAAAE FLRNLNLSER EKVISRQILK EILARLQFLL DVGLDYLTLD RTASTLSGGE AQRIRLATQI GSQLMGVLYI LDEPSIGLHQ RDNERLIATL KHLRDLGNTV IVVEHDEDTM RAADYIIDIG PGAGEQGGRV VAAGTVPEVM ANPNSLTGQY LSGRRRIPVP AERRRPGDKW LTIKGAREHN LKGIDVSFPL GLFIGVTGVS GSGKSTLVNE ILYRALAQRL NGARTNPGAF AGLTGTEYLD KVIEVDQSPI GRTPRSNPAT YTGVFDDIRA LFAATPEARA RGYKPGRFSF NVKGGRCEAC GGDGIIKIEM HFLPDVYVPC EVCQGKRYNR ETLAVKYKGK SIADVLAMTV DEAAEFFAPI PRLHRRLTTL QDVGLGYITL GQPATTLSGG EAQRVKLATE LARRSTGRTM YILDEPTTGL HMADIERLLN VLQRLVDAGN TVVVIEHNLD VIKSVDYIID LGPEGGEGGG RVVATGTPEE VCRVAASYTG RFLAPVLERD RGLPALEPRT AGLPEEEPRG RRELPLVVGQ E
|
| |