Gene Moth_0255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0255 
Symbol 
ID3833218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp259486 
End bp262401 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content63% 
IMG OID637828191 
Productexcinuclease ABC subunit A 
Protein accessionYP_429133 
Protein GI83589124 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGGG ATAAAATCGT CATCAAGGGA GCACGGGCCC ACAACCTGAA AAATATCGAT 
GTTACCATTC CCCGGGACCA GCTGGTGGTC ATTACCGGCC TGTCCGGCTC GGGCAAGTCG
TCCCTGGCCT TTGACACCAT TTATGCCGAG GGCCAGCGGC GTTACGTCGA GTCCCTTTCC
TCCTACGCCC GGCAATTCCT GGGGCAGATG GATAAACCCG ATGTCGACGT TATCGAGGGG
TTGTCCCCGG CCATCTCCAT TGACCAGAAG ACGGCCAGCC ATAACCCCCG CTCCACCGTG
GGGACGGTGA CGGAGATCTA TGACTACCTG CGCCTCCTCT TTGCCCATAT CGGCCGCGCC
CATTGTCCCC GTTGCGGCCG GCCCATCACT CCCCAGACGA TCTCCCAGAT GGTGGATCGC
CTGCTGACCT ATCCGGAGGG TACCCGTCTC CAGGTCATGG CCCCCATTGT TCGGGGCCGC
AAGGGGGAGT ATCGTAACGT TCTGGAAGAG ATTCGCCGGC AGGGTTACGT CCGTGTCCGG
GTGGACGGGG AGATTCGGGA AACCAGTGAC AATATCAGCC TGGCCAAGAA TAAAAAGCAT
ACCATCGAGG TAATCGTAGA TCGCCTCCAG GTGCGGCCCG GCGTAGCCAG CCGCCTGGCG
GAATCCCTGG AAACGGCGCT GAAACTGGCC GACGGCGTTG TCCTGATTGA TATCGTCGGC
CAGGAGGAAC TCCTTTTAAG TGAAAAATTT GCCTGCGTGG AGTGTGGCGT CAGCCTGCCG
GAGGTGACGC CCCGCCTTTT TTCTTTTAAT AACCCCTACG GGGCCTGTCC GGCCTGCACC
GGTCTGGGCG TAACCATGAA GGTAGACCCG GGCCTGGTCA TCCCGGATAA AAGCCTTACC
CTGCGGGAAG GGGCCATCGC GCCCTGGAGC CGCGGTAATA ACGGTTACCA GCAGATGCTG
GAATGCCTGG CGGACCACTA CGGTTTCAGC CTGGATGTGC CGGTGCGGGA ACTCAAGCCG
GAGCACCTCC AGGTAATCCT CTACGGCTCC GGGGAGGAGC GTATTAAATT TCGTTATACC
AACCGTTTCG GCGACCGGCG GGCCTATGAG GCTCCCTTCG AGGGGGTTAT TCCCAACCTG
GAACGCCGTT ACCAGGAAAC CCAGTCGGAA TGGTCACGGG CGGAAATTGA GAATTATATG
AGCCAGCAGC CCTGCCCGGC CTGCCGGGGA GCGCGCCTGA AACCCGAGGC CCTGGCCGTC
AAAGTGGGGG GCCTCAATAT CTGCGAACTC GCGGCCCTGG ATGTCCGGGC GGCAGCTGAA
TTTTTAAGGA ACCTCAACCT GAGCGAGCGC GAGAAGGTCA TCTCCCGCCA GATTTTAAAG
GAGATCCTGG CCCGGCTGCA GTTTTTGCTG GACGTGGGCC TGGATTACCT GACCCTGGAT
CGGACGGCGT CTACCCTGTC CGGGGGCGAG GCCCAGCGTA TCCGCCTGGC CACCCAGATT
GGCTCCCAGT TGATGGGCGT CCTGTATATC CTGGACGAGC CCAGCATCGG CCTGCACCAG
CGGGATAACG AGCGTCTCAT CGCCACCCTG AAGCACCTGC GGGACCTGGG TAATACGGTC
ATCGTCGTCG AGCATGATGA GGATACCATG CGTGCCGCCG ATTATATCAT CGACATCGGC
CCCGGAGCGG GGGAACAGGG CGGCCGGGTG GTGGCCGCCG GGACGGTCCC GGAGGTTATG
GCCAACCCCA ACTCCCTGAC GGGCCAGTAC CTGAGCGGCA GGCGGCGTAT CCCGGTACCG
GCAGAGCGGC GCCGGCCGGG GGACAAATGG CTGACCATTA AAGGAGCCAG GGAACACAAC
CTGAAGGGTA TCGATGTTAG CTTTCCCCTG GGGCTCTTTA TCGGCGTCAC CGGGGTTTCC
GGTTCCGGTA AGAGCACCCT GGTAAACGAG ATCCTCTACC GCGCCCTGGC CCAGCGCTTG
AACGGCGCCC GTACCAATCC CGGTGCTTTT GCGGGCCTTA CCGGCACCGA ATACCTGGAC
AAGGTAATCG AGGTCGACCA GTCACCCATC GGCCGGACGC CACGCTCCAA CCCGGCCACC
TATACCGGCG TTTTTGACGA TATCCGCGCC CTTTTCGCCG CCACCCCCGA GGCCCGGGCC
CGGGGCTACA AGCCGGGGCG CTTCAGCTTC AACGTCAAGG GCGGCCGCTG CGAGGCCTGC
GGCGGCGACG GCATTATCAA GATCGAGATG CACTTCCTGC CCGATGTGTA TGTGCCCTGC
GAGGTCTGCC AGGGCAAACG CTATAACCGG GAGACCCTGG CCGTAAAATA TAAGGGCAAG
TCAATCGCCG ATGTCCTGGC CATGACCGTG GACGAAGCGG CGGAGTTTTT CGCCCCCATT
CCCCGGCTGC ACCGGCGCCT GACGACCCTC CAGGACGTGG GTTTGGGCTA TATCACCCTC
GGCCAGCCGG CTACCACCCT GTCCGGGGGC GAGGCCCAGC GGGTGAAGCT GGCCACGGAG
CTGGCCCGGC GCAGTACGGG CCGGACCATG TATATCCTGG ACGAGCCCAC CACGGGTTTG
CATATGGCCG ACATCGAGCG GCTGCTGAAC GTCCTGCAGC GTCTGGTGGA CGCCGGGAAT
ACAGTGGTGG TCATCGAGCA TAATCTGGAT GTAATTAAAT CGGTAGATTA TATCATCGAT
CTGGGGCCCG AAGGCGGTGA GGGCGGCGGC CGGGTGGTGG CCACCGGCAC GCCGGAAGAG
GTCTGCCGGG TAGCAGCCTC CTATACCGGC CGCTTCCTGG CTCCTGTACT GGAGCGGGAC
CGGGGCCTGC CGGCCCTGGA ACCCCGGACG GCAGGCCTGC CGGAGGAAGA ACCCCGCGGC
CGCCGGGAGC TACCCCTGGT AGTTGGGCAG GAGTAG
 
Protein sequence
MARDKIVIKG ARAHNLKNID VTIPRDQLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
SYARQFLGQM DKPDVDVIEG LSPAISIDQK TASHNPRSTV GTVTEIYDYL RLLFAHIGRA
HCPRCGRPIT PQTISQMVDR LLTYPEGTRL QVMAPIVRGR KGEYRNVLEE IRRQGYVRVR
VDGEIRETSD NISLAKNKKH TIEVIVDRLQ VRPGVASRLA ESLETALKLA DGVVLIDIVG
QEELLLSEKF ACVECGVSLP EVTPRLFSFN NPYGACPACT GLGVTMKVDP GLVIPDKSLT
LREGAIAPWS RGNNGYQQML ECLADHYGFS LDVPVRELKP EHLQVILYGS GEERIKFRYT
NRFGDRRAYE APFEGVIPNL ERRYQETQSE WSRAEIENYM SQQPCPACRG ARLKPEALAV
KVGGLNICEL AALDVRAAAE FLRNLNLSER EKVISRQILK EILARLQFLL DVGLDYLTLD
RTASTLSGGE AQRIRLATQI GSQLMGVLYI LDEPSIGLHQ RDNERLIATL KHLRDLGNTV
IVVEHDEDTM RAADYIIDIG PGAGEQGGRV VAAGTVPEVM ANPNSLTGQY LSGRRRIPVP
AERRRPGDKW LTIKGAREHN LKGIDVSFPL GLFIGVTGVS GSGKSTLVNE ILYRALAQRL
NGARTNPGAF AGLTGTEYLD KVIEVDQSPI GRTPRSNPAT YTGVFDDIRA LFAATPEARA
RGYKPGRFSF NVKGGRCEAC GGDGIIKIEM HFLPDVYVPC EVCQGKRYNR ETLAVKYKGK
SIADVLAMTV DEAAEFFAPI PRLHRRLTTL QDVGLGYITL GQPATTLSGG EAQRVKLATE
LARRSTGRTM YILDEPTTGL HMADIERLLN VLQRLVDAGN TVVVIEHNLD VIKSVDYIID
LGPEGGEGGG RVVATGTPEE VCRVAASYTG RFLAPVLERD RGLPALEPRT AGLPEEEPRG
RRELPLVVGQ E