Gene Information |
Locus tag | Moth_0254 |
Symbol | |
ID | 3833217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 255637 |
End bp | 259335 |
Gene Length | 3699 bp |
Protein Length | 1232 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637828190 |
Product | UvrD/REP helicase |
Protein accession | YP_429132 |
Protein GI | 83589123 |
COG category | [L] Replication, recombination and repair; [S] Function unknown |
COG ID | [COG0210] Superfamily I DNA and RNA helicases; [COG1379] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00375] conserved hypothetical protein TIGR00375 |
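The coordinate and length fields in this record are linked by simple arithmetic. With 1-based inclusive coordinates, 259335 - 255637 + 1 = 3699 bp, and 3699 / 3 = 1233 codons, one of which is the stop codon, leaving 1232 aa. The Python sketch below checks this and shows one way a GC percentage could be computed from a sequence string. The function names are hypothetical and belong to no database API, and the inclusive-coordinate convention is an assumption that happens to agree with the values listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(255637, 259335)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
    # Only the first 10-base block of the gene sequence, for illustration.
    print("GC% of first block:", gc_percent("TTGCAATTTA"))
```

Running `gc_percent` on the full gene sequence, rather than on the single block used above, would be the way to compare against the GC content field.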
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.764036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGCAATTTA TCGCGGACCT TCACATCCAT TCCCGCTATT CCAGGGCTAC CAGCAAAGAT GCCACCCCGG AGAATCTTTA CCGCTGGGCC CTTTATAAAG GCGTTACCCT GGTGGGCAGC GGCGATTTTA CCCATCCGGC CTGGCGGGAG GAACTGAAAA ATAAGCTTGA ACCGGCGGAA GACGGCCTCT ACCGCCTCAA GGAAGAGTTC AGCAGGCCGG TAGATGAGGA GCCGGGACCG GGTCCGGCCG GGCCCGGGAA GGGTGCGAGG TGGCCAGCGG ACATGGGAAC CTCTGCCGGT TGGTTTTCAA GCATCCCCAG CGAGGCCTCC GGCCGTCCCG CGGTCCGCTT TATAATCTCG GGAGAAATCA GCACCATATA CAAAAAAAAC GGGCGGGTGC GGAAAATCCA CCACCTCATC CTCCTGCCCG ATCTTGAGGC GGCGGAAGCC CTCAGCCGCC GCCTGGAGGA GATCGGCAAC CTCCACTCCG ACGGCCGGCC GATCCTGGGA CTGGACAGCC GCCAGCTCCT GGAGATGACC CTGGAGGCCT GCCCGGAAGC CGTCTTCATC CCGGCCCATA TCTGGACCCC CCATTTCTCC CTCTTCGGCG CCAAGTCGGG TTTTGACAGC CTGGAAGAGT GCTTCGGGGA CCTGGCCGGG TATATCTACG CCGTGGAAAC GGGGCTCTCC TCCGACCCGC CCATGAACTG GCGCCTGGCC GCCCTGGACC GGTTGACCCT GGTTTCCAAC TCCGACGCCC ATTCACCCCG TAATCTCGCC CGGGAGGCCA ACCTTTTCAA TACCGAGCTT TCCTACCCGT CCATCCGCCG GGCCCTCAGG GACCGGGACG GAAAGGAATT CCTGGGGACC CTGGAGTTTT TCCCCGAAGA GGGCAAGTAT CATTATGACG GTCACCGGAA CTGCCGGGTT CGCTGGCGGC CGGCTCAGAC CCGGGAGGCC GGCGGCGTTT GCCCCGTCTG CGGTAAAAGG GTTACCGTCG GCGTCCTGCA CCGGGTGGAG GAACTGGCCG ACCGTGGCGA GGGTTTCCGG CCCGCCACCG CCCGCCCCTT TGAAAGCCTG GTCCCCCTGC CGGAGGTGAT CGCCGCCGCC CTGGGCTCGG GGGTGGCTAC CAAAAGGGTC ACCCGGCTTT ATTTTGACCT GATCGGCCGC CTGGGGCCGG AGCTGGCGGT TCTACGGGAG GCCCCCCTGG AGGCCATCGC CCGGGTGGGC GGGTCCCTGG TGGCCGAGGC CGTCCGGCGA ATGCGGACCG GTGAGGTTGA TGTCCAGCCG GGGTTTGACG GCGAGTACGG CAAGATTTTA TTACTCCGGC CGGAGGAGCG GCAGCTCTTC TCCGGGGAGC CCAGCCTGTT TGCCGGGGAA GCCGTAACCG GCCCGGTGGC GGCAATTGAG GAGAAACACC CGGAAATTGG CGCTACAGGC CCGGCGGGCG CGACTCTAGA GGAGGCTGGC GCAGGCCAGG GGGGAATCTT TGCCCCGCAG GTACCGGCGG GACGGAGGAC CCGGCACCCG GAGACTGCCG CGAGCGCTGT TATTCGCCCG GCCCCGGTGC CTGCCGGGTC CGGGGGTCCC GGATTAACCG GCGCCCTCAA CCCGGAGCAG CAGTCAGCCG TGACGGCGGC GACAGGTCCG GTGGTGGTCA TAGCCGGCCC GGGTACGGGC AAAACCCGCA CCCTGGTTTA CCGGCTGGCT TACCTGATAA AGGAGCGGGG CGTCGCCCCG GGGGAGATCG CAGCCGTCAC CTTCACCAAC AAGGCCGCGG CTGAGATTCG CCAGCGGGTG 
GCGGACCTGC TGGGGGACAG GGACGGTATG GAGTCCCTGA CTGTGGGGAC CTTTCACAGC ATCTGCCTGG ACCTCCTTCA ATCTTCCCTC ACCGGGTCAC CGGGAGCGGA TCTTCAAGGA AAACCAGTGG GCACGACGGA TACTCCGGCT TCCCCGCGGC TGACGGTCGT TGATGAAGCC GATGCCCGGG AGATCCTGGC CGGGGTTCTG GCCGAGAAAG GAACCGGGCG CCGGGGGTTG CCCGGCCTAC AACGGCGCAT CTCCCTTTTG AAGAGCCGCG GCCTCCTGCC CGATTCTCCG GAAGTGCCGG CCGACCTCCG GCCAATCTAC CGGGCTTACC AGGAGCGGCT GGCGGCCTAC GGTGTCCTCG ACTACGACGA TATCCTCCTG AAGGCCCTGG AACTTCCGGC CTTAAGGCCG GAAGGTGCTC CCGCATTCCA GGGGCCCGTT CCCTTCACCC ATCTCCTGGT GGACGAGTTT CAGGATGTCA ACGCCGTCCA GTACCGGCTG GTGAAGAAAT GGACCGGGGA CGGGAAGAAC CTTTTCGTTA TCGGTGATCC CGATCAGGCC ATCTACGGCT TTCGCGGGGC CGATTATCGC TTTTTCGGAC AACTGTTGGA GGACTTCCCC GCTGCCCGGG TCTTCCGGTT GACCAGGAAT TATCGCTCGA CGCCGGTTAT TCTCCGGGCA GCCGCCGCCG TGGTAGCCCA TAACCCGGGC GGCGATCTCA GGGCGGCTGC CGGGGGTGAG CTGGCAGCTA CCACCGCCAG TACCACCAGC CCGGCAGCCG GCCCGGGTAT ATCAGGTGGG GGCGGGCTGG TGGCCACCCG CCCGGAGGGA GGACCCATCC TCCACCTGGA AGCCCCGGGG GAAACGGCTG CGGGTATTGC CATCGTCCGG GAGATCGGGC GCCTGGTCGG CGGAGCCACC ATGTTGCAGG CCCACGGGCA GGACGGCGGC ACCTGGCGGG AAGGCCCGGA AACCGGAAGC GATGGGGCCT ATGGCTTCAG CGATATCGCC GTCCTCTGTC GTACCGGGCA CCAGCTAAAG GCCCTGGAGG AATGCTTTTT AAAAGAGGGA CTGCCCTACC GCATCGTAGG CCGGGAGAGT TTCCTGGAGG AACGCCCGGT GCGGGAGGCC CTGGCCTTCT GCCGCTGCCT GGTGAACCCG GAGGACGATT TTCACCTCCT TAAATACCTC GGTTCCGGAT CTGCCGGAGT CGGCGAGGAG GTTGCTATAC AGGTCAGGGA AACGGCCCTT AAGAGGGGAG TCCCGGCCTG GCTGGTTATC AGGGAACTGG CGGGCGAACT GCAAAACTTT ATAAACTCCC TGGAGGCTTA CCGCGCTGCC CTGGACAAGC AATCCCCGGC GGAACTCCTG GCCCGCTGGG CCGGGGAACA TGGCCTGCTG GATGAGCCCT CTATCAGCCG GTTGTTGCGG GTGGCGGAGC GTTTCCCGGA TTTATCTGCT TTCCTGCGCG GGGTAGTTCT GGCCGGGGAG GCCGACCACG AACGGCCGGG GAACAATGGC GCTTCTCCGG AAGTGGTAAC CCTTATGACC CTTCACGCCG CCAAGGGGCT GGAGTTCCCG GCGGTCTTTA TCGCCGGGGT AGAAGACGGG CTGCTGCCCC TGAAAGAGGG GCCGGGGGTA ACCTTAAGCC CTGAGGAAAT GGCTGAGGAA CGACGCCTCT TCTATGTGGG CATGACCCGG GCGCGGGAGC TGCTGGTCCT GGTTTCGTCC CGTAGGAGGG AGCTAAGGGG CACGGTGGTA CCGGCGGAGC CTTCTCCCTT CCTGGGGGAG ATACCTGCAG 
ACTGCCTGGT AAGGGAGACG TGGGATGCCA GGGGTAAGAA GAAAAACGGG AAAGATGAGG AGAAGTATCA GCAACTGAGC CTGTTTTAA
Protein sequence | MQFIADLHIH SRYSRATSKD ATPENLYRWA LYKGVTLVGS GDFTHPAWRE ELKNKLEPAE DGLYRLKEEF SRPVDEEPGP GPAGPGKGAR WPADMGTSAG WFSSIPSEAS GRPAVRFIIS GEISTIYKKN GRVRKIHHLI LLPDLEAAEA LSRRLEEIGN LHSDGRPILG LDSRQLLEMT LEACPEAVFI PAHIWTPHFS LFGAKSGFDS LEECFGDLAG YIYAVETGLS SDPPMNWRLA ALDRLTLVSN SDAHSPRNLA REANLFNTEL SYPSIRRALR DRDGKEFLGT LEFFPEEGKY HYDGHRNCRV RWRPAQTREA GGVCPVCGKR VTVGVLHRVE ELADRGEGFR PATARPFESL VPLPEVIAAA LGSGVATKRV TRLYFDLIGR LGPELAVLRE APLEAIARVG GSLVAEAVRR MRTGEVDVQP GFDGEYGKIL LLRPEERQLF SGEPSLFAGE AVTGPVAAIE EKHPEIGATG PAGATLEEAG AGQGGIFAPQ VPAGRRTRHP ETAASAVIRP APVPAGSGGP GLTGALNPEQ QSAVTAATGP VVVIAGPGTG KTRTLVYRLA YLIKERGVAP GEIAAVTFTN KAAAEIRQRV ADLLGDRDGM ESLTVGTFHS ICLDLLQSSL TGSPGADLQG KPVGTTDTPA SPRLTVVDEA DAREILAGVL AEKGTGRRGL PGLQRRISLL KSRGLLPDSP EVPADLRPIY RAYQERLAAY GVLDYDDILL KALELPALRP EGAPAFQGPV PFTHLLVDEF QDVNAVQYRL VKKWTGDGKN LFVIGDPDQA IYGFRGADYR FFGQLLEDFP AARVFRLTRN YRSTPVILRA AAAVVAHNPG GDLRAAAGGE LAATTASTTS PAAGPGISGG GGLVATRPEG GPILHLEAPG ETAAGIAIVR EIGRLVGGAT MLQAHGQDGG TWREGPETGS DGAYGFSDIA VLCRTGHQLK ALEECFLKEG LPYRIVGRES FLEERPVREA LAFCRCLVNP EDDFHLLKYL GSGSAGVGEE VAIQVRETAL KRGVPAWLVI RELAGELQNF INSLEAYRAA LDKQSPAELL ARWAGEHGLL DEPSISRLLR VAERFPDLSA FLRGVVLAGE ADHERPGNNG ASPEVVTLMT LHAAKGLEFP AVFIAGVEDG LLPLKEGPGV TLSPEEMAEE RRLFYVGMTR ARELLVLVSS RRRELRGTVV PAEPSPFLGE IPADCLVRET WDARGKKKNG KDEEKYQQLS LF
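The protein sequence follows from the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with TTG, which encodes leucine internally but is read as methionine when it serves as the initiation codon, which is why the protein starts with M. The sketch below illustrates this under stated assumptions: the amino-acid string and start-codon set are those of NCBI table 11, the function name `translate_cds` is hypothetical, and the input is assumed to be a complete, unspliced CDS containing only A, C, G and T.

```python
# NCBI translation table 11, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Alternative initiation codons allowed by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon.

    Whitespace between 10-base blocks is ignored. A recognised start
    codon in the first position is translated as M.
    """
    bases = "".join(seq.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate_cds("TTGCAATTTA TCGCGGACCT TCACATCCAT"))
```

Applied to the first 30 bases of the gene sequence, this reproduces the first ten residues of the protein sequence, MQFIADLHIH.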