Gene Moth_2013 details


Gene Information

Locus tag: Moth_2013
Symbol:
ID: 3831967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2097721
End bp: 2099721
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 63%
IMG OID: 637829942
Product: DNA ligase, NAD-dependent
Protein accession: YP_430852
Protein GI: 83590843
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID: [TIGR00575] DNA ligase, NAD-dependent
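
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python. The function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Minimal consistency check for the coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with exactly one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(2097721, 2099721)
print(length_bp)                  # 2001
print(protein_length(length_bp))  # 666
```

Both results agree with the Gene Length (2001 bp) and Protein Length (666 aa) fields.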


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTAG CTGCTGCCCG GCAACGGGTG GAAGAATTGC GACGCCTGAT CGAAGAACAT 
AACTACCGCT ACTACGTCCT CGACCAACCC TCGATCAGCG ACCGGGAATA CGACGCCCTG
ATGCAGGAAC TCATAGCCCT GGAGGAGGCT TACCCCGAAC TCCGGACCCC GGACTCCCCC
AGCCAGCGGG TAGGGGGTGC ACCCCGGGAA GAGTTCAACC AGGTGCGCCA TCCCCAGGTA
CTGCTGAGCC TCAACGACGC CTTTAATGAA GGCGACCTGC TGGAGTTTGA CCGTCGCGTC
CGGGATTTGG CGGGCCGCCC GGTGGAATAC GTCATCGAGG CTAAAATTGA CGGCCTGGCT
GTAGCCCTCA CCTACAGGGA CGGCCTTTTT ACCCTGGGCG CCACCCGGGG AGACGGCCAG
GTGGGAGAGG AGGTCACTGC GAATCTCAAA ACCATTCCCG CCCTGCCCCT GCGCCTGCGC
CGGCCACTGC CTTTTCTCGT GGTCCGAGGG GAGGTCTACA TGCCCAAGGT CGCCTTTACC
GCCCTGAATG CCGCCCGGGA GGAAGCCGGG GAACCCCTCT TTGCCAACCC CCGTAACGCG
GCCGCCGGTT CACTGCGCCA GCTGGACCCG AAAATAACGG CCAGTCGCAG CTTGAGCCTC
TTTGTTTATC AGGTAATCAG CCTTACCGGG GCGGAAGTTG CCACCCAGGC CGGGGCCCTG
GACTTCCTGG CGGAGCTGGG CTTTCCCGTC AACCCTTACC GGGTCGTGGC CCCGGACATT
CAGGCCGTCC TGGAGGAGGT AAAAGCCTGG ACGCCGGAAA GGCGGGCGAG TTTACCCTAT
GAAATCGACG GCCTGGTCAT CAAGGTCAAC GACCTGGCCC TTCATTCCGT CCTGGGGGCG
ACGGCCAAAG CCCCCCGTTG GGCTATGGCC TATAAATTCC CGGCTGAACA GGCCACCGCC
AGGGTGGAGG GGATTATCCT GCGGGTCGGG CGCACGGGGG TTCTGACCCC CACGGCCGTC
CTCACCCCGG TCCGCCTGGC CGGAACCACC GTTAGCCGGG CCACTTTACA TAATGAAGAT
TATATCCGGG AGAAGGACAT TCGCATCGGC GATACAGTCA TTGTTCAGAA GGCCGGCGAT
ATCATCCCGG AGGTGGTGGC TGTTATACCG GAGCGGCGTA CCGGGTCGGA AGAGGTCTTT
ACCATGCCGG AGCGCTGCCC GGCCTGCGGG GCTGCGGTAG TGCGGCCGCC GGGGGAGGCG
GCCCACCGCT GCACCGGCGG TCTGGCCTGT CCGGCCCAGG TCCTGGAGGG GATTATCCAC
TTCGCTTCCC GGGGGGCCAT GGACATCCAG GGCCTGGGTC CGGCCATCGT GGCCCAGCTC
CTGGAAGCCG GCCTCATCCA CGACGCCGCC GATCTCTATT ATTTAAAGGA AGAAGACCTG
TTAAAACTGG AGCGCTTCGG CCAGCAGTCG GCCAGCAATC TTCTGGCAGC CATTGCTGCC
AGTAAAAAGC AACCCCTGGA ACGCCTCATC TTTGCCCTGG GTATCCGTAA CGTCGGTCAG
CGGGCGGCCC GGGTCCTGGC CGATCATTTC GGTTCCCTGG ATAAACTGGC CGCGGCCACG
GTAGAGGAAT TGACCGCCCT GCCCGATATC GGCCCCCGTA TTGCCGAGAA TATCCGGGAG
TTTTTTGGCG AACCCCGGAA CCGGGCCGTA CTGGAAAAGC TTAAAAAGGC CGGGGTGCGG
CTGGAGGCTT TGAATGCAGC AGCCCCGGCC GGACCCTTAA CCGGCAAGGT CTTTGTCCTC
ACCGGCACCT TGCCGGGTAT GACCCGCCAG GAAGCGGAGG AACTGATCAC CAGGGCGGGG
GGTAAAGTCA GCAGCAGTGT GAGCCGGAAA ACAGACTATG TAGTCGCCGG TGAAAAACCT
GGCTCCAAGT TAGACCGTGC CCGGGAACTG GGAGTGCCGG TAATCAACGC GGATGGGTTG
CGACAACTCT TGCGCCAGTA A
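
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation. As a rough sketch (the helper names are hypothetical, and how the database itself computes or rounds GC content is not stated in this record), the blocks can be cleaned and the GC percentage computed like this:

```python
# Sketch: join a block-formatted sequence and compute its GC percentage.
# clean_sequence and gc_percent are illustrative names, not a database API.

def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(block.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# Small illustrative input, not the full gene.
print(gc_percent(clean_sequence("AT GC\natgc")))  # 50.0
```

Pasting the full gene sequence into `clean_sequence` should give a string of 2001 bases, and the value from `gc_percent` can then be compared with the GC content field in the Gene Information table.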
 
Protein sequence
MDLAAARQRV EELRRLIEEH NYRYYVLDQP SISDREYDAL MQELIALEEA YPELRTPDSP 
SQRVGGAPRE EFNQVRHPQV LLSLNDAFNE GDLLEFDRRV RDLAGRPVEY VIEAKIDGLA
VALTYRDGLF TLGATRGDGQ VGEEVTANLK TIPALPLRLR RPLPFLVVRG EVYMPKVAFT
ALNAAREEAG EPLFANPRNA AAGSLRQLDP KITASRSLSL FVYQVISLTG AEVATQAGAL
DFLAELGFPV NPYRVVAPDI QAVLEEVKAW TPERRASLPY EIDGLVIKVN DLALHSVLGA
TAKAPRWAMA YKFPAEQATA RVEGIILRVG RTGVLTPTAV LTPVRLAGTT VSRATLHNED
YIREKDIRIG DTVIVQKAGD IIPEVVAVIP ERRTGSEEVF TMPERCPACG AAVVRPPGEA
AHRCTGGLAC PAQVLEGIIH FASRGAMDIQ GLGPAIVAQL LEAGLIHDAA DLYYLKEEDL
LKLERFGQQS ASNLLAAIAA SKKQPLERLI FALGIRNVGQ RAARVLADHF GSLDKLAAAT
VEELTALPDI GPRIAENIRE FFGEPRNRAV LEKLKKAGVR LEALNAAAPA GPLTGKVFVL
TGTLPGMTRQ EAEELITRAG GKVSSSVSRK TDYVVAGEKP GSKLDRAREL GVPVINADGL
RQLLRQ
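
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a standard-library illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. The `translate` helper is hypothetical, and it does not treat alternative start codons such as GTG or TTG specially, which is harmless here because this gene starts with ATG.

```python
# Sketch: translate a CDS codon by codon and stop at the first stop codon.
# The table is built in the conventional TCAG codon order.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA (spaces and newlines allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks of the gene sequence above.
print(translate("ATGGATTTAG CTGCTGCCCG GCAACGGGTG"))  # MDLAAARQRV
```

The output matches the first ten residues of the protein sequence. The final 21 bases of the gene, `CGACAACTCT TGCGCCAGTA A`, translate to `RQLLRQ` followed by the TAA stop codon, which matches the end of the protein.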