Gene Athe_1618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1618 
Symbol 
ID7409448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1718403 
End bp1720424 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content39% 
IMG OID643715987 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_002573485 
Protein GI222529603 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000164842 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGT TTATAAGAAA GAGAATAAGA GAACTTGTTG ATCTTATCAA TTATCATGAT 
TATAAATACT ATGTTGAAGA TAATCCAGAG ATCAGCGACT ATGAATATGA TATGTTATAT
CGTGAGCTTG TTGAGCTTGA AAAGCAATAC CCTGAATATG TGTTTCCAGA TTCTCCTACT
CAAAGGGTTG GTGGAAAGGT AAAAGAAGGT TTTAAAGAGG TTGTGCATCG TGTGCCGCTT
CTTTCGCTTT CAAATGTGTT CAATGAAGGG GAACTTTACG ATTTTGACAG AAGACTAAAG
GAACTTATTG GTACATCTAA TTTTGACTAT GTTGTTGAAT ACAAGATTGA CGGGCTATCT
GTTGCGCTTG AATATGAGAA TGGTCTTTTT ATCAGGGGTG CAACTCGTGG AGATGGCAAT
GTAGGTGAGG ATGTGACAGA AAACTTAAAG ACTATAAGGT CTATTCCGCT AAGGCTAAAA
GAAGACATCT CCATTGTTGT TCGTGGAGAG GTGTTCATGC CCAAGGATGA GTTTATAAAG
CTCAACCAGG AAAGGGAAGA GAATGAAGAA CCTCTTTTTG CAAATCCACG AAATGCAGCG
GCAGGGTCGC TTCGTCAGCT TGACCCAAAA ATAACAGCCC AAAGAAAACT TGATATATTT
GTCTTCAATA TCCAGTGGTG TGAAAAAGAG CTTGAAACAC ATGCACAAGC GCTTGAGTTT
TTAAAGCATC TTGGTTTTAA AGTTTCACCT GACTATGTTG TTTGCAGGGA TATTAAAGAA
GCATTTGAAG CTATAAAGAA AATTGAAGAA AAAAGGGACT TGCTTCCTTT TGAGATAGAC
GGTGCTGTTG TGAAGCTGAA CCAGCTAAGA CTGAGAGATG TTGCGGGGGA GACTGCAAAG
TCGCCCAGAT GGGCTGTTGC TTACAAATTT CCGCCTGAGA AAAAAGAAAC AAAGCTATTG
GATATTGAGG TCAATGTTGG ACGCACAGGT ATTCTAACAC CCACGGCAAT TTTGGAGCCT
GTAAGAATTT CAGGTTCTGT TGTTTCAAGG GCAACTCTTC ATAACATGGA TTATATAAGG
CAAAAGGACA TTAGAATTGG CGATACAGTA ATAGTGCAAA AAGCTGCAGA GATTATCCCT
GAGGTTGTTG AGGTTGTGTT TTCAAAACGA ACGGGGCAAG AAAGAATTTT TGAGATGCCA
AAGAAATGTC CTGTTTGCGG AGCAGATGTT ATAAAATTTG AAGATGAGGT TGCATACAGG
TGTACAGGTG TTGAATGCCC TGCTAAAAGT TACAGGTTAA TCTTACACTT TGTTTCGCGC
GATGCAATGG ACATCGCCGG CATGGGCGAA ATGGTTGTAA AAACCTTGTT TGAAAAAGGT
CTTATAAAGA CTCCTGCTGA TATTTATTAT TTGAAGTTTG AAGACTTGGT AAATTTAGAG
CGTTTTGGAG TAAAGTCTAC AAATAATTTG TTGAAAGCTA TACAAGCATC TAAAAACAGA
CCATTAGACA GGCTAATATA TGCGCTTGGG ATAAGGCACA TTGGGCAAAA AGCTGCAAAA
ACTCTTGCTG AGCATATATC GTCTATAGAT GACCTTTTTA CTATAACAGA AGAGCAGCTT
TTATGTCTTC CTGACTTTGG TGAAAAGATG GCAAAAAGCG TTGTGACATT TTTCAGGCAG
GAGCAGACAA GACACCTAAT AGAAAGGCTA AAAGCTGCTG GTGTAAATAC AGTTTCTGAA
AAGAAAGCAA AATCGGATAT TTTAAAAGGC TACACATTTG TTTTGACAGG GGCTTTGTCA
AAGTATAGCA GAAATGAGGC AAAAGAGATT TTAGAAAGTC TTGGTGCAAA AGTGACAGAA
AGTGTATCTA AAAAGACAAC AGCTGTGATT GTAGGTCAGG ACCCTGGTAG CAAATTTACA
AAAGCTCAGC AGCTTGGTGT TAAAATTTTG AACGAAGAAG ATTTTGAAAA GTTAGTAAAA
GCTTTGTCGC GCGAGGAAGC AGAAAAGATT TTAATGGAGT GA
 
Protein sequence
MSEFIRKRIR ELVDLINYHD YKYYVEDNPE ISDYEYDMLY RELVELEKQY PEYVFPDSPT 
QRVGGKVKEG FKEVVHRVPL LSLSNVFNEG ELYDFDRRLK ELIGTSNFDY VVEYKIDGLS
VALEYENGLF IRGATRGDGN VGEDVTENLK TIRSIPLRLK EDISIVVRGE VFMPKDEFIK
LNQEREENEE PLFANPRNAA AGSLRQLDPK ITAQRKLDIF VFNIQWCEKE LETHAQALEF
LKHLGFKVSP DYVVCRDIKE AFEAIKKIEE KRDLLPFEID GAVVKLNQLR LRDVAGETAK
SPRWAVAYKF PPEKKETKLL DIEVNVGRTG ILTPTAILEP VRISGSVVSR ATLHNMDYIR
QKDIRIGDTV IVQKAAEIIP EVVEVVFSKR TGQERIFEMP KKCPVCGADV IKFEDEVAYR
CTGVECPAKS YRLILHFVSR DAMDIAGMGE MVVKTLFEKG LIKTPADIYY LKFEDLVNLE
RFGVKSTNNL LKAIQASKNR PLDRLIYALG IRHIGQKAAK TLAEHISSID DLFTITEEQL
LCLPDFGEKM AKSVVTFFRQ EQTRHLIERL KAAGVNTVSE KKAKSDILKG YTFVLTGALS
KYSRNEAKEI LESLGAKVTE SVSKKTTAVI VGQDPGSKFT KAQQLGVKIL NEEDFEKLVK
ALSREEAEKI LME