Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2182 |
Symbol | |
ID | 7408375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2310970 |
End bp | 2312067 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716547 |
Product | DNA protecting protein DprA |
Protein accession | YP_002574030 |
Protein GI | 222530148 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00480655 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAG AAGAACTCAT TTATTCTTTG TGGCTTTACA GCATAAAAGG GATAGGCCCT AAAAAGTTCA GGCAGATAAA GAATAAGTAT AAAAGTCTAA AAGATGCTTA TCTTAATAAA AAAGAGTTGG AAGTCGAAGG TATTATTGGC GCAGTAAAAG ATGAAATAAA AAATTCTGAC TTGCAGAGGG CTGAAAAAAT TCTTGAATTT TGTGAGAAAA ATAGTATAAA TATAATACTT GAAGACGATG AGCTGTATCC AGATGAGTTC AAAGTTTTTG ACCATGCACC AGTTATGCTC TTTGTAAAAG GGGATGCAAA GCTTCTTAAA TTTCCGCGTA AAATATCAAT GGTTGGAACA CGAGAACCTA CTTATTATGG CAAAAGAGTT GCAAAAGAGC TGGCAAGTCT GCTGGCTTCC TTAGGAATTT TGGTTGTTAG CGGAATGGCA AGGGGGATTG ACAGTTTTTG CCATACAGGG GCACTTGAAA ATGGGAAGAC AGTAGCTGTT TTGGGTTGTG GAGTTGATAT TGTATATCCA AAAGAAAATT TAAAGCTTTA CCGTCAGATT ATAGAAAATG GATGTGTTGT GTCTGAGTTT TTGCCAGGTA CTCTGCCGGA GAAAATGAAC TTTCCCCAGA GAAACAGAAT TGTTGCGATG TTTTCACCTT GCTTGGTTGT AATTGAAGCA TCAACCAAAA GTGGCACTTT TTCGACAGTT GACTTTGCTT TGGAGCAGGG GAAAGAGGTA TTTGCTGTGC CTGGCAACAT ATTTTCGCAA AAGAGTAGTG GTACAAACAG GCTCATAAAA GAGGGTGCAA GGATTATATG TTCATATGAG GATTTTCTTG AGGACATAAA GGAGATTTAT ACTTTAAAAC CTGCCCAGAT AAGCTTTGAT GCAAATGAAG ATGAAGAAGA GCTGACAGAT GATGAGAAAA GATTGATAAA GCTTTTAGAT GAAAATGGTG AGATGCACGT AGAAAGTTTG ATTGCGCTAA CAGGATGGGA ACCTGGCAAG CTTGCAAGTT TAATTACTTC GCTTGAGATA AAGTCCAAGG TTGTAAGAGG ACGAGGAAAC ATAATTTCTA AACTTTAA
|
Protein sequence | MNREELIYSL WLYSIKGIGP KKFRQIKNKY KSLKDAYLNK KELEVEGIIG AVKDEIKNSD LQRAEKILEF CEKNSINIIL EDDELYPDEF KVFDHAPVML FVKGDAKLLK FPRKISMVGT REPTYYGKRV AKELASLLAS LGILVVSGMA RGIDSFCHTG ALENGKTVAV LGCGVDIVYP KENLKLYRQI IENGCVVSEF LPGTLPEKMN FPQRNRIVAM FSPCLVVIEA STKSGTFSTV DFALEQGKEV FAVPGNIFSQ KSSGTNRLIK EGARIICSYE DFLEDIKEIY TLKPAQISFD ANEDEEELTD DEKRLIKLLD ENGEMHVESL IALTGWEPGK LASLITSLEI KSKVVRGRGN IISKL
|
| |