Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2143 |
Symbol | |
ID | 7408852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2275727 |
End bp | 2277112 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716508 |
Product | protein of unknown function DUF342 |
Protein accession | YP_002573991 |
Protein GI | 222530109 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0160634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAGTG AACAAAAAGT TGATATAAAG GTTATGGTAA CATCTGATAG GCTAAAAGCA AGTGTGGTGC TTGTACAAAA TCAAAATGGT GTGGATTTGA CGTTTGAAAA TGTGATGAAT AAATTGAGGG AAAATAAAAT TACGTTTGGG ATTGATGATG AAGCTATAAA AAAACTTGTT GAAAATCCCA TTTTTGGAAC ACCTATTGTG GTTGCGCAAG GGAAACCACC TGGCAAACCT GTTGATGGAA AACTCATATA TCATTTTGAT ATCAAACGTG AGATAAGGCC CAAAGAACTT CCTGATGGAA GAGTAGATTA CAAAGACTTG GGAATTGTTC AGAATGTGCG AAAAGATGAT GTTCTTGTTA CAATGATAGA CCCTGTTGAT GGAGAAAATG GTATGGATGT TTTTGGAGGG GTGATAAGAG GGCAAAAAGG GAGAAAGTTA AATCTTCCAA GGGGAAAAAA TACATACATA GATTCTGATG GGCATACATT AAAAGCTGCG TGCGATGGTC AGGTGTGCAT CATTGAGGGC AAGGTAACAG TTTTAAATAC ATTGGAGATT AACTCTGACA TAGACAATTC AACAGGTAAC ATAAACTTTG TTGGCAATGT TCATATAAAA GGGAGTGTGC TATCTGGTTT TAAGGTTGTT GCTGAAGGAA ATGTAGAAGT AGATGGTATT GTTGAGGCAG CTGAAATTGA AGCAAAAGGG AATGTAGTGC TGCACAAAGG GATTACAGGT ATGGGCAAAG GTAGAGTTGT TGCTGGCAAG AGCGTTTTTG CAAAGTTTAT TGAGAATGCT ACTATTGTTG CTGGTGAAGA TGTTCAGGCA GAAGCAATTG TTCACAGTGA CGTAAAGTGC GGGAATAAAC TTATTCTTGT TGGCAGGAAA GCTTCTATTG TTGGTGGGTC TTGCAAGGTT GGCAAAGAGG TTGAAGCAAA GGTAATAGGT TCGTATCTTT CCACCGCTAC TGAGATAGAA GTTGGGGTTG ACCCTCTGAT GGTGGAAAGA TACAGAGAGA TAAGGAGAGA GATGTCAGAG TTAAGAGAAA ATATAAAAAA ATGTGACCAG GGAATTGAGG TTTTGAGGAA GATAGAAGCA GCAGGTCTTT TGACAGACGA GAAAAGAGAA ATGCTTCAAA AGTTTACAAG GTCAAAGATT ACGGCATCAG AAAAATTAAA AGAATTGCAG AGTGAATTTG AGGAAATTGA AAAAAGACTT GAGGAGAGAA ATGAGGGGAT TGTTAAGGTT CAGGATACCA TTTACCCCGG GGTTAAGATA ACCATAGGAA ATGTGTGCAA ACTTATAAAG GAGCCAGTAA AATATTGCAA GATTTACAGA GAAGATGCTG ATATAAAGAT AGCGCCGTAT GCTTAA
|
Protein sequence | MLSEQKVDIK VMVTSDRLKA SVVLVQNQNG VDLTFENVMN KLRENKITFG IDDEAIKKLV ENPIFGTPIV VAQGKPPGKP VDGKLIYHFD IKREIRPKEL PDGRVDYKDL GIVQNVRKDD VLVTMIDPVD GENGMDVFGG VIRGQKGRKL NLPRGKNTYI DSDGHTLKAA CDGQVCIIEG KVTVLNTLEI NSDIDNSTGN INFVGNVHIK GSVLSGFKVV AEGNVEVDGI VEAAEIEAKG NVVLHKGITG MGKGRVVAGK SVFAKFIENA TIVAGEDVQA EAIVHSDVKC GNKLILVGRK ASIVGGSCKV GKEVEAKVIG SYLSTATEIE VGVDPLMVER YREIRREMSE LRENIKKCDQ GIEVLRKIEA AGLLTDEKRE MLQKFTRSKI TASEKLKELQ SEFEEIEKRL EERNEGIVKV QDTIYPGVKI TIGNVCKLIK EPVKYCKIYR EDADIKIAPY A
|
| |