Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1819 |
Symbol | |
ID | 7408606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1892243 |
End bp | 1893409 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643716196 |
Product | hypothetical protein |
Protein accession | YP_002573685 |
Protein GI | 222529803 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000132371 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATA TAATTTGTTT TTCAACAACA CCATGGGATC CTATACCAAC ACGTAAACAA CAGATAATGA AAAGAATGCC ACAAAACTGT AGAATATTTT ATTTAGACCC ACCTGTGACC TTGATAGGTC CATTAAAAGA CCCTAGTTTG AGACCTTACC TAACAAGATT TAGAAAGTCT CCAAAAAGAA TAAAGGAGAA CCTTTTTGTA TTTGCTCTGC CACCAATTAT TCCCTTTTAT AACAAGAAAA GGTCCATCAA TAAGTTCAAT CAAAAAATGA TAGCAAATTT TGTAAAAGAA GTTATCTATC AAAACTTTGA TTTAAAGTCA CCTATAATAT GGACCTACAT GCCAAACACT GTTGATCTTC TTGAACATCT TTCTTACAGT TTTTTAGTTT ACGACTGTAT AGACAAACAT TCAGAGTTTC AAGGGTTTAT TGACAAGGCT TTGGTTGAAA GCATGGAAGA TGAGCTTGCT CAAAAGAGTA ATGTAGTTTT TACAACAACC CATGGATTAT ATAATAAGCT CAAGTTATTA AATCCTCACA CATATCTTGT GCCAAACGGT GCTGAGTTTG AACACTTTAA TAAAGCTTCA AATAAACTGC CTGTACCCGA TAAGATGAAT AATATACCCC GTCCTATCTT TGGCTTTGTG GGTGTTATCC ACACATGGAT AGACACTCAG CTTATAGAAT ATTTAGCAAA AGAAAAAAGA GAGTGGTCTT TTGTTTTGAT AGGACCTGTG GGTGCTGGTG TAAGCGTGGA TAATTTAAAG AAGCTGAGCA ATATTTATTT GCTTGGAAGG ATTGATAACA AGGATTTGCC GCAGTATGTA TCTCAATTTG ATGTTTGCTT AAACTTATTC AGAACAAACA AGCTCTCAGA GAATGTAAGC CCGCTAAAAT TTTATGAATA TTTGGCAACA GGAAAACCAA TTGTTTCAAC TTCAATGCCC CAAGTAGAAC AATTTTCCGA TGTTGTGTAT ATTGGCAAAA ACTATGAAGA TATGCTTGTA AAATGCATTC AAGCTCTGCA GGAGGCACAA AATCCTAATA TTGAAAAGAT AGAAAAAAGA ATAGAGTATG CAAAGCAAAC CTCATGGGAT AGCAGAGTAA CTCAAATTAT TGATATACTA AAGAGGGAAG GGATAAACAT TGAATAG
|
Protein sequence | MIDIICFSTT PWDPIPTRKQ QIMKRMPQNC RIFYLDPPVT LIGPLKDPSL RPYLTRFRKS PKRIKENLFV FALPPIIPFY NKKRSINKFN QKMIANFVKE VIYQNFDLKS PIIWTYMPNT VDLLEHLSYS FLVYDCIDKH SEFQGFIDKA LVESMEDELA QKSNVVFTTT HGLYNKLKLL NPHTYLVPNG AEFEHFNKAS NKLPVPDKMN NIPRPIFGFV GVIHTWIDTQ LIEYLAKEKR EWSFVLIGPV GAGVSVDNLK KLSNIYLLGR IDNKDLPQYV SQFDVCLNLF RTNKLSENVS PLKFYEYLAT GKPIVSTSMP QVEQFSDVVY IGKNYEDMLV KCIQALQEAQ NPNIEKIEKR IEYAKQTSWD SRVTQIIDIL KREGINIE
|
| |