Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0389 |
Symbol | |
ID | 7409319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 441119 |
End bp | 442330 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643714773 |
Product | hypothetical protein |
Protein accession | YP_002572296 |
Protein GI | 222528414 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000244478 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAATA AAGTAGGAAG ACTTTACACA TTTTTTAGAT ACATTGTTGT AGTGCTTTTA GGATTAGCCA TTGTTTTGTC ATTTCAGTTT CGGATAGTTG CTCTTACACA TAAGAAAATA AGTTTTGTTG TGATGGTTAT TCTTCTTTTG TCAGCTTTTT TAATCTTTAA TATTTGGCTT TTTATATTTC AAAAACTGGC AAACAAAAAA CTCGCTATAC TCTTGTTAAT TCTTGTTATA GCAGCTCCAA GACTTATATG GATTTTTTTG ATGCCAACAA AGCCAGTTTC GGACTATTTA TGTTTTTATT CCTATGCCCA AAAAGCTTCT CAAGGCTTTT TAAAAGGGTA TGACATGACT TTTACACTCT TTAGGTTCAG ATTTGGATAT TCATTATTTT TGGCACTTGT TTTCAAGATT TTTGGAAGCA GCATAATTGT AGGAAAACTT TTCAATGTGT TTCTTTCAGT AGTTTTAGGA CTTATCATAT ATTTTACTGT TGACTATCTT TTTGGCAAAG AAGCAGCCAC ATATTCAGCA ATTTTGTATG CCTTTTGGCC ATCGCAGATA ATGTATAATT CAGTTTTGGC GTCTGAACAC CCATTTATTG TGTTTTTTGT GCTGGGGCTG TATTTTTTGT TAAGGGCAAT AAAAGAGAAA AAAGCCATTT TTGGCATATT TGCAGGAGTC CTTGTGGCAA TTTCAAATCA TATAAGACCT GTTGCTGTTG TGATCATAAT TGCGATGGTT TTCTTGTTTG CACTAAAAGC TCTTTGTAAA GATTTTAAAA TCTTGAAAAG TGCTATCTTA AGTATAATTT CATACGTTAT TACATTCTAT ACAGTAGGAT ATCTAATTTT TTGTCTCACA GGCATTCCTG TGTGGAAAAC ATCAATGGGG CTTAATCTCA TGATTGGTAC AGACTATACA ACATATGGTA TGAACAATCC TAAACATTCT TTGTTTGTTA AAAAATATGC TTATGATTTT CAAAAGATGC ACGGTGAGGT TATGAAAATA GGGTTAGAGA GACTTAAAAA AGAAACAAAA AAATTTATTG CTATTCTTCC TCGCAAACAT GCTATTATCT GGGGCGATGA TAGCTTTGGG TATTTTTGGA GTACTTTTAA AGTTTACAAA ACCACATATT TTGTTAATCT TGTAAAAATT CATCCAACCA TTTTCTATAT GTTCTCTCAG CTATACTACT AA
|
Protein sequence | MQNKVGRLYT FFRYIVVVLL GLAIVLSFQF RIVALTHKKI SFVVMVILLL SAFLIFNIWL FIFQKLANKK LAILLLILVI AAPRLIWIFL MPTKPVSDYL CFYSYAQKAS QGFLKGYDMT FTLFRFRFGY SLFLALVFKI FGSSIIVGKL FNVFLSVVLG LIIYFTVDYL FGKEAATYSA ILYAFWPSQI MYNSVLASEH PFIVFFVLGL YFLLRAIKEK KAIFGIFAGV LVAISNHIRP VAVVIIIAMV FLFALKALCK DFKILKSAIL SIISYVITFY TVGYLIFCLT GIPVWKTSMG LNLMIGTDYT TYGMNNPKHS LFVKKYAYDF QKMHGEVMKI GLERLKKETK KFIAILPRKH AIIWGDDSFG YFWSTFKVYK TTYFVNLVKI HPTIFYMFSQ LYY
|
| |