Gene Information |
Locus tag | Athe_0481 |
Symbol | |
ID | 7407560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 552113 |
End bp | 553495 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643714868 |
Product | hypothetical protein |
Protein accession | YP_002572385 |
Protein GI | 222528503 |
COG category | |
COG ID | |
TIGRFAM ID | |
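The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 552113, 553495
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 1383, matches "Gene Length"
print(protein_length(length_bp))  # 460, matches "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields reported in the record.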
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000859311 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATA AAATTATTAC GAATTATAAC CTGATTAAGA ATACTCACAA CAATATTAAG GTCAATAGTT CCAATCATAG TGTTTCTGCA CAAACGCTTT CAACAGGTTG GATTGTGCCT GATAAACCTT CTGTGCAGTC TACTGCTGCA TACAAAGTTA CACTTTCAAA CACTTCTTTG CAAAAAGCTT CTACCAGCAG TAAAACTCAT GGTTCGAATA CACGAAATAG TGGAGGGATT TTGGGTATTT TTTCAAATAT CAAAAAAGAT ATTAGTGACA CTACAAAAAG TATTCAGAAC AAAGTCAACA ATTTTGTAAA ATCTACAGTT TCAAATTTGA ATAACACAGT TAAAACAGTA GAGCACAAAA TTTCTTCAGT AGTAAAATCA ACTGGGGAAA AATTAGAGAC TGTTGCAAAG GATATTAAAG AAGGTTTGAA AAAAGCTGTA GATGTTACAA CAACAAATAG TATTTCTGTT GTAGGGAATA AAACTATAAA AGAGAAAAAA ATAACCTTGA ATGTAGCAGG CAACAAAATG TATCTAAAAT TTACATCTTC AGTAAGTGGA GAAGTGGGAG TTGAAAAATC TTCTCAGTAC AAAGCAAATA CAGAGAAGGC TTCTGGTTTT GTTGAGCACA GCAATTCTGG TAAATTGAAT TTGGAATTAG AGAAGAGAAA AATTAGCAAG AGTTTTGAGA ATAGCACGAG TATGAAAGTT AATGATAAAA CTGAGATTGT AAGTAATGTA ACAGTTAACA AAAGGGGTGC TGAGATTGCA GGTGGAACAA AACTGGTGCT TTTGAAGACA CATAATCAGG AAGTTAATGT AACAGTTGGG GGAGCAGTAA AGAGCAATGG AAAAGCTGAA ATAAATTTAG CTAAGGTAAC TCATTCTTTA TCAACAGGTA AGATAACTTC CGAACAAAGC ATAGGGTTAA GCATTGATGA GAAAACTTTT AATAAGATAA AGACCACTAT GCAGGGTGTA TGGCTCGCAG TACCAAATGA TACAAAGTTT AAGATAGGAG TTGCTGTTGG AGTTATCAAA GGGGCTGTAA ATACTGTAAA GAGTTTAGTT GATGTTGTAA CTCATCCAAA GCAAATCGCA GAAGGAGCAA GAGAATTAAT TAAACATCCG CAGGTAGCAT TAAAATATGT AGAGCAATCT ATTGCTAAGG CAAAGGAAGA ATTTGTAAAT GGTGATGATT ACAAAAGAGG AGAAATGGTA GGGGAAGCAC TGTTTGAAGT AGGGGTTAGT ATAGCAGGTA CCAAAGGATT AGATAAATTA GCGAAGGCAG CCAAAGTATC TAATAATTTA GGAAAACTTA AAAAAGTATT TGATGTTACG ACAAAAGTAG CAAAACCAGC TTTTGGTCAT TAA
|
Protein sequence | MSNKIITNYN LIKNTHNNIK VNSSNHSVSA QTLSTGWIVP DKPSVQSTAA YKVTLSNTSL QKASTSSKTH GSNTRNSGGI LGIFSNIKKD ISDTTKSIQN KVNNFVKSTV SNLNNTVKTV EHKISSVVKS TGEKLETVAK DIKEGLKKAV DVTTTNSISV VGNKTIKEKK ITLNVAGNKM YLKFTSSVSG EVGVEKSSQY KANTEKASGF VEHSNSGKLN LELEKRKISK SFENSTSMKV NDKTEIVSNV TVNKRGAEIA GGTKLVLLKT HNQEVNVTVG GAVKSNGKAE INLAKVTHSL STGKITSEQS IGLSIDEKTF NKIKTTMQGV WLAVPNDTKF KIGVAVGVIK GAVNTVKSLV DVVTHPKQIA EGARELIKHP QVALKYVEQS IAKAKEEFVN GDDYKRGEMV GEALFEVGVS IAGTKGLDKL AKAAKVSNNL GKLKKVFDVT TKVAKPAFGH