Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1317 |
Symbol | |
ID | 5876235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1355879 |
End bp | 1356994 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641541667 |
Product | hypothetical protein |
Protein accession | YP_001662945 |
Protein GI | 167039960 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00297001 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTTAA AATGCCAAGT AATAGCTTCC ATAATGGATA AGCTTGCGCC GCGTAAATTC GCTGAAGAAT GGGATAACGT AGGATTATTG GTAGGAGATG GTTCTAAAGA TGTCTCAAAA ATTTTAGTAG CTTTGGATGC CACTTTTGAA GTAGTTAAAG AAGCAATTGA CAAAAAAGTA GATATGATTG TAACTCATCA TCCGTTAATT TTTAAGCCAA TAAAAAATGT CAAAGCTGAT AATCCAGTGG GCTCATTGTT AATACAACTT ATTAAAAATG ATATCTCGCT TTATGCAGCC CATACCTCTT TTGATATAGC TCCAAATGGG ATGAATGATA TTTTGTGTAA CGTCTTAGGA ATATATGATA GGGAAGTTTT GGATGTCACT TATTCCGAAG GATATAAAAA AATTGCAGTT TATGTGCCAC AAGGGTATGA AGAGATTGTA AAAAATGCTA TGTGCAATGC AGGAGCTGGT TTTATTGGAA ATTATAGTAA TTCTACTTTT CAGACCCAAG GCATTGGAAC TTATAAGCCT TTAGAGGGGA CGAATCCATT TATTGGTGAA ATAGGGAAGA TAGAAAAAGT GGAAGAAGTA AAAATAGAAA CAGTAGTGCC TCAGAAATAT TTAGAAAAAG TAATAAATGC AATGCTAAAT GTTCATCCCT ATGAAGAGGT TGCCTATGAT GTATATCCTT TGGAAAATCT TAAAGAAGAA TATGGGCTTG GAAGAATTGG AACTATTTCA GAAACAACTT TAAAAGAATT GGCACTACAA GTAAAAGCAA AACTTAAAAT CAATAATTTA AGAGTGGTAG GAGACCCTAA TAAAAAGATA AAAAAAGTGG CTGTGTGTGG TGGAAGCGGT GCAAGTCTTA TTCACAAAGC TGTTTCTAGA GGAGCAGATG TGTTGATAAC GGCAGATATT GGCTATCACG ATGCTGTAGA AGCCCAGCAC CTCGGATTAT CTTTAATTGA TGCGGGACAT TTTGCTACAG AGAATATTGC AGTTAGGTTT ATTGCAGAAT ATATAATTGA TGAAACTCAG AAACAGGGCA ACGAAATAGA AGTATTAGTT AGTGAAAGTC AAAAAGATCC TTTTATGTAT CTATAA
|
Protein sequence | MGLKCQVIAS IMDKLAPRKF AEEWDNVGLL VGDGSKDVSK ILVALDATFE VVKEAIDKKV DMIVTHHPLI FKPIKNVKAD NPVGSLLIQL IKNDISLYAA HTSFDIAPNG MNDILCNVLG IYDREVLDVT YSEGYKKIAV YVPQGYEEIV KNAMCNAGAG FIGNYSNSTF QTQGIGTYKP LEGTNPFIGE IGKIEKVEEV KIETVVPQKY LEKVINAMLN VHPYEEVAYD VYPLENLKEE YGLGRIGTIS ETTLKELALQ VKAKLKINNL RVVGDPNKKI KKVAVCGGSG ASLIHKAVSR GADVLITADI GYHDAVEAQH LGLSLIDAGH FATENIAVRF IAEYIIDETQ KQGNEIEVLV SESQKDPFMY L
|
| |