Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1714 |
Symbol | |
ID | 7409229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1803490 |
End bp | 1804674 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716090 |
Product | cysteine desulfurase NifS |
Protein accession | YP_002573581 |
Protein GI | 222529699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000384465 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGAA AGATTATTTA TTTTGACCAT GCAGCCACAA CCCCTCTTAA AAAGGAAGTA TTAGATGAAA TGATGCCGTA TTTGACAGAT CAGTACGGCA ATCCTTCAAC AATTTACAAG CTTGGAAGAG AAGCAAAAAA AGCCATTGAA CTTGCAAGAG AAAGGGTCGC AAAGGCCTTA AATGCTGATA TTCAAGAAAT TTACTTTACT TCCGGTGGAA CAGAATCAGA TAACTGGGCA TTAAAAGGAG TTGCTTTTGC AAATAAAGAT AAAGGCAAGC ATATTATAAC AACAACAATC GAGCACCATG CGGTTTTGCA TCCTCTAAAA TATCTTGAAG GTTTAGGATT TGAAGTAACA TATGTTCCTG TTGAGCCAAA TGGTATTGTA GACCCTCAAA AAGTCAAAGA GGCAATAAAA AATGGCACTA TTTTGATTTC TGTCATGCTT GCAAATAACG AAATTGGGAC AATCCAGCCT GTCAAAGAGA TAGCAAAGAT AGCAAAGGAA AAGGGAATAA TCGTTCATAC TGATGCTGTT CAAGCAGTTG GGCAAATTCC TGTTGATGTA AAAGATTTGG GTGTCGACCT TTTATCACTT TCTGCTCATA AATTCTATGG GCCAAAAGGT GTTGGTGCAC TTTATATCAG AAAAGGGACA AAGATTCATC CATTTTCGCA TGGAGGTGCA CAGGAGAAAA ATAGGCGTGC TGGAACAGAG AATGTAGCAG GGATTGTTGG ACTTGGCAAG GCTATAGAGC TTGCAACTCA GAATCTTTCT GAGTATGCTG CAAAGCTTCA AAAACTGAGA GATAAGCTCA TTGACGGGGT TTTAAGCAAA ATTGATTATG TTCGACTAAA TGGTGATAGA CATCAGAGAC TTCCTAACAA TGCAAACTTC TCATTTGAGT TTATTGAAGG TGAAAGCCTG CTTTTGATGC TTGACATGAA AGGAATTGCA GCATCAAGCG GGTCAGCATG TACATCAGGG TCTTTGGACC CTTCACATGT GCTTCTGGCA ATTGGACTTG AACATGAGGT TGCTCATGGA TCTTTGAGAA TAACACTTGG TGAAGATAAC ACCGAAGAAG ATATAGATTA TCTATTAGAA GTTTTGCCTG AAATTGTTTC AAGATTAAGA GAAATGAGTC CACTTTATGA AAGCGTAAAA AAAGGGGGTA ATTGA
|
Protein sequence | MEGKIIYFDH AATTPLKKEV LDEMMPYLTD QYGNPSTIYK LGREAKKAIE LARERVAKAL NADIQEIYFT SGGTESDNWA LKGVAFANKD KGKHIITTTI EHHAVLHPLK YLEGLGFEVT YVPVEPNGIV DPQKVKEAIK NGTILISVML ANNEIGTIQP VKEIAKIAKE KGIIVHTDAV QAVGQIPVDV KDLGVDLLSL SAHKFYGPKG VGALYIRKGT KIHPFSHGGA QEKNRRAGTE NVAGIVGLGK AIELATQNLS EYAAKLQKLR DKLIDGVLSK IDYVRLNGDR HQRLPNNANF SFEFIEGESL LLMLDMKGIA ASSGSACTSG SLDPSHVLLA IGLEHEVAHG SLRITLGEDN TEEDIDYLLE VLPEIVSRLR EMSPLYESVK KGGN
|
| |