Gene Information |
Locus tag | Nther_1091 |
Symbol | |
ID | 6316616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1152776 |
End bp | 1153687 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642643464 |
Product | cysteine synthase |
Protein accession | YP_001917263 |
Protein GI | 188585718 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A
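The coordinate and length fields in this table are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that does not contribute a residue.

```python
# Values copied from the Gene Information table.
start_bp = 1152776
end_bp = 1153687

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the remainder should be 0 for an intact frame.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 912 0 303
```

Both results match the Gene Length (912 bp) and Protein Length (303 aa) fields listed above.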
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000000826995 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00308217 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTGCTT TAAAATTAAT AGGTAATACA CCTTTGATTA GAATGAGCCG GAATATAGTG GGCACTGAAG CTGAGGTTTT TGCAAAACTA GAAATGTTTA ATCCAGGAGG TAGCGTAAAA GATAGAATTG CATTAAGTAT GATTAACTCG GCCGAACAAA ACGGGCATTT ATCACAGGGA GGAACAATTC TAGAACCGAC CAGTGGAAAC ACTGGGATCG GATTAGCTAT TGTAGCTGCT GTTAAAGGAT ATCAATTAAT TTTGACTATG CCAGAGAGTA TGAGTGAAGA AAGACGGGCA TTATTAAAAT CTTATGGAGC AGAGCTTGTA CTGACGCTAG CAGATAAAGG TATGGGAGGA GCTGTTGAGA AAGCTAATCA AATTAAAAGG GAAAATCCGG ATTACTTTAT CCCTCAACAA TTTAATAACA TCAGTAATCC AGAAATACAC AAACAAACTA CTGCCAGGGA AATTATTTCA GAATTAGATT CAGATATAGA TGGATTAGTA CTCGGTGTTG GTACTGGCGG AACAATTACA GGTGTAGGTG AAGTTTTAAA ACACAAAAAT CCTAATTTAA AAATCTTTGC AGTTGAACCA AAGGAATCAC CGGTACTGTC TGGAGGGAAT CCAGGTCCTC ATAAAATTCA AGGGCTAGGG GCAGGTTTTG TACCTCAAGT ATTAAAGACA GAATTGATTG ATGAAGTAAT TCAGGTTGAT AGTTCTGAAG CCTATGATAT GAGCAATCAA TTGGCAAAAC AGGAAGGTTT ACTGGCCGGA ATATCTAGTG GTGCTGCATT GAAAGGGGTA TTAAAGGCTT TAAAACAATT ACCATCAGGG GCAAGAGTAG TGACAGTTTT CCCTGACACG GGAGAGCGTT ACTTGAGCAT GGCTCCTTAT TTTAACTTAT AG
|
Protein sequence | MSALKLIGNT PLIRMSRNIV GTEAEVFAKL EMFNPGGSVK DRIALSMINS AEQNGHLSQG GTILEPTSGN TGIGLAIVAA VKGYQLILTM PESMSEERRA LLKSYGAELV LTLADKGMGG AVEKANQIKR ENPDYFIPQQ FNNISNPEIH KQTTAREIIS ELDSDIDGLV LGVGTGGTIT GVGEVLKHKN PNLKIFAVEP KESPVLSGGN PGPHKIQGLG AGFVPQVLKT ELIDEVIQVD SSEAYDMSNQ LAKQEGLLAG ISSGAALKGV LKALKQLPSG ARVVTVFPDT GERYLSMAPY FNL
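The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). Only the first three and the last two 10-base groups of the gene sequence are used here; pasting the full sequence into the same helpers would check the whole record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces between 10-base groups and normalise to upper case."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate complete codons from the first base; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First 30 bases and last 12 bases of the gene sequence in this record.
head = "ATGAGTGCTT TAAAATTAAT AGGTAATACA"
tail = "TTTAACTTAT AG"

print(translate(head))  # MSALKLIGNT
print(translate(tail))  # FNL*
```

The two fragments translate to the first ten and last three residues of the listed protein sequence, followed by the stop codon TAG. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field; the short fragments above are not representative of it.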