Gene Nther_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1091 
Symbol 
ID6316616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1152776 
End bp1153687 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content39% 
IMG OID642643464 
Productcysteine synthase 
Protein accessionYP_001917263 
Protein GI188585718 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000826995 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00308217 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTGCTT TAAAATTAAT AGGTAATACA CCTTTGATTA GAATGAGCCG GAATATAGTG 
GGCACTGAAG CTGAGGTTTT TGCAAAACTA GAAATGTTTA ATCCAGGAGG TAGCGTAAAA
GATAGAATTG CATTAAGTAT GATTAACTCG GCCGAACAAA ACGGGCATTT ATCACAGGGA
GGAACAATTC TAGAACCGAC CAGTGGAAAC ACTGGGATCG GATTAGCTAT TGTAGCTGCT
GTTAAAGGAT ATCAATTAAT TTTGACTATG CCAGAGAGTA TGAGTGAAGA AAGACGGGCA
TTATTAAAAT CTTATGGAGC AGAGCTTGTA CTGACGCTAG CAGATAAAGG TATGGGAGGA
GCTGTTGAGA AAGCTAATCA AATTAAAAGG GAAAATCCGG ATTACTTTAT CCCTCAACAA
TTTAATAACA TCAGTAATCC AGAAATACAC AAACAAACTA CTGCCAGGGA AATTATTTCA
GAATTAGATT CAGATATAGA TGGATTAGTA CTCGGTGTTG GTACTGGCGG AACAATTACA
GGTGTAGGTG AAGTTTTAAA ACACAAAAAT CCTAATTTAA AAATCTTTGC AGTTGAACCA
AAGGAATCAC CGGTACTGTC TGGAGGGAAT CCAGGTCCTC ATAAAATTCA AGGGCTAGGG
GCAGGTTTTG TACCTCAAGT ATTAAAGACA GAATTGATTG ATGAAGTAAT TCAGGTTGAT
AGTTCTGAAG CCTATGATAT GAGCAATCAA TTGGCAAAAC AGGAAGGTTT ACTGGCCGGA
ATATCTAGTG GTGCTGCATT GAAAGGGGTA TTAAAGGCTT TAAAACAATT ACCATCAGGG
GCAAGAGTAG TGACAGTTTT CCCTGACACG GGAGAGCGTT ACTTGAGCAT GGCTCCTTAT
TTTAACTTAT AG
 
Protein sequence
MSALKLIGNT PLIRMSRNIV GTEAEVFAKL EMFNPGGSVK DRIALSMINS AEQNGHLSQG 
GTILEPTSGN TGIGLAIVAA VKGYQLILTM PESMSEERRA LLKSYGAELV LTLADKGMGG
AVEKANQIKR ENPDYFIPQQ FNNISNPEIH KQTTAREIIS ELDSDIDGLV LGVGTGGTIT
GVGEVLKHKN PNLKIFAVEP KESPVLSGGN PGPHKIQGLG AGFVPQVLKT ELIDEVIQVD
SSEAYDMSNQ LAKQEGLLAG ISSGAALKGV LKALKQLPSG ARVVTVFPDT GERYLSMAPY
FNL