Gene Moth_1706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1706 
Symbol 
ID3833156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1742489 
End bp1743409 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content60% 
IMG OID637829631 
Productcysteine synthase 
Protein accessionYP_430551 
Protein GI83590542 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000550718 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.263535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAG CCCGGGATGT TACCCAGTTG ATAGGGCAGA CGCCGATCCT GCGCCTGAAC 
CGCCTGGTAG AGGACGGGAT GGAGGTATAC TTAAAAATCG AGTTTTTTAA TCCTGGAGGC
AGTGTTAAGG ACCGCATTGC CCTCAGCATG ATTACCGCTG CGGAAGCCGA CGGCCGCTTG
AAACCCGGGG ATACCATTGT CGAGCCCACA AGTGGTAATA CGGGTATCGG CCTGGCCATG
GTGGCGGCCG TACGGGGTTA CCGCCTGATC CTGGTTATGC CGGAAACAAT GAGTATTGAG
AGGCGCAAGC TCCTGGCGGC CTATGGGGCG GAGTTCGTCC TGACTCCGGG AAACCTGGGG
ATGAAGGGGG CTGTAGATAA AGCAAATGAA CTGGTACGGG AGAACCCCGG TTACTTTATG
CCCCAGCAGT TTGAAAATCC GGCCAACCCG GCCATCCACC GCCAGACCAC GGCCAGGGAG
ATCCTGGAGC AAATGGACGG TAAAGTAGAC GCCTTTGTGG CCGGCGTCGG GACCGGGGGT
ACCCTCACCG GGGTGGGTGA GGTCCTGAAA AAGGAAATCC CGGGGGTAAG GATTGTAGCT
GTGGAACCGG CGGCTTCACC GGTCCTTTCT GGAGGCCAGC CAGGACCCCA TAAGATTCAG
GGTATAGGTG CCGGCTTTGT CCCGCCGGTC TTGCGCCGGG AGGTGGTTGA CGAGATAATC
ACCGTCACCA ACGAAGACGC TATGGAAACG GCCCGGCGCC TGGCACGGGA GGAAGGCCTG
CTGGTGGGCA TCTCTTCGGG CGCCGCTGCT TTTGCCGCCC TCCAAGTGGC CCGGCGCCTG
GGCCGGGGGA AAAGGGTACT GGCCATCGCC CCCGACACCG GCGAGCGCTA CCTGAGTACG
GAGTTGTTCA AGATAGGCTA A
 
Protein sequence
MKIARDVTQL IGQTPILRLN RLVEDGMEVY LKIEFFNPGG SVKDRIALSM ITAAEADGRL 
KPGDTIVEPT SGNTGIGLAM VAAVRGYRLI LVMPETMSIE RRKLLAAYGA EFVLTPGNLG
MKGAVDKANE LVRENPGYFM PQQFENPANP AIHRQTTARE ILEQMDGKVD AFVAGVGTGG
TLTGVGEVLK KEIPGVRIVA VEPAASPVLS GGQPGPHKIQ GIGAGFVPPV LRREVVDEII
TVTNEDAMET ARRLAREEGL LVGISSGAAA FAALQVARRL GRGKRVLAIA PDTGERYLST
ELFKIG