Gene Cthe_3093 details


Gene Information

Locus tag: Cthe_3093
Symbol:
ID: 4809967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3646611
End bp: 3647885
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 44%
IMG OID: 640108516
Product: adenylosuccinate synthetase
Protein accession: YP_001039481
Protein GI: 125975571
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0104] Adenylosuccinate synthase
TIGRFAM ID: [TIGR00184] adenylosuccinate synthase
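
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the last codon of the CDS is a stop codon. The function names are illustrative only and do not come from any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cthe_3093.
length = gene_length_bp(3646611, 3647885)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.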


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000909449
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACCA GAGTTGTAGT AGGCACCCAG TGGGGAGATG AGGGAAAAGG CAAGTATATT 
GACATGCTGG CCAAAGACTC GGACATGGTG GTGCGATTTT CAGGAGGAAA CAATGCCGGA
CACACGATAG TGGCCAACGG TGTAAAATAT GCGTTGCATC TTATACCGTC GGGCATATTG
AATGAAGGCA AAACTTGTAT TATAGGCAAC GGTGTTGTGG TTGATCCGGC AGTTTTGCTA
AAGGAAATTA AGGAGCTTAA TGAGAAAGGG ATAAGTACTG ACAGGCTTTT GATAAGTGAC
AGGGCTCATG TTATCATGCC GTACCACAAA CTTTTGGATG AGCTTCAGGA GAAGTTTCGT
GGAGAGAATT CAATAGGGAC AACCAAAAGA GGAATTGGGC CGTGCTACTC TGACAAGACG
GAACGATCGG GAATCAGAAT GTGCGACCTT GTTGATGAAG ATGAATTTGT CAGGAAGGTA
AGAGAAAACT TGAAGGTTAA GAACCTCATA ATTGAAAAGG TATACGGCGG ACAAAAACTG
GATGAGGAAC AGGTTATATC CGAATATCTT GAATATGGAA GAAAGCTTAA GGAATACGTT
GCGGATGTAA ACAGCATTAT ATTTGAGGCC ATAGAGCAGG GAAAAAATAT ATTGTTTGAA
GGAGCCCAGG CAACATTTTT GGATCTTGAT TTCGGAACCT ACCCTTATGT CACTTCTTCC
AATCCTGTGG CAGGTGGAGT TTGTACAGGT GCAGGAGTCG GACCTGTTTT TATCAATGAG
GTATATGGGG TTCTGAAAGC CTATACGTCA AGAGTTGGCG CAGGACCGTT CCCGACGGAA
CAGAACAACG AAATAGGCGA CAGAATAAGA GAACTTGGAT GGGAATATGG CACAACTACG
GGAAGGCCAA GACGCTGCGG GTGGCTTGAT CTCGTTATGA TAAAGTATGC TGCCAGAGTA
AACGGACTTA CCGCACTGGC AATAAACCAT GTTGATACAA TAGGAAAGCT GCCAAAAATC
AAGCTTTGTG TTGCGTATAA AAAGAACGGG CAGGAAACGC GCAATTTCCC GTGCAGCTTA
AAAGAGCTTG CCCAATGTGA ACCCGTATAT GAGGAATTTG ACGGTTGGGA TGAAGACATA
TCAAACGTAA AGTCCTTTGA TGATCTTCCT GACAACGCGA AAAAGTATCT GAGCAGAATA
GAAGAAATTG TCGGAGTAAA AATAAAACTG ATTGGTGTGG GGAAGGAAAG AGAGCAGACT
ATAGTCGTAA ACTAA
 
Protein sequence
MATRVVVGTQ WGDEGKGKYI DMLAKDSDMV VRFSGGNNAG HTIVANGVKY ALHLIPSGIL 
NEGKTCIIGN GVVVDPAVLL KEIKELNEKG ISTDRLLISD RAHVIMPYHK LLDELQEKFR
GENSIGTTKR GIGPCYSDKT ERSGIRMCDL VDEDEFVRKV RENLKVKNLI IEKVYGGQKL
DEEQVISEYL EYGRKLKEYV ADVNSIIFEA IEQGKNILFE GAQATFLDLD FGTYPYVTSS
NPVAGGVCTG AGVGPVFINE VYGVLKAYTS RVGAGPFPTE QNNEIGDRIR ELGWEYGTTT
GRPRRCGWLD LVMIKYAARV NGLTALAINH VDTIGKLPKI KLCVAYKKNG QETRNFPCSL
KELAQCEPVY EEFDGWDEDI SNVKSFDDLP DNAKKYLSRI EEIVGVKIKL IGVGKEREQT
IVVN
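
The gene and protein sequences can be checked against each other by translating the CDS. The sketch below assumes the sequence is stored as whitespace-separated blocks of 10 letters, as shown here, and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard codon table is built here. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and introduced only for this illustration.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Raises KeyError on codons with letters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGGCTACCA GAGTTGTAGT AGGCACCCAG TGGGGAGATG AGGGAAAAGG CAAGTATATT"
print(translate(first_line))  # MATRVVVGTQWGDEGKGKYI
```

The translated residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be compared in the same way.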