Gene Cthe_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0022 
Symbol 
ID4808787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp28483 
End bp29649 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content44% 
IMG OID640105432 
Product2-amino-3-ketobutyrate coenzyme A ligase 
Protein accessionYP_001036457 
Protein GI125972547 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0282575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGATT TTCAAGGTGC TTTGAAAGAT ATAAAAAACA AGGGATTATA CAGGGAGTTT 
CGAAATGTTA ATGCTGCCCA GGGCCCGTAT ACCGTTATTG ACGGAAGAAA AATGCTTATG
ATGTCATCAA ATAATTACCT GGGCTTGTGT GATGATATAA GGCTTAAAAG AGCTGCGATA
GAATCTATTC GTAAATTTGG TGTGGGAGCC GGAGGCTCAA GGCTGACTTG CGGAAACTTT
GAGCTTCACA GGGAGCTGGA GGAGAGGCTT GCAAAATTTA AGGATGTGGA AAGCTGTATT
GTTTTTGGAA GCGGATATGC CGCAAATATA GGAGCAATAT CGGGAATTGC GGACAAAAAC
TGGGTCATAT TCTGCGACCG TCTGAACCAT GCCAGCATTG TGGACGGCAT TCGCCTAAGC
GGTGCAAAAC TTGTGGTGTA TAAACACTGC GACATGGAGG ACCTTGAAAG CAAGATTGTA
CGCTATCATA CCGGCAAAAG CCTTATAGTA ACGGATGGCG TGTTCAGCAT GGACGGGGAT
GTGGCACCGG TGGATAGGAT TGTGAAGTTG GCTAAAAAAT ACAATCTTAT GACAATGGTG
GATGATGCCC ATGCCACAGG AATTTTGGGA GAAAAGGGAA GGGGGACGTC GGAGTACTTT
GGCCTTAAAG ATGCTGTTGA TATAAGCATG GGTACTTTGA GCAAGGCTTT TGGTGTTGAA
GGGGGATTTG TTGCAGGAAA GAGAAAGCTT GTTGATTTTT TACGGCACAA GGCCAAAAGC
TTTATTTACT CTACTGCTCC GCCGCCTCAT AATATGGCTG CGGCGTTAGA AGCTTTGAAT
ATCATAGAAA CGGAGCCGCA GGCAAGAAAG GAATTGGCTG AAAAATCCGT GTGGCTAAGA
AACAGGCTTA TAGAAAAAGG TTTTAACGTG CCCAAAGGGG TGACGCCGAT AATACCGCTT
ATGGTGGGAG ATGTAAATAC TGCAGTAGAG TTTAGTATGC TGCTTTATAA CGAAGGGATA
TATATTCCTG CCATCAGGCC GCCAACAGTT CCTAAAGGAA CGAGCAGGCT TAGAATTTCC
ATAATGGCTT CCCATTCCTA TGAAGACATG GAGTTTGCCC TTAAAAACCT TGTCCGGTTC
GGAAGGAAGT TGGGGATAAT ACCATAA
 
Protein sequence
MYDFQGALKD IKNKGLYREF RNVNAAQGPY TVIDGRKMLM MSSNNYLGLC DDIRLKRAAI 
ESIRKFGVGA GGSRLTCGNF ELHRELEERL AKFKDVESCI VFGSGYAANI GAISGIADKN
WVIFCDRLNH ASIVDGIRLS GAKLVVYKHC DMEDLESKIV RYHTGKSLIV TDGVFSMDGD
VAPVDRIVKL AKKYNLMTMV DDAHATGILG EKGRGTSEYF GLKDAVDISM GTLSKAFGVE
GGFVAGKRKL VDFLRHKAKS FIYSTAPPPH NMAAALEALN IIETEPQARK ELAEKSVWLR
NRLIEKGFNV PKGVTPIIPL MVGDVNTAVE FSMLLYNEGI YIPAIRPPTV PKGTSRLRIS
IMASHSYEDM EFALKNLVRF GRKLGIIP