Gene Cthe_2585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2585 
Symbol 
ID4809192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3056842 
End bp3057858 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content42% 
IMG OID640107999 
Productbiotin--acetyl-CoA-carboxylase ligase 
Protein accessionYP_001038978 
Protein GI125975068 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000643253 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGG TGAAAGAGGT AATTTTGAAA AAATTAAAAG AATCAACACA AGACTATGTT 
TCCGGCGAGG AACTGAGCAA TATCCTTGGT GTGTCAAGGA CGGCGGTATG GAAATGTATT
AACGAGTTGA AAAAGGAAGG GTATGTTATT GATTCTTCTT CAAAAAAGGG CTATAAATTA
TGTTATGTGC CTGATATAAT TAACTCATGG GAGATAAAAG AAGGCCTTGG CACCAGGATT
ATCGGCCAAA ATATTCACTG CTTTTCCGAG ATTGATTCAA CCAACAACTA TGCGAAAACA
CTTGCTCAAA AGGGTTGCGA TGACGGTACG GTTGTTTTGG CCGAACACCA GACCCAGGGA
CGGGGAAGGC TTGGCAGAAG CTGGGATTCC ATGGGCGGAA AAGGAATATG GATGTCCATT
GTGCTTCGAC CGGCTGTTGG ATTGGAGGAT GTGCAGATTA TCACTCTTGC TGCCGCTGTG
GCAGTGGTTT TGGCATTCAA GAAAGTAATG GGCATAGATG CCGGGATAAA GTGGCCCAAC
GATATAGTGC TGGACGGAAA AAAAGTTTGC GGTATACTCA CTGAAATGAG CATGGAGATG
GAAAGAATCA ATTTCCTCAT TCTTGGGATT GGAATAAACT TTAGTCATGA GGAATCTGAA
TTTCCTGAAG AGATAAGAGA CAGAGCTACA TCTTTGGGCA TTTATTTAAA AGAGAAAAAA
GGTATGGATA TTTCCCGTTT TAAAAGGAGT CATCTTATAA GAGCAATATT GTCGGAACTG
GAAGAAGTAT ATGACATGAT TAACGAAGGC AAAGCAGGTG TAATTGTGGA AGAATGGAAA
AAGTATTCGG TGACACTGGG AAAAGAAGTG GTTATAAAGT ACAGGGAGGA GCAGTACACC
GGGATTGCGC AGGATGTAGA CCAAAGCGGC AGGCTTATAG TAAAGCAGGA TGACGGGACA
GTGAGGGAAA TTTTGTCGGG GGAGGTTTCT GTAAGAGGAC TTTTAGGATA TACATAG
 
Protein sequence
MIKVKEVILK KLKESTQDYV SGEELSNILG VSRTAVWKCI NELKKEGYVI DSSSKKGYKL 
CYVPDIINSW EIKEGLGTRI IGQNIHCFSE IDSTNNYAKT LAQKGCDDGT VVLAEHQTQG
RGRLGRSWDS MGGKGIWMSI VLRPAVGLED VQIITLAAAV AVVLAFKKVM GIDAGIKWPN
DIVLDGKKVC GILTEMSMEM ERINFLILGI GINFSHEESE FPEEIRDRAT SLGIYLKEKK
GMDISRFKRS HLIRAILSEL EEVYDMINEG KAGVIVEEWK KYSVTLGKEV VIKYREEQYT
GIAQDVDQSG RLIVKQDDGT VREILSGEVS VRGLLGYT