Gene Information |
Locus tag | Ccel_2140 |
Symbol | |
ID | 7310836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 2507717 |
End bp | 2508928 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643609073 |
Product | proposed homoserine kinase |
Protein accession | YP_002506464 |
Protein GI | 220929555 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3635] Predicted phosphoglycerate mutase, AP superfamily |
TIGRFAM ID | [TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form; [TIGR02535] proposed homoserine kinase |
|
|
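The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The record does not state its coordinate convention, so that reading is an assumption, and the helper names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2507717, 2508928)
print(length)                  # 1212
print(protein_length(length))  # 403
```

Both results match the Gene Length (1212 bp) and Protein Length (403 aa) rows of the record.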
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATG TTGTGATTCT TGGTGATGGT ATGGCTGATT ATACCATACC TGAACTTGAT AACAAGACCC CTCTGCAATA CGCAAAAAAG CCGTCAATTG ATATGCTGGC TTCAAAGGGT ACTGTAGGGC TTGTTCGTAC TGTTCCCGAA GGTATTGCCC CCGGTAGTGA TGCTTGTAAC CTTTCGGTGA TGGGCTATAA CCCCGAAGTA TATTACACAG GTCGTTCACC TCTTGAGGCT GTAAGTATGG GAATAGAACT GTCACCTACG GATGTTGCAC TCAGATGTAA TCTTGTACAC CTTTCCGAAG AAGAATCTGA GTATTCCCAA AAAATAATGA TAGATTACAG TTCAGATGAA ATTTCAACGG CTGAATCTAA AGTTCTTATT GATGCAGTAA ATAATGCTCT TAAGACTGAG AACATAGTTT TTCACCCCGG AATAAGCTAC AGGCATTGTG TGGTGTGGAG TAACGGCAGG ACAGGTCTTG GTTGTACTCC TCCCCACGAT ATTTCCGAGA AAAGGATTTC AGATTTTCTT CCAAAGGAAG AGTCAGGATT GCTGCTTGAT TTGATGAAGA AAAGTTATGA CATTTTAAAA GATCATCCCA TAAACCAGGC AAGAAGGGCA AAGGGACTAA GAACCGCAAA CTCCATCTGG CTTTGGGGCG AAGGTAAGAA ACCTGCTTTG TCTTCTTTCC AGGAAAAATA CCATATAACA GGAGCAATGG TTTCCGCAGT GGATTTACTT AAAGGTATCG GTATTTGTGC GGGACTTGAT TCCATTGATG TTGAAGGTGC TACAGGTAAT ATTGATACAA ATTTTATCGG TAAGGCTAAT GCAGCTATTC AGGCCCTTGA AAGCGGAAAA GATTTTGTTT ACGTACATGT TGAAGCCCCT GATGAGTGCG GTCACAGACA TGAAATCGAA AACAAGGTCA AGGCCATAGA GTTGTTGGAT TCACAGGTTG TAAAACCTAT ACTGGAAGGT ATAAGTAAGT ATGATTACCG TGTATTGGTA CTCCCTGACC ATCCTACGCC TCTTCGCTTA CGCACTCATA CATCCGAACC GGTTCCATTT ATTATTTATG ACAGCACTAA TGAAATTCAA TCACAGGCTA AGAGCTATGA TGAATTTGAA GCTAAAAAAT CAGGTGTATT CATAGAAGAC GGTTACAAAT TGATGGATTT GTTAATTAAA GGAAGTTTTT AA
|
Protein sequence | MKYVVILGDG MADYTIPELD NKTPLQYAKK PSIDMLASKG TVGLVRTVPE GIAPGSDACN LSVMGYNPEV YYTGRSPLEA VSMGIELSPT DVALRCNLVH LSEEESEYSQ KIMIDYSSDE ISTAESKVLI DAVNNALKTE NIVFHPGISY RHCVVWSNGR TGLGCTPPHD ISEKRISDFL PKEESGLLLD LMKKSYDILK DHPINQARRA KGLRTANSIW LWGEGKKPAL SSFQEKYHIT GAMVSAVDLL KGIGICAGLD SIDVEGATGN IDTNFIGKAN AAIQALESGK DFVYVHVEAP DECGHRHEIE NKVKAIELLD SQVVKPILEG ISKYDYRVLV LPDHPTPLRL RTHTSEPVPF IIYDSTNEIQ SQAKSYDEFE AKKSGVFIED GYKLMDLLIK GSF
|
| |
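The sequences above are printed in blocks of ten characters separated by spaces. As a rough sketch of how the record could be checked, the code below strips that spacing, computes a GC fraction, and translates a coding sequence. All function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; table 11's alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty one)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in the record.
first_blocks = "ATGAAATATG TTGTGATTCT TGGTGATGGT"
print(translate(first_blocks))  # MKYVVILGDG
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence rows.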