Gene Information |
Locus tag | Cthe_1375 |
Symbol | |
ID | 4809370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1678188 |
End bp | 1679537 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106799 |
Product | aspartate kinase |
Protein accession | YP_001037800 |
Protein GI | 125973890 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00657] aspartate kinase |
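The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1678188, 1679537)
print(length)                  # 1350
print(protein_length(length))  # 449
```

Both values agree with the Gene Length (1350 bp) and Protein Length (449 aa) fields listed in this record.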
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000974637 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTG CAAAATTTGG AGGTTCGTCA CTGGCGGATG CGAATCAAAT AAGAAAAGTT TGTGATATTA TTTTAAGTGA CAAGGACAGA AAGCTGATTG TCGTTTCCGC TCCGGGTAAA CGCTGTAAGG AAGATACAAA GGTTACGGAC CTTTTAATTG CTTTAGGAGA AAAATATTAT AAGGAAGGAA AAGCGGATGC CGAACTTCAG GCTGTTATTG ATAGATTTGA TGATATTGTA AAAGGCCTTG AGCTTTCGCC CGATATTACA CAAATGGTTG CAGATGATTT GAAAAAAAGG CTTGAATGCA ATAACGGCAA CAAGGATAAG TTTATGGACA CCATAAAGGC TGCCGGAGAG GATAACAATG CAAAAGTGGT TGCAGCTTAT CTGGTAAGCA GGGGAATTGA TGCCGAGTAT GTAAATCCAA AAGATGCGGG ACTTTTGCTT AGTGAGGAAT ACGGAAATGC AAGGGTGCTG CCGGAATCAT ATGAGAATTT GAAACGCCTG CGTGAAAGGG ATAAAATAAT GATTTTCCCC GGCTTTTTCG GATATTCAAA GAAGGGGGAT GTTGTTACAT TCCCGAGGGG AGGTTCCGAC ATAACGGGAG CCATACTTGC AGCTGCGGTA AAAGCTGATG TGTATGAAAA CTTTACCGAC GTTGACTCGG TTTTTGCCGC AAATCCCAAC ATTATCGAAA ACCCGAAACC GATTGCGACT TTTACATACA GGGAAATGAG GGAGCTTTCT TATTCAGGTT TTTCAGTGCT GCATGAGGAA ACTCTTGAAC CGGTTTACAG AATGGAAATT CCTGTATGTA TTAAAAACAC CAACAATCCG TCTGCTCCCG GAACTACAAT TGTGCCGAAA AGGAAACTGG ACAACGGCCC TGTTATCGGC ATAGCAAGCG GTACCGGATT CTGCTGCATT TATATAAGCA AGTACATGAT GAACAGGGAA ATTGGTTTTG GAAGAAAGGT GCTTAGTATT TTGGAAGATG AAGGGCTGTC CTATGAGCAT ATTCCTTCAG GGATTGACAA CATGTCCATT ATAATTGAGC AAAAGCAGCT CGACAAAGCT AAGGAAGAGA GAGTGGTAAG AAGGATAAAG GATGAATTGA ATGTTGATGA CATAAAGATA GAATATGACC GTGCGCTGGT TATGATTGTA GGAGAAGGCA TGATGAGCAC GGTGGGAATT GCTGCAAGAG CTTGTACTGC TTTGGCAAAA GCAAATGTAA ACATAGAGAT GATAAATCAG GGTTCATCGG AAGTAAGCAT GATGTTTGGT GTAAAGGCTG AAGATAATGT CAAGGCGGTA AAGGCTTTGT ATGATGAGTT TTTCAGCTAA
Protein sequence | MKVAKFGGSS LADANQIRKV CDIILSDKDR KLIVVSAPGK RCKEDTKVTD LLIALGEKYY KEGKADAELQ AVIDRFDDIV KGLELSPDIT QMVADDLKKR LECNNGNKDK FMDTIKAAGE DNNAKVVAAY LVSRGIDAEY VNPKDAGLLL SEEYGNARVL PESYENLKRL RERDKIMIFP GFFGYSKKGD VVTFPRGGSD ITGAILAAAV KADVYENFTD VDSVFAANPN IIENPKPIAT FTYREMRELS YSGFSVLHEE TLEPVYRMEI PVCIKNTNNP SAPGTTIVPK RKLDNGPVIG IASGTGFCCI YISKYMMNRE IGFGRKVLSI LEDEGLSYEH IPSGIDNMSI IIEQKQLDKA KEERVVRRIK DELNVDDIKI EYDRALVMIV GEGMMSTVGI AARACTALAK ANVNIEMINQ GSSEVSMMFG VKAEDNVKAV KALYDEFFS
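The GC content and protein sequence fields can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the sequence is pasted as shown here (10-base blocks separated by spaces) and contains only A, C, G and T. It builds the codon table from the standard NCBI amino-acid string, which to the best of our knowledge gives the same codon-to-amino-acid assignments as translation table 11 (table 11 differs in its permitted start codons). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
fragment = "ATGAAAGTTG CAAAATTTGG AGGTTCGTCA"
print(translate(fragment))  # MKVAKFGGSS
```

Passing the full Gene sequence value to `gc_percent` and `translate` allows the result to be compared with the GC content and Protein sequence fields of the record.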