Gene Cthe_1375 details

Gene Information

Locus tag: Cthe_1375
Symbol:
ID: 4809370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1678188
End bp: 1679537
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 42%
IMG OID: 640106799
Product: aspartate kinase
Protein accession: YP_001037800
Protein GI: 125973890
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase
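
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
START_BP = 1678188
END_BP = 1679537


def gene_length(start, end):
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1350
print(protein_length(length_bp))  # 449
```

Both results match the Gene Length and Protein Length fields of the record.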


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000974637
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTTG CAAAATTTGG AGGTTCGTCA CTGGCGGATG CGAATCAAAT AAGAAAAGTT 
TGTGATATTA TTTTAAGTGA CAAGGACAGA AAGCTGATTG TCGTTTCCGC TCCGGGTAAA
CGCTGTAAGG AAGATACAAA GGTTACGGAC CTTTTAATTG CTTTAGGAGA AAAATATTAT
AAGGAAGGAA AAGCGGATGC CGAACTTCAG GCTGTTATTG ATAGATTTGA TGATATTGTA
AAAGGCCTTG AGCTTTCGCC CGATATTACA CAAATGGTTG CAGATGATTT GAAAAAAAGG
CTTGAATGCA ATAACGGCAA CAAGGATAAG TTTATGGACA CCATAAAGGC TGCCGGAGAG
GATAACAATG CAAAAGTGGT TGCAGCTTAT CTGGTAAGCA GGGGAATTGA TGCCGAGTAT
GTAAATCCAA AAGATGCGGG ACTTTTGCTT AGTGAGGAAT ACGGAAATGC AAGGGTGCTG
CCGGAATCAT ATGAGAATTT GAAACGCCTG CGTGAAAGGG ATAAAATAAT GATTTTCCCC
GGCTTTTTCG GATATTCAAA GAAGGGGGAT GTTGTTACAT TCCCGAGGGG AGGTTCCGAC
ATAACGGGAG CCATACTTGC AGCTGCGGTA AAAGCTGATG TGTATGAAAA CTTTACCGAC
GTTGACTCGG TTTTTGCCGC AAATCCCAAC ATTATCGAAA ACCCGAAACC GATTGCGACT
TTTACATACA GGGAAATGAG GGAGCTTTCT TATTCAGGTT TTTCAGTGCT GCATGAGGAA
ACTCTTGAAC CGGTTTACAG AATGGAAATT CCTGTATGTA TTAAAAACAC CAACAATCCG
TCTGCTCCCG GAACTACAAT TGTGCCGAAA AGGAAACTGG ACAACGGCCC TGTTATCGGC
ATAGCAAGCG GTACCGGATT CTGCTGCATT TATATAAGCA AGTACATGAT GAACAGGGAA
ATTGGTTTTG GAAGAAAGGT GCTTAGTATT TTGGAAGATG AAGGGCTGTC CTATGAGCAT
ATTCCTTCAG GGATTGACAA CATGTCCATT ATAATTGAGC AAAAGCAGCT CGACAAAGCT
AAGGAAGAGA GAGTGGTAAG AAGGATAAAG GATGAATTGA ATGTTGATGA CATAAAGATA
GAATATGACC GTGCGCTGGT TATGATTGTA GGAGAAGGCA TGATGAGCAC GGTGGGAATT
GCTGCAAGAG CTTGTACTGC TTTGGCAAAA GCAAATGTAA ACATAGAGAT GATAAATCAG
GGTTCATCGG AAGTAAGCAT GATGTTTGGT GTAAAGGCTG AAGATAATGT CAAGGCGGTA
AAGGCTTTGT ATGATGAGTT TTTCAGCTAA
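
The gene sequence is printed in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that cleanup, assuming the sequence is available as a plain string; the helper names are hypothetical, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Stop codons of translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq):
    """Rough check: length divisible by 3, ATG start, stop codon at the end.

    Table 11 also allows alternative start codons; only ATG is checked here.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First row of the gene sequence, used as sample input.
first_row = "ATGAAAGTTG CAAAATTTGG AGGTTCGTCA CTGGCGGATG CGAATCAAAT AAGAAAAGTT"
seq = clean_sequence(first_row)
print(len(seq))                    # number of bases in the row
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction` with the GC content field.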
 
Protein sequence
MKVAKFGGSS LADANQIRKV CDIILSDKDR KLIVVSAPGK RCKEDTKVTD LLIALGEKYY 
KEGKADAELQ AVIDRFDDIV KGLELSPDIT QMVADDLKKR LECNNGNKDK FMDTIKAAGE
DNNAKVVAAY LVSRGIDAEY VNPKDAGLLL SEEYGNARVL PESYENLKRL RERDKIMIFP
GFFGYSKKGD VVTFPRGGSD ITGAILAAAV KADVYENFTD VDSVFAANPN IIENPKPIAT
FTYREMRELS YSGFSVLHEE TLEPVYRMEI PVCIKNTNNP SAPGTTIVPK RKLDNGPVIG
IASGTGFCCI YISKYMMNRE IGFGRKVLSI LEDEGLSYEH IPSGIDNMSI IIEQKQLDKA
KEERVVRRIK DELNVDDIKI EYDRALVMIV GEGMMSTVGI AARACTALAK ANVNIEMINQ
GSSEVSMMFG VKAEDNVKAV KALYDEFFS