Gene Cthe_2255 details

Gene Information

Locus tag: Cthe_2255
Symbol:
ID: 4809993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2683143
End bp: 2684555
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 38%
IMG OID: 640107661
Product: tRNA(Ile)-lysidine synthetase
Protein accession: YP_001038650
Protein GI: 125974740
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG0037] Predicted ATPase of the PP-loop superfamily implicated in cell cycle control
TIGRFAM ID: [TIGR02432] tRNA(Ile)-lysidine synthetase, N-terminal domain
            [TIGR02433] tRNA(Ile)-lysidine synthetase, C-terminal domain
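
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming a complete CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2683143, 2684555)
print(length)                           # 1413
print(expected_protein_length(length))  # 470
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.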


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000164425
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGACA AGGTATTGGA AACAATTAAA AAATACAATA TGATAAACAA CAAAGATAAA 
ATAGTGGTCG GAGTGTCCGG CGGTCCTGAT TCGGTGTGCC TTCTGCACAT TCTGTGCAAA
CTTAGGGAAA GCATGGATTT GGGGCTTGTG GCAGTTCATG TCAACCATAT GTTGAGAGGG
GACGAAGCTT CCGGAGATGA GGCTTTTGTT GAGAACTTGT GCAAAAAATT GAATGTTGAA
CTTGTCACCC GGCGTATTGA CATAAAAAAG CTTGCAAAGG AACGAAGGCT TTCCCTGGAG
GAGACCGGCC GGATAGAGAG GTATAGACTT TTTGAGGAAG TTGCGGATAA CTTCGGCGCC
CAAAGAATTG CTGTGGCCCA CAATAAAAAT GACCAGGCGG AAACCGTGCT GATGAATATT
ATAAGAGGCA CAGGTCTTGA CGGACTTAGG GGAATGGATT ATATTCGAGG TAGGATAATA
AGACCTCTTT TGGGTGTTGA GAGAACGGAA ATTGAAAATT ACTGCCGGAT TCACAACCTA
AATCCGCGTA TTGACAGTAC AAATTTGGAG AATATTTATA CCAGAAACAA AATAAGACTT
GATCTGATAC CCTATATAGA GAAATTATTC AATGCCGATA TAGTAAATGG CATAAGCAAA
ATGGCAGATT TAATAAAAGA TGACGTAAGC TTTATCGAAA GCCGGACAGA TGAAATTTAC
AACAAAGCAA AAATAAAGAG CGATGACAAG GGAGTAATTT TAGATCTTAA TATTTTAAAG
GAATGTCACA TTGCGGCTCG GAAAAGAATA ATTCGAAATT CCATAAAACA AATCAAAGGT
GACATAAAAG GAATTGCAAC AGTGCATATT GACAGTATAA CAGATCTGAT TGAAAACGGT
AAAACGGGAT CAATGCTGCA CCTTCCCCAC GGAGTAAGGG CAGTGAAATC CTATAATACC
TTGAAAATAT GTTTGCACGA GCTTAAAGAG GAAGACATAT ATTTCAATAA GGAAGTAAAC
ATACCCGGAA TTACGGTTGT TGATGAAATA TGTGGCAGCT TGGAGGCAAC CCTTATTGAT
GTTTCCGCAG ATTGCTTTAG TATAGAGGAT TTTACAAAGG TACCGGACAA AAGCAAGGTT
CAGTTTTTTG ATTATGACAG GCTGAAAGAG GGAATATACT TAAGAAACAG AAGAGACGGT
GATGTTTTCA GGCCCCGTAA CTCAAACGGC ACCAAAAAAC TCAAGGAGTT TTTTATTGAC
AATAAAATCC CAAGAGAAAC AAGAAATCAA ATACCGTTAA TTTCAACACG TAAAGAAATT
GTATGGATAA TAGGTTACAA AATCAGTGAT AAATTTAAAG TAACTGAAAA TACTAAAATC
ATACTGAAAT TATCCTATGA TAACTCGCAC TGA
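
A GC content figure such as the one in the Gene Information section is the percentage of G and C bases in a nucleotide sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the 10-base blocks used above. The helper name `gc_percent` is hypothetical, and how the database rounds its value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# Toy input, not the gene itself: 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence as a single string gives a value to compare against the GC content field.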
 
Protein sequence
MIDKVLETIK KYNMINNKDK IVVGVSGGPD SVCLLHILCK LRESMDLGLV AVHVNHMLRG 
DEASGDEAFV ENLCKKLNVE LVTRRIDIKK LAKERRLSLE ETGRIERYRL FEEVADNFGA
QRIAVAHNKN DQAETVLMNI IRGTGLDGLR GMDYIRGRII RPLLGVERTE IENYCRIHNL
NPRIDSTNLE NIYTRNKIRL DLIPYIEKLF NADIVNGISK MADLIKDDVS FIESRTDEIY
NKAKIKSDDK GVILDLNILK ECHIAARKRI IRNSIKQIKG DIKGIATVHI DSITDLIENG
KTGSMLHLPH GVRAVKSYNT LKICLHELKE EDIYFNKEVN IPGITVVDEI CGSLEATLID
VSADCFSIED FTKVPDKSKV QFFDYDRLKE GIYLRNRRDG DVFRPRNSNG TKKLKEFFID
NKIPRETRNQ IPLISTRKEI VWIIGYKISD KFKVTENTKI ILKLSYDNSH
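
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of using a bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it allows, which does not matter here because the gene starts with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATTGACA AGGTATTGGA AACAATTAAA"))  # MIDKVLETIK
```

The output matches the first ten residues of the protein sequence. The same call on the full gene sequence can be compared with the full protein in the same way.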