Gene Cthe_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0214 
Symbol 
ID4808632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp261339 
End bp262358 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content45% 
IMG OID640105627 
Productphenylalanyl-tRNA synthetase, alpha subunit 
Protein accessionYP_001036648 
Protein GI125972738 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAGC AGTTGAACAG TATCAGAGTT CAGGCAGAGC AGGAACTTTC CAATGTTGGA 
ACCATTGCGG AACTGGAAAA CATCAGAGTC AAATATTTGG GCAAAAAAGG GGAACTTACC
GCCGTATTAA GAGGCATGGG TTCTCTGTCG CCGGAAGAAA GGCCGGTAAT AGGCCAGCTG
GCAAATGAGA TTAGGGCTTA TATTGAGAGC CGTATAGAAG AGGCCAGGAA CGAGCTTATA
AAAAAAGAAA GAAGTCAAAA ACTCGAGAGA GAAGTCATTG ACGTAACCAT GCCGGGAAAA
AGAAAGATGC TTGGAAAGAA ACATCCTTTG AGCATTGTGA TAGACGAGAT TAAAGATGTG
TTTATAGGAA TGGGTTACGA AATAGCCGAA GGTCCTGAGG TTGAGCTGGA CTATTACAAC
TTTGAAGCCC TCAACATTCC CAGAAACCAC CCGGCAAGAG ATGTACAGGA CACTTTCTAC
ATAAATAACA ACATATTACT CAGAACCCAG ACATCTCCCG TTCAGATCAG GGTTATGGAG
AACAAAAAAC CACCGATTAA AATAATCTGC CCGGGAAGGG TGTACCGCTC CGATGCTGTG
GATGCCACCC ATTCACCTAT ATTCCATCAA GTGGAAGGAC TGGTGGTGGA CAAGGGAGTT
ACAATGGGAG ACCTTGTCGG AACTTTAAGA GTGTTTGCAA AGAGTTTGTT CGGAGAAAAG
ACAGAAATAA GGCTGAGACC TCACCATTTC CCATTCACAG AGCCCAGTGC CGAGGTTGAT
GTTTCATGCT GGGCTTGCGG AGGAACAGGC TGCAGAATAT GTAAAAATGA AGGCTGGATA
GAGATTTTGG GCGCCGGAAT GGTTCATCCG AAGGTTCTGG AGGTTTGCGG CATAGACCCT
GAAGTATACA GCGGATTTGC TTTTGGTCTT GGCGTCGAAA GAACGGCCAT GGGAAGATTT
AATATCGATG ACATGAGACT TTTGTATGAA AACGATATCA GGTTCTTAAA ACAGTTTTAA
 
Protein sequence
MKEQLNSIRV QAEQELSNVG TIAELENIRV KYLGKKGELT AVLRGMGSLS PEERPVIGQL 
ANEIRAYIES RIEEARNELI KKERSQKLER EVIDVTMPGK RKMLGKKHPL SIVIDEIKDV
FIGMGYEIAE GPEVELDYYN FEALNIPRNH PARDVQDTFY INNNILLRTQ TSPVQIRVME
NKKPPIKIIC PGRVYRSDAV DATHSPIFHQ VEGLVVDKGV TMGDLVGTLR VFAKSLFGEK
TEIRLRPHHF PFTEPSAEVD VSCWACGGTG CRICKNEGWI EILGAGMVHP KVLEVCGIDP
EVYSGFAFGL GVERTAMGRF NIDDMRLLYE NDIRFLKQF