Gene Cthe_2883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2883 
Symbol 
ID4809090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3409985 
End bp3411061 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content41% 
IMG OID640108302 
Producthistidinol phosphate aminotransferase 
Protein accessionYP_001039274 
Protein GI125975364 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.017474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAGG ATTTGGTAAG ACCTGAGCTG CAAAAATTAG TACCGTATGT TTCCCATCAG 
GTCCCGTACA GAATAAAGCT TGACGCAAAT GAAAGTCCCT TCGAGCTTCC TGAAAGTATC
AGGAAGAAAC TGGCGGACTA CTTTTTAAAA GGGCCGGGTT TAAACATTTA TCCAGATAAT
GAGTCTGTGG AGCTTAGAAA GACCATCGCA AAATACTGGA ATGTTGATGC GGATGAGGTT
ATCGTGGGTA CCGGTTCCAA CCAGCTTATT CAGCTGATAA TAACAGTGTT TGTGGGGAAA
GGGGAGAAGG TGCTGTATCC CTGGCCCACG TTTTCGATGT ATAAAATAAA CACTCTGATA
GCGGGCGGTG AACCGGTGGC ATTTCCCCTT GACAAGGAAA AAGACTTTGT TTTGGATACC
GACAAGTTTA TTGAGGCTGT AAAAACGGAA AATGCCAAGG TGGTATTCCT TTGCAATCCC
AACAATCCGA CAGGAGGGCT TGTTCCTCTG GAGCATATTG AAAAAATAGT GAAAGAGTGC
AAAAGTTCAA TTATTGTTGT TGATGAAGCG TATGCCGAAT TTTGCCCGCA GAGTGTGATA
CCTCTTGTAA AGAAGTATGA AAACCTTGTG GTGCTGCGCA CTTTCTCCAA AGCGTATCTT
CTTGCCGGAG CAAGGTGCGG ATATTCCATA AGCGGAATTG AGATTGCAAA TGAGATAAAC
AAAGTAAGAC CGACATACAA TGTAAGTTCA TTGACCCAGC TTATAGCAAA AATGGTGTTT
GAGGAACAGG AAGAGATGCA GAAAATGATA CGGTATTTGA TTGAGCAAAG AGGAGAGCTG
GAGAAATCTT TGAAAAAGAT AAAGGATGTC TGCATTTATC CTTCCCATGC AAATTATATA
CTTGTCAAAG TGCCTGAAGC TGAGATGATA AGCAAGGAAC TTCAAAAGCG GGGAATACTC
ATCCGAAGCT TTCCCAATGA TCCGGTGCTT TACGACTGTA TCAGGATAAC TGTAGGAACC
AAAGAACAAA ATGATATTTT CTTAGAGGAG TTTTCCGACA TAATTAATAA TTTTTAA
 
Protein sequence
MIKDLVRPEL QKLVPYVSHQ VPYRIKLDAN ESPFELPESI RKKLADYFLK GPGLNIYPDN 
ESVELRKTIA KYWNVDADEV IVGTGSNQLI QLIITVFVGK GEKVLYPWPT FSMYKINTLI
AGGEPVAFPL DKEKDFVLDT DKFIEAVKTE NAKVVFLCNP NNPTGGLVPL EHIEKIVKEC
KSSIIVVDEA YAEFCPQSVI PLVKKYENLV VLRTFSKAYL LAGARCGYSI SGIEIANEIN
KVRPTYNVSS LTQLIAKMVF EEQEEMQKMI RYLIEQRGEL EKSLKKIKDV CIYPSHANYI
LVKVPEAEMI SKELQKRGIL IRSFPNDPVL YDCIRITVGT KEQNDIFLEE FSDIINNF