Gene Cthe_0856 details


Gene Information

Locus tag: Cthe_0856
Symbol:
ID: 4810474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1031425
End bp: 1032495
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 41%
IMG OID: 640106272
Product: branched chain amino acid aminotransferase
Protein accession: YP_001037283
Protein GI: 125973373
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01123] branched-chain amino acid aminotransferase, group II
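
The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1031425
end_bp = 1032495

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1071 356
```

Both values agree with the Gene Length and Protein Length fields reported for Cthe_0856.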


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTACC AAATCAGTAT TCAAAAAACA CAAAATCCAA AAAACAAACC TGACCAGGAT 
AACCTAGGCT TTGGCCAGAT TTTTACCGAT CACATGTTTA TAATGGACTA TACTGAAGGA
AAGGGCTGGC ATGATCCAAG AATTGTTCCA TACGGCCCTC TTTCTTTGGA ACCCAGCACA
ATGGTATTTC ATTATGGTCA GGCAGTTTTT GAAGGTCTGA AAGCTTACAA AACAGAGGAT
GGAAGAATCC TTCTTTTCAG ACCAAGAAAA AACATGGAGA GAATAAATAT TTCAAATGAA
AGAGTTTGCA TACCGAAAAT AGACGTTGAT TTTGCCGTAG AGGCATGCAA AACTCTTGTA
AGTGTCGACA GAGACTGGAT TCCGGAAGCT GAAGGCACTT CTCTTTATAT ACGCCCGTTT
ATAATCTCTA CCGATCCTTT CTTAGGAGTA AGACCGTCCT GGACATACAA ATTCATAATT
ATTCTATCTC CTGTAGGAGC TTATTATAAA GAGGGAATCA ATCCCGTAAA AATATACGTT
GAAAGCGAGT ACGTACGTGC CGTAAAGGGA GGCACAGGTT ATGCAAAGAC TCCCGGCAAC
TATGCCGCAA GTCTCATAGC GCAGGTTAAA GCAAAGGAAC TGGGTTACAC CCAGGTACTC
TGGCTTGACG GAGTTGAGAA AAAGTACATA GAAGAAGTAG GTACAATGAA TGTGTTCTTT
AAAATAAACG GCGAAGTCAT CACTCCTTCC CTTGACGGAA GCATCCTTGC CGGAATAACC
CGTGAATCTA CAATTGAGCT GCTTAGAGCA TCCGGCATAA AAGTAACTGA AAGAAAAATA
ACCATAGAGG AAATTTACAA TGCTCATGAA GCGGGAACTT TAGAGGAGGC TTTTGGAACC
GGAACTGCTG CGGTAATTTC GCCTATCGGC GAATTAAGCT GGAACGGCAA GGTTATAAAA
ATAAATGACG GTAAAATCGG AGAAACAGCC TCGTTTGTTT ACAATACCAT CACAGGCATT
CAAAGCGGCA AGATCGAAGA CAAGTTCGGA TGGACCGTTG AAGTAAAATA A
 
Protein sequence
MSYQISIQKT QNPKNKPDQD NLGFGQIFTD HMFIMDYTEG KGWHDPRIVP YGPLSLEPST 
MVFHYGQAVF EGLKAYKTED GRILLFRPRK NMERINISNE RVCIPKIDVD FAVEACKTLV
SVDRDWIPEA EGTSLYIRPF IISTDPFLGV RPSWTYKFII ILSPVGAYYK EGINPVKIYV
ESEYVRAVKG GTGYAKTPGN YAASLIAQVK AKELGYTQVL WLDGVEKKYI EEVGTMNVFF
KINGEVITPS LDGSILAGIT RESTIELLRA SGIKVTERKI TIEEIYNAHE AGTLEEAFGT
GTAAVISPIG ELSWNGKVIK INDGKIGETA SFVYNTITGI QSGKIEDKFG WTVEVK
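
The gene and protein sequences above can be checked against each other by translating the coding sequence. The sketch below is one hedged way to do that in plain Python, not the database's own method. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for illustration.

```python
# Standard codon assignments listed in TCAG order. Assumption: translation
# table 11 maps codons to the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "ATGTCCTACC AAATCAGTAT TCAAAAAACA CAAAATCCAA AAAACAAACC TGACCAGGAT"
print(translate(first_line))  # MSYQISIQKTQNPKNKPDQD
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is a quick way to confirm that the two fields agree. `gc_fraction` on the full gene sequence can likewise be compared with the GC content field.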