Gene Cthe_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1866 
Symbol 
ID4809197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2212017 
End bp2213219 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content43% 
IMG OID640107285 
Productacetylornithine aminotransferase 
Protein accessionYP_001038280 
Protein GI125974370 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4992] Ornithine/acetylornithine aminotransferase 
TIGRFAM ID[TIGR00707] acetylornithine and succinylornithine aminotransferases
[TIGR01885] ornithine aminotransferase 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTGG AAGAAATTAT CAACCTGGAT AAAAAATACT TCATGAATAC ATTCGGAAAC 
AGAACCCCTG TATGTTTCTC CCACGGAAAA GGAATCAACC TTTGGGACAT AAACGGAAAA
AAGTACTACG ACTTTCTTGC CGGCATAGCG GTAAATGCTT TGGAACATTC ACACCCCAAA
CTTGTGAATG CCATAAAGCA GCAGGCTGAA AAGCTCATAC ATTGCTCCAA CCTCTACTAC
ATAGAATCCC AGGCAAAACT TGCGGAAAAA CTTGTGAGCA TATCCTGCGC CGACAAAGTA
TTCTTTGCAA ACAGCGGCGC GGAAGCAAAC GAAGGAGCCA TAAAGCTTGC CCGCATCTAT
TTTAAAAAGA AAGGAATGCC GGAAAAGTAC GAAATTATAA CTTTGGAAAA GTCTTTCCAC
GGAAGAACCC TTGCCACCAT AGCTGCAACC GGGCAAGACA AATATCAAAA ACCTTACTGT
CCTCTCACTC CTAAATTTCT AAAAGTCCCG ATAAACGATC TTGAAGCTCT TGAAAAAGCA
ATAAACAGCT CAACCTGCGC AGTAATGATT GAACCTATAC AGGGTGAAAG CGGAGTAAAT
CTTACCTCGG TGGAATATAT GAAAGGCGTT CGCAAGCTCT GTGATGAAAA AGGCATACTC
CTTATATTTG ATGAGGTTCA GTGCGGTTTG GGCCGTACAG GAAAGCTCTT TGCATATGAA
CATTTCGGTG TGGAGCCTGA CATATTCACC CTTGCCAAAG CCCTGGGAGG AGGATTCCCA
ATAGGTGCGC TCTGCGCAAA AGAACATGTG GCAAGCGCTT TTGAACCGGG AGATCACGGT
TCAACCTTTG GAGGCAATCC TCTTGCATGT ACAGCTGCAT TGGCTGCTCT GGATGTCATA
ATAGAAGAAG GACTCGTAGA AAATTCAGCA AAAATGGGAA CCTACTTTAT GAGCAAGCTT
TCGGAACTTG CCGAAAAATA CAGCATCATT CAGGAAGTAA GAGGCAAAGG CCTTATGATA
GGTGTTCAGC TTTCAATAGA TGCAGCAGTG GAAATCAAAA ACAAATGCTT TGAAAAAGGC
TATTTGATTG GAAGCATCGG CAACAACATC TTAAGGATGC TGCCCCCATT GATTGTTACA
GAACAGGACA TTGACGGCAT GATAGACACA CTGGACAGTG TTTTTCAAGA ATATCAGATT
TAA
 
Protein sequence
MQLEEIINLD KKYFMNTFGN RTPVCFSHGK GINLWDINGK KYYDFLAGIA VNALEHSHPK 
LVNAIKQQAE KLIHCSNLYY IESQAKLAEK LVSISCADKV FFANSGAEAN EGAIKLARIY
FKKKGMPEKY EIITLEKSFH GRTLATIAAT GQDKYQKPYC PLTPKFLKVP INDLEALEKA
INSSTCAVMI EPIQGESGVN LTSVEYMKGV RKLCDEKGIL LIFDEVQCGL GRTGKLFAYE
HFGVEPDIFT LAKALGGGFP IGALCAKEHV ASAFEPGDHG STFGGNPLAC TAALAALDVI
IEEGLVENSA KMGTYFMSKL SELAEKYSII QEVRGKGLMI GVQLSIDAAV EIKNKCFEKG
YLIGSIGNNI LRMLPPLIVT EQDIDGMIDT LDSVFQEYQI