Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2589 |
Symbol | |
ID | 4809011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3060203 |
End bp | 3061561 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108003 |
Product | hypothetical protein |
Protein accession | YP_001038982 |
Protein GI | 125975072 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTA ATTTGGGCAT CATAAAAAAG GCAAAATTTC AAAAAGTTTT ACTTTTATAT TTGCTCTTGA TTTCAGTGGT TCGTCTTGTA ATTTTTTTGG TATCGGAGCC CATTATCTGT CCTGATACAC AGTCTTACGT GGACCTGGCA GAGATGATTT CAAAACTTGA TCTTTCCGAT TTTGACGGAC TTAGAACTCC AGGCTATCCT TTTATTATAT TGTTATCCGG ACTAAATCTG AAAGTTGTAG TTCTTGTTCA GATGATAATG GGAGTCTTTA TTTCATATCT TATAGCCACA ACTGTATATA AATTTACAAA CAATACAGTA TTATCCTTGG TCGCTTCATG CCTTTATTCG CTGTTTATCC CGTTCTTGTT TTATGAAATG GCCATATTAA GCGAAACTAC GGCGGCTTTT TTCATACTCT TAAGTTTTGT CTTGTTTGCC AACATTGATA ACAACCAAAG TAGAATTTTG CCAAAAATGG TGGGTATTAT AATTATTTCA TGTTTTGCCG CGCTAACAAG GCCTATTTAC TTTGTACTGC CGTTGTTGTA TACCGTATTT TTCATGTTTA TTCTTGCAAG AGGAAAATAT AAATTTCCAA AAATATTATT ATATTCTTTT GTAAGTCTGG TACCCTTGTT TGTGTCTATA ATCGGATGGT CATTTGTAAA TTGGAAGTAC AATGACACAT TCAGCATAGC CACGGGCCGC GGGTTCGCTT TCATGGAAAT GGCCGGCGAC TACATTGAAC TTGCTCCTGA CGACTACCCA TACAATGTAA TCAAAGAGGT ATATATAAGA GAAAGGGATA AAAACATCCA AGACGGAAAT CCCCACATCG ACACAATATG GGGAATAACG GATGAGCTTA TGGAAAAAAC CGGGCTCGGT TACGATGAAC TGGCAAAAAA AGTCAAAGAC ATGTGTATAA AGGTTATGTT AAAAAAGCCG GAAATATATA TTAAAGCCGT AATAAGATCC GAAATCAATT TCTGGAAGAC TTTTGGAATC CTTATCCGCA GGGAAACAAA TGTGTCCAGG CCGATCAAGT TTGTAAACAT TATCCAAAGA AGTATTTTGA TTTTATTGCA AATCCCCTTC ATAACAGCAC CATTTGTATT CTTCATAAAT CAGCGAAAAA GAGAAAACAG GCCGGACAGC CATAAAACAC TGCTTATTTT CGCTTTTGCA TATATTTTGA TTGTCGGCGT GTCTTTCCTG ATAGCTCTTG TGGAAGGCGG CGAAGGCAGA TTTGCAATGC CGACATTTCC GCTTCTTATC ATAAGCACTT TCGCACTTTA CAACTTGATT TTACAAGAAA AAAGATTGCA TCATGATATA TCATCATGA
|
Protein sequence | MKLNLGIIKK AKFQKVLLLY LLLISVVRLV IFLVSEPIIC PDTQSYVDLA EMISKLDLSD FDGLRTPGYP FIILLSGLNL KVVVLVQMIM GVFISYLIAT TVYKFTNNTV LSLVASCLYS LFIPFLFYEM AILSETTAAF FILLSFVLFA NIDNNQSRIL PKMVGIIIIS CFAALTRPIY FVLPLLYTVF FMFILARGKY KFPKILLYSF VSLVPLFVSI IGWSFVNWKY NDTFSIATGR GFAFMEMAGD YIELAPDDYP YNVIKEVYIR ERDKNIQDGN PHIDTIWGIT DELMEKTGLG YDELAKKVKD MCIKVMLKKP EIYIKAVIRS EINFWKTFGI LIRRETNVSR PIKFVNIIQR SILILLQIPF ITAPFVFFIN QRKRENRPDS HKTLLIFAFA YILIVGVSFL IALVEGGEGR FAMPTFPLLI ISTFALYNLI LQEKRLHHDI SS
|
| |