Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0720 |
Symbol | |
ID | 4810338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 874656 |
End bp | 875840 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106137 |
Product | aminotransferase, class V |
Protein accession | YP_001037148 |
Protein GI | 125973238 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00090765 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACA ACAGGTTTGT TTATCTTGAT CACGCTGCAA CAACCCCGGT TAAACCAGAG GTACTTGAAG CTATGCTTCC GTACTTTTCA AATAAGTTTG GAAACGCTTC CTCAATTTAC TCTATAGGAA GGGAAAGCAA AAAGGCAATT GAGGAGGCAA GGGAAAAGGT TGCCAAGGCA ATTGGCGCGC TTCCCAGGGA AGTTTTCTTC ACCGGTTCCG GCACCGAAGC AGATAATTGG GCTATCAAAG GTGTTGCATA TGCCAACAGG GACAAAGGCC GGCATATAAT AACCACTGCA ATAGAACATC ATGCAGTACT CCACGCATGT CAGTACCTGG AAAGTGATGG CTTTGAGGTA ACCTACCTGC CTGTTGATGA AAACGGTTTG GTGTCGCCTC AGCAGGTACA AGATGCAATA AGACCTGACA CAATACTCAT TACAATTATG TTTGCCAACA ATGAAATCGG CACCATTCAG CCCATAGCGG AAATAGGTAA AATAGCAAGG GAAAGAGGCG TAATATTTCA TACCGACGCA GTCCAGGCAG TTGGAAATAT ACCCATAAAT GTTGTTGACT TGAATGTTGA CCTTCTGTCA ATGTCAGGCC ACAAGTTTTA TGGACCGAAA GGCGTAGGTG CTCTGTACAT CAGAAAAGGT GTAAAAATCG TATCGTTTAT GCATGGAGGA GCTCAGGAGA GAGGCCGCAG AGCAAGTACT GAAAATGTTG CAGGTATAGT GGGCATGGGT AAAGCCATCG AACTTGCAGT GGAAAATATG GAAGAAAATA ATAAAAAGCT TATTGAATTG AGAGACAGGA CAATCGAGGA AGTAATGAAA AAGATACCTT TTGTAAGGCT GAACGGAGAC AGATACAAGA GACTTCCGGG CAACGTAAAC TTCTCCTTTG AATTTATCGA AGGAGAATCC CTTCTTCTGA TGCTGGACAT GAAGGGTATT GCCGCATCAA GTGGATCTGC ATGTACTTCA GGTTCTCTTG ATCCGTCTCA CGTACTGCTT GCAATAGGTC TTCCTCATGA AATAGCCCAT GGTTCTTTGA GGCTTACTTT CGGTATTGAA AACACACATG AAGATATAGA TTACCTTATG GAAGTACTGC CAACAATTGT TGACAGGCTA AGAGAAATGT CTCCATTATA TGAAAAAGTG AAAAGAGGGA ATTAA
|
Protein sequence | MSDNRFVYLD HAATTPVKPE VLEAMLPYFS NKFGNASSIY SIGRESKKAI EEAREKVAKA IGALPREVFF TGSGTEADNW AIKGVAYANR DKGRHIITTA IEHHAVLHAC QYLESDGFEV TYLPVDENGL VSPQQVQDAI RPDTILITIM FANNEIGTIQ PIAEIGKIAR ERGVIFHTDA VQAVGNIPIN VVDLNVDLLS MSGHKFYGPK GVGALYIRKG VKIVSFMHGG AQERGRRAST ENVAGIVGMG KAIELAVENM EENNKKLIEL RDRTIEEVMK KIPFVRLNGD RYKRLPGNVN FSFEFIEGES LLLMLDMKGI AASSGSACTS GSLDPSHVLL AIGLPHEIAH GSLRLTFGIE NTHEDIDYLM EVLPTIVDRL REMSPLYEKV KRGN
|
| |