Gene Information |
Locus tag | Ccel_2930 |
Symbol | |
ID | 7311544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3488779 |
End bp | 3490026 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643609830 |
Product | TPR/glycosyl transferase domain-containing protein |
Protein accession | YP_002507204 |
Protein GI | 220930295 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
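The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3488779
end_bp = 3490026
reported_gene_length = 1248      # bp
reported_protein_length = 415    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both reported lengths agree with the coordinates.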
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000283955 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TACTTATGAT CGCACACCAG TTTCCTCCTA TTGGGGGCTC AGGTGTTCAA AGAACTGTAA AATTTGTTAA ATATTTGAGA AATTTCGACT ATGAACCGAT TATTCTGACC AGGGATGCAT CAAATGCCGC ATTAAAGGAC GAAACACTTT TATCTGACAT TCCCAAAGGA ATAAAGGTTG TCAGAACAAA TGCTTGTGAT TTTGCTGCTC TTCCGGGAAT ATTCAAATAC TTTGGGAAGG TCGTCAACAA GCTTTTGATA CCAGATTCCG AGAGAGTATG GCAGCATTTT GCCAGAAAAC AGGCGTTGGA TGCAGTAAAG GATAACAAAA TAGATGTTAT ATATACCACT TCTGCCCCCT ACAGCGATCA TCTTCTTGGT GTATACCTGA AAAAGCACTA CCCTGAAATT CCTCTGGTTT GCGATTTCAG AGATGAATGG ACCAATAACC CCTATCATGT CAGGAAAGGG TTAAGGGCAA AAATTGAACG AGATCAGGAA AAAATGGTCC TCAAGTATGC TGACTGTCTT ATTACCAATA CTCCGGTAAT GCTTTCAAAT TTTCTAAGGG ACAACCCTGA AACCAAAGGT AAATTTTATG TTATACCAAA CGGCTATGAC GATGAAGATT TTGTTGGTAT GGAAGATATT AAACCTGCAA ACGTCAGGTT TACACTGACC TATACCGGGC TTTTATATGG TAAGAGAAAG CCTGACAATT TCTTTGAAGC TTTAAAAAGA GCTATTGATG AAGGAAGTGT AGATAAGTCC AAAATAAATG TAAGGCTCAT AGGAAATTAT AAGGTTGATC AGCTTCAAGC AGTTATAGAC AGCTATAATT TAAGCGATGT TGTTGCTCTT ATGCCATACA TGAAGCACAG GGAATGTCTA TTGGAATTGG TAAAATCTGA TGCACTTCTT CTTTTAGAAC CGTCAGGCCC CGGTGCCGAA GCTTTTTATA CCGGAAAGGT TTTTGAATAT ATGAATACCA AGCGGCCTAT ACTTGCATCA ATTCCTGAGC GAGGTGCCGC AGCACAGCTT ATAACAGATA CAAAAACAGG TCTGGTTTCA GACTTTAACG ACATTGAAAA TACTAAAAAG AACCTTATTC ACCTTTATAA TTGTTGGGAT AACGGCACGA ACCCAATAAA TCCGGTAATT GAAGAGGTAA AGAAGTTTGA AAGAAAAGAG CTTACAAAGG CATTGGTAGA AGTGTTAAAT AATTCATTTA AAAAGTAA
|
Protein sequence | MKKVLMIAHQ FPPIGGSGVQ RTVKFVKYLR NFDYEPIILT RDASNAALKD ETLLSDIPKG IKVVRTNACD FAALPGIFKY FGKVVNKLLI PDSERVWQHF ARKQALDAVK DNKIDVIYTT SAPYSDHLLG VYLKKHYPEI PLVCDFRDEW TNNPYHVRKG LRAKIERDQE KMVLKYADCL ITNTPVMLSN FLRDNPETKG KFYVIPNGYD DEDFVGMEDI KPANVRFTLT YTGLLYGKRK PDNFFEALKR AIDEGSVDKS KINVRLIGNY KVDQLQAVID SYNLSDVVAL MPYMKHRECL LELVKSDALL LLEPSGPGAE AFYTGKVFEY MNTKRPILAS IPERGAAAQL ITDTKTGLVS DFNDIENTKK NLIHLYNCWD NGTNPINPVI EEVKKFERKE LTKALVEVLN NSFKK
|
| |
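The sequences above are displayed in space-separated groups of ten characters. As a minimal sketch of how they might be processed, the code below strips the grouping, computes a GC fraction, and translates DNA to protein. It uses the standard codon assignments, which, as far as sense and stop codons go, are the same in translation table 11; the tables differ in which codons may act as start codons, which this sketch does not model. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character groups and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three groups of the Gene sequence field, copied from the record.
prefix = "ATGAAAAAGG TACTTATGAT CGCACACCAG"
print(translate(prefix))  # MKKVLMIAHQ
```

The translated prefix matches the first ten-residue group of the Protein sequence field. Passing the full Gene sequence to `gc_fraction` would give a value to compare against the reported GC content.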