Gene Information |
Locus tag | Ccel_2933 |
Symbol | |
ID | 7311547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3491121 |
End bp | 3492497 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643609833 |
Product | O-antigen polymerase |
Protein accession | YP_002507207 |
Protein GI | 220930298 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000205816 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA AAACTGCTGT AATATTATCA ATATTGGGAT TGCTGGCAGG GGTATGCGGA GCTTTTGCAC CGACCTTTCT GCTGATTGCA TTTATTGCAG GCGTAATATT TACTGCCGCA ATGATGTTTG ATTATTCAAA ATTCCTATAT GTTCTAGGGT TTTATGTTTT AATTGACTAC GTATTGAGGT ATGTCATTTC AAGTGCTTTG CTTGCAAGCT TCTGGGATGA ACTACTGTTT ATATTTGCAC TGTGCTTGTG GCTCTACAAA TGGGTAGTTT ACCGTAAACA GGATGGCTAT GTTGCAACAC CTTTGGATAT ACCTCTTGTA TTTTTTGTGG TAATCTCCAT ATGTTTGCTT CTTGTTAATT CACCTGTTAT GGCCATCGGA ATTGCCGGAT TCAGACAGGT TATACAGCAG ATGCTATGGT ATTTTATTGT TGCACAGCTT GTAACCTCCA GCAGAAACAT CAGATGGTTT TTGTACATTA TGGTTTTTAT CGGAGGACTT CTTGGGCTTC ATGGTATTTA CCAATATGTT ACCCATGCAG AGATGCCGTC TTATTGGGTT GACAGGCTTG AATCGGGTAT AACAACAAGG GTTTTCTCCA TTATTGGAAG CCCTAATATT CTGGGAAGCC TTATGGTACT GCTGATACCC GTTGCGATTT CATTTGTTTT CAGTGAAAGA AAGATTTTCA AGAAAATCAT ATTTACCGGA ATAACCCTTG CAATGTCTGC AACCCTTATA TTCACATCCT CAAGGTCCGC TTGGATAGGA TTTGTAGTAG CAATGGGCGT GTATTTCTGG CTTAAAGACA AGAGGCTTAT TTTATTGCTG GCTCTGCTGG TTGTTGCCGC ATATTTTGCA GTACCTACCA TTGCACACAG AGTAAACTAT CTGTTAAGTC CTCAATATAT GATAAGCAGT GCAGCTGGTG GAAGAATTGC AAGATGGTCC ATAGGTATTG CGGCACTTCA GCAGCATCCA TGGTTCGGCC TCGGACTTGG ACAATTCGGC GGTGCAGTTG CACAGAACTT TAAGATACCG GGTGCATTCT ATGTCGACAG CTATTTTTTA AAGATTGCTG TTGAAATGGG TATTGTTGGC TTAACATCCT TCTGTATTCT TATATATAAC GCTCTGGCAT GGGGAATACG TGCGGTTAAG AGAACAGCAG ACCGACAAAG CCTCAGCATG GCACAGGGTG TTTTTGCGGG AATGGTTGGC ATTGTGGTTC CGAACTTTGT AGAAAATGTA TTTGAGGTTC CAATGATGAC AGCTTATTTC TGGATGTTTG CAGCCGTCCT GATAGCCCTC GGCTTCACCC TTCCTAATAA GGGGTTAAAC CGTCTTAACG TAGGAAGTAT AAGATAG
|
Protein sequence | MNKKTAVILS ILGLLAGVCG AFAPTFLLIA FIAGVIFTAA MMFDYSKFLY VLGFYVLIDY VLRYVISSAL LASFWDELLF IFALCLWLYK WVVYRKQDGY VATPLDIPLV FFVVISICLL LVNSPVMAIG IAGFRQVIQQ MLWYFIVAQL VTSSRNIRWF LYIMVFIGGL LGLHGIYQYV THAEMPSYWV DRLESGITTR VFSIIGSPNI LGSLMVLLIP VAISFVFSER KIFKKIIFTG ITLAMSATLI FTSSRSAWIG FVVAMGVYFW LKDKRLILLL ALLVVAAYFA VPTIAHRVNY LLSPQYMISS AAGGRIARWS IGIAALQQHP WFGLGLGQFG GAVAQNFKIP GAFYVDSYFL KIAVEMGIVG LTSFCILIYN ALAWGIRAVK RTADRQSLSM AQGVFAGMVG IVVPNFVENV FEVPMMTAYF WMFAAVLIAL GFTLPNKGLN RLNVGSIR
|
| |
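The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The function names and the partial codon table (only the codons found in the first 30 bases of the gene sequence) are hypothetical and introduced for illustration only.

```python
# Sketch: consistency checks for the fields of this gene record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the protein length does not count the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Partial codon table: only the codons needed for the first 30 bases.
# These assignments are the same in the standard code and in table 11.
CODONS = {
    "ATG": "M", "AAT": "N", "AAA": "K", "ACT": "T", "GCT": "A",
    "GTA": "V", "ATA": "I", "TTA": "L", "TCA": "S",
}


def translate_prefix(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(gene_length(3491121, 3492497))   # Start bp, End bp from the record
    print(protein_length(1377))            # Gene Length from the record
    print(translate_prefix("ATGAATAAAA AAACTGCTGT AATATTATCA"))
```

Under these assumptions the computed values agree with the Gene Length (1377 bp) and Protein Length (458 aa) fields, and the translated prefix matches the first ten residues of the protein sequence.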