Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2922 |
Symbol | |
ID | 7311537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3480348 |
End bp | 3482225 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643609822 |
Product | hypothetical protein |
Protein accession | YP_002507196 |
Protein GI | 220930287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00129272 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAACTAAGCC GTCACCTAAA CTTAATAAAA GAATTGATAA TCTGGTCAGA GTTGTTTTAG TTCTGTCAAT TGCCATAATA GTCTTTTTCT TCGCAATCCG GCCTATGATT TTTAAAAAAG ACTATTACTT TGACTCCGGT GATGATGTTA CCCTTGAAAA TTTAGGAACT GATAAAGGTA CAGATATTTA TGATATAGCT GTAGTTGGTG ATGGTATTGA CGGAATAAGT GCCGCGCTGG GAGCTGCTGG TGTGGGTGCC AAAACTATTC TTATATGTTC CGATAAGGAG CTCGGAAGTC AGATTGGAGA GTCTTTTAAC ACTAGCTGGT CACCTGATGT TACTCCAACA GGTAATAACG TAAGCTCTGA CATATTCAAA GAAATCCGAT ACAAATCCGG TGAAGGTATA AATATAGGTA ATTATTTAAA AACAATCAAG GAAATGGTTT CTGAGAAAAA GAACCTTGAC GTAATGTATG AATCCCAAAT AACTGACATG GGAGTTCAGA GTGGTCAGGT GCAAAATATA CAGTTTGACA CTCCTAAAGG TAAAAGGACT GTAAAAGCAA AACAGTACAT AGACGCTACA AACGATGGAG AAATTCTTAA ACGTGCCAAT GTACCGTATA GCGTCGGATA TAGCGATATT GGCATAAAAC AGCTGTTTCC CCCTATTTAC CTGAATTTTA TGGTATCGGG AGTCGATTAT AAAGAGATTG AAAAATTAAT GAAGGAGCAG AAAATGCTTG TTAACAGCAT TCTGAAACAA TATAACACAA GTAATTCAAA TGTAAGTGTT TCAGGATTTA ATATTTCAGA TCAGGGTAAC AGTAAGGTAC TGATTGAAGG AATTACCGTC AAAAATGTAG ATTTACAAAA TGAGAAGAAG ATTCAGGAAT ATTACAATAT TGCTTCAAAA GAATGTATGG ACTTGTTCCA ATTTCTAAAG CTAAACCTGG AACCATTTAA AGATGCTGGA GGATTCAGCG TAGCGGCACA ATTTGTTAAG CCTTCAGCCT ACCACTTCAA GGGACTTTAT AACCTAACAC TTGGGGATAT ACTTACAGGT AAGAGATTTA GTGACAGAAT CAGCACGGCA TCAAGACCTG TAACAATGAC AATGGAGGAC GGAAACGGAT ATATCCTCTG TAATCCCAAG ATATTTTACA TACCTCTTCG GTCCATAATT CCTCAAGGTC TAACTAATGT ACTTATGACA GGTGACAAAA TATCCTGTTC ATCTCTGGTA CAGAGTGCCA TAAATTCAAA TTCCAGTAAG TCCGGTTCTG GATATGCCGC AGGGATAATC TCCGCCTACA GCATTTCAAA GAGTATAGAT ATTCCCCATA TAGTAGAGGA TTACAATCTG GACACGCAGG CTGAATTGGA GAAGGTATTG CGGAAAAAGG GCATATTCAT GTCAGATATT ACAGAGGACT TGACAAGTAT AACGGAAAAC TGGAGTTATC CTTACGCAGA AAAGCTTATA AACATAGGGT TGCTTAGTGC AGGTATCACA AATGACATGA AATTCAATAA GAAAGCAAAG AGTCAGGATT TTGCCTATGT AATACTAAAT GGTGTGGTCA GGACTTCACC CGATAAATAC AATTATGATT TTGATACAAT CCTTAGAGCA TATCTAAAGG ATGAGCCGTT AACCAAGGAT AAGCTGGCAC AAATACTTTT GGATGTAGCG GGGAAGAAGA CTTCAGGCGA AAATTATTAT GATGATGCCC GTAAACAAGG GCTTGTTGAT GAAACTCTTG AACAGAAGCT CAAGAATAAA TCACATGTGG AGTACTCTGA TATGTATTAT GCAGCTGTTA AGGCTATTGA AAAGATAACA GGAAAGCCAA TGAATTAG
|
Protein sequence | MKKKTKPSPK LNKRIDNLVR VVLVLSIAII VFFFAIRPMI FKKDYYFDSG DDVTLENLGT DKGTDIYDIA VVGDGIDGIS AALGAAGVGA KTILICSDKE LGSQIGESFN TSWSPDVTPT GNNVSSDIFK EIRYKSGEGI NIGNYLKTIK EMVSEKKNLD VMYESQITDM GVQSGQVQNI QFDTPKGKRT VKAKQYIDAT NDGEILKRAN VPYSVGYSDI GIKQLFPPIY LNFMVSGVDY KEIEKLMKEQ KMLVNSILKQ YNTSNSNVSV SGFNISDQGN SKVLIEGITV KNVDLQNEKK IQEYYNIASK ECMDLFQFLK LNLEPFKDAG GFSVAAQFVK PSAYHFKGLY NLTLGDILTG KRFSDRISTA SRPVTMTMED GNGYILCNPK IFYIPLRSII PQGLTNVLMT GDKISCSSLV QSAINSNSSK SGSGYAAGII SAYSISKSID IPHIVEDYNL DTQAELEKVL RKKGIFMSDI TEDLTSITEN WSYPYAEKLI NIGLLSAGIT NDMKFNKKAK SQDFAYVILN GVVRTSPDKY NYDFDTILRA YLKDEPLTKD KLAQILLDVA GKKTSGENYY DDARKQGLVD ETLEQKLKNK SHVEYSDMYY AAVKAIEKIT GKPMN
|
| |