Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3397 |
Symbol | |
ID | 7311959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3939731 |
End bp | 3942013 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610301 |
Product | S-layer domain protein |
Protein accession | YP_002507665 |
Protein GI | 220930756 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00005416 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACATAA GAAGAAAATT ATCTGTAGTA TTGGCTTTGG TTATTTGCCT ACTATCTTTT ACGCAGGCTT TTGCGTTTGG TAATGAGGAT AGTATGACAG TAGACGAAAA AGCAACAGCA TTGAACAAGA TTTCATTATT AACGGGAACA ACTACAGGTT TCAATCTTGA CGGATATTTG ACAAGAGCTG AGGCAACCAC TTTTATAGTG AAGCTATTGG GCAAGGCTGA ATATGTTAAC CAGGAAAAAG ACAGTCTGAG GATATCCCGC TTTCCGGATG TAGTACAGTC TGCATGGTAT GCTCCATATG TAGGATACTG CAGTGAGAGC GGTATTATTG GTGGATATCC CAACGGGAAT TTTGGCCCAA ATGACAGATT AAATGAGAAG GCATTTTTGA CAATGGTTTT AAAAGCGATA GGGTATAGTG AATTTACAGG AAGTGAAGTA TATACAAAGG CATATGAAGC AGGACTTGTG ACAGACAACT CATATCTTGA TAAGACAGAT GTAAACAGCG AGTATAAGCG TGGTGACGTG GTAACACTGC TTTATCACAC ACTCAGCATA AGCCCCAATG GTTCTCAGGT ATCAGTTCTG AAAACACTTG TAAGCAGTGG TGCGGTCAAA ATTAATTCAG CGATATCAGC AGGCTTACTT TCTGAAGCCG ATAAATTGGA AATAACAGAA GTAATTCCAC AGAGTGCCAT GACACTGATT GTAAAATTTA ATAAGGAAAT TCAGAAGCTG ACGGCAGATG ATGTCCAAAT AAATACCTTC GGTAATGACA GTGAAGTACT CAATGCCTCC ATTGTTAAAC AGGAAGGAAA TACACTAACA GTAAAAACTG CTTTACAGGT TCCGCAAAAG AATTATTATT TTAAAATAAA TAATGTATCG GATCTGGATG ATAATATTAA GCTCCCGTAT GTTGAGGCGG CATTTCCGGG ATATATACAG AAGGAGATAC TTTCTGATTT CTTTAAAATA AGCAGTATAA AGCCTGTAAG CAAGAATGCA ATTAATATCT ATTTTACACA GCCGGTTAAT ATTAATGCCG AAGATCCAAC CTTTTACTCA ATTTTTGATG GCGATAATGA GATAGTAAAA GGTAGTGCAG GTACTATAGC GGTAAAGGCT CTAGATTCTC AAAAAAATGC GGTTTCATTG ACACTAAAAG GGATAAGCCT TGCACCAGAC AAAGAATACA GGATAAAGGT TTCAGGTGAT CTGAACAGTG AATACGGTGT TCGTTTGGGA GAAGCAGAGG GTGACTCTGG GACATTTATA AGTATAAATG AAGATAATGT GGAATTTGCT GTAGATAAAA TATATGCATT AAATAGCAAG ACTATAAGGA TTGATTTTAA TAAGCAGGTT AATCCTACTC TTGCACAGCA GATATACAAT TACAGCCTTA CTTCACAAAG TGATTATCAG ATTCAGATTA CCAAGGCTGT AGTTGCACCT GACGTGTCCA GCAACGGGAA AAGTATACTT TTGACCATAA ATGGCGGATT GGATTCCTCC CAGGAATATA AGCTTATGAT AAATAACCTG AATGATATAT CAAGACAGCA AAGCATTACC GAAAGACAAT ACTCATTTTC CGGGAAATAC ACTGACAGCG GTGTTTTGAG TGTTTCTGAT ATAAAGGTTA TCGATACCGG TACACTGGAT ATTTATTTTA ACCGCGAGTT GGATAGTGAA ACAGCTGCGG TAAGCAATAA CTACACATTT ATAGGTATTA CAAATGCAAT GTATTCATCA ATACCTGTCA AGGCATATTT TGACCCTCAG CAGCCAAAAA AAGTACGCCT ATTCCTTGGA AGTGACAACC AGTTTAAATA CAAGGATAAT TACAGGCTTG TTGTGCAGGC TGCTTTGAAG GACTACATGG GTAATCCTGT TGGGACATTG TTATACGCAT CTTTTCCGTG CAATACAGAT ATAAAAGCAG TCAAACCCAA TATCAGTAAT GCGGTTATAA TTGCAAAGGA TGCCGTTAAA TTGACCTTTT CAAGAGAGCT ATCTCTAGAA ACTCCCAACA TACTGAATTC AAACTATGTG TTAGAGTATA GTGATGGTGT AAATACCATT AAAAAGATAC CAATTTCTGT AAACTATATT AATGCTACTA CAATGATACT CAAGTTTGAT AAACTAGACT TTAATAACAG ATATACAATT CGGTTTACAT CATTAAAGGA TATCTCAGGA ATGTATAAGT CATCAGGGTC TGAATTTAAT CCTGTTCAGG TAATAATGGG TTCGGACAAA TAG
|
Protein sequence | MYIRRKLSVV LALVICLLSF TQAFAFGNED SMTVDEKATA LNKISLLTGT TTGFNLDGYL TRAEATTFIV KLLGKAEYVN QEKDSLRISR FPDVVQSAWY APYVGYCSES GIIGGYPNGN FGPNDRLNEK AFLTMVLKAI GYSEFTGSEV YTKAYEAGLV TDNSYLDKTD VNSEYKRGDV VTLLYHTLSI SPNGSQVSVL KTLVSSGAVK INSAISAGLL SEADKLEITE VIPQSAMTLI VKFNKEIQKL TADDVQINTF GNDSEVLNAS IVKQEGNTLT VKTALQVPQK NYYFKINNVS DLDDNIKLPY VEAAFPGYIQ KEILSDFFKI SSIKPVSKNA INIYFTQPVN INAEDPTFYS IFDGDNEIVK GSAGTIAVKA LDSQKNAVSL TLKGISLAPD KEYRIKVSGD LNSEYGVRLG EAEGDSGTFI SINEDNVEFA VDKIYALNSK TIRIDFNKQV NPTLAQQIYN YSLTSQSDYQ IQITKAVVAP DVSSNGKSIL LTINGGLDSS QEYKLMINNL NDISRQQSIT ERQYSFSGKY TDSGVLSVSD IKVIDTGTLD IYFNRELDSE TAAVSNNYTF IGITNAMYSS IPVKAYFDPQ QPKKVRLFLG SDNQFKYKDN YRLVVQAALK DYMGNPVGTL LYASFPCNTD IKAVKPNISN AVIIAKDAVK LTFSRELSLE TPNILNSNYV LEYSDGVNTI KKIPISVNYI NATTMILKFD KLDFNNRYTI RFTSLKDISG MYKSSGSEFN PVQVIMGSDK
|
| |