Gene Ccel_3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3397 
Symbol 
ID7311959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3939731 
End bp3942013 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content37% 
IMG OID643610301 
ProductS-layer domain protein 
Protein accessionYP_002507665 
Protein GI220930756 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00005416 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACATAA GAAGAAAATT ATCTGTAGTA TTGGCTTTGG TTATTTGCCT ACTATCTTTT 
ACGCAGGCTT TTGCGTTTGG TAATGAGGAT AGTATGACAG TAGACGAAAA AGCAACAGCA
TTGAACAAGA TTTCATTATT AACGGGAACA ACTACAGGTT TCAATCTTGA CGGATATTTG
ACAAGAGCTG AGGCAACCAC TTTTATAGTG AAGCTATTGG GCAAGGCTGA ATATGTTAAC
CAGGAAAAAG ACAGTCTGAG GATATCCCGC TTTCCGGATG TAGTACAGTC TGCATGGTAT
GCTCCATATG TAGGATACTG CAGTGAGAGC GGTATTATTG GTGGATATCC CAACGGGAAT
TTTGGCCCAA ATGACAGATT AAATGAGAAG GCATTTTTGA CAATGGTTTT AAAAGCGATA
GGGTATAGTG AATTTACAGG AAGTGAAGTA TATACAAAGG CATATGAAGC AGGACTTGTG
ACAGACAACT CATATCTTGA TAAGACAGAT GTAAACAGCG AGTATAAGCG TGGTGACGTG
GTAACACTGC TTTATCACAC ACTCAGCATA AGCCCCAATG GTTCTCAGGT ATCAGTTCTG
AAAACACTTG TAAGCAGTGG TGCGGTCAAA ATTAATTCAG CGATATCAGC AGGCTTACTT
TCTGAAGCCG ATAAATTGGA AATAACAGAA GTAATTCCAC AGAGTGCCAT GACACTGATT
GTAAAATTTA ATAAGGAAAT TCAGAAGCTG ACGGCAGATG ATGTCCAAAT AAATACCTTC
GGTAATGACA GTGAAGTACT CAATGCCTCC ATTGTTAAAC AGGAAGGAAA TACACTAACA
GTAAAAACTG CTTTACAGGT TCCGCAAAAG AATTATTATT TTAAAATAAA TAATGTATCG
GATCTGGATG ATAATATTAA GCTCCCGTAT GTTGAGGCGG CATTTCCGGG ATATATACAG
AAGGAGATAC TTTCTGATTT CTTTAAAATA AGCAGTATAA AGCCTGTAAG CAAGAATGCA
ATTAATATCT ATTTTACACA GCCGGTTAAT ATTAATGCCG AAGATCCAAC CTTTTACTCA
ATTTTTGATG GCGATAATGA GATAGTAAAA GGTAGTGCAG GTACTATAGC GGTAAAGGCT
CTAGATTCTC AAAAAAATGC GGTTTCATTG ACACTAAAAG GGATAAGCCT TGCACCAGAC
AAAGAATACA GGATAAAGGT TTCAGGTGAT CTGAACAGTG AATACGGTGT TCGTTTGGGA
GAAGCAGAGG GTGACTCTGG GACATTTATA AGTATAAATG AAGATAATGT GGAATTTGCT
GTAGATAAAA TATATGCATT AAATAGCAAG ACTATAAGGA TTGATTTTAA TAAGCAGGTT
AATCCTACTC TTGCACAGCA GATATACAAT TACAGCCTTA CTTCACAAAG TGATTATCAG
ATTCAGATTA CCAAGGCTGT AGTTGCACCT GACGTGTCCA GCAACGGGAA AAGTATACTT
TTGACCATAA ATGGCGGATT GGATTCCTCC CAGGAATATA AGCTTATGAT AAATAACCTG
AATGATATAT CAAGACAGCA AAGCATTACC GAAAGACAAT ACTCATTTTC CGGGAAATAC
ACTGACAGCG GTGTTTTGAG TGTTTCTGAT ATAAAGGTTA TCGATACCGG TACACTGGAT
ATTTATTTTA ACCGCGAGTT GGATAGTGAA ACAGCTGCGG TAAGCAATAA CTACACATTT
ATAGGTATTA CAAATGCAAT GTATTCATCA ATACCTGTCA AGGCATATTT TGACCCTCAG
CAGCCAAAAA AAGTACGCCT ATTCCTTGGA AGTGACAACC AGTTTAAATA CAAGGATAAT
TACAGGCTTG TTGTGCAGGC TGCTTTGAAG GACTACATGG GTAATCCTGT TGGGACATTG
TTATACGCAT CTTTTCCGTG CAATACAGAT ATAAAAGCAG TCAAACCCAA TATCAGTAAT
GCGGTTATAA TTGCAAAGGA TGCCGTTAAA TTGACCTTTT CAAGAGAGCT ATCTCTAGAA
ACTCCCAACA TACTGAATTC AAACTATGTG TTAGAGTATA GTGATGGTGT AAATACCATT
AAAAAGATAC CAATTTCTGT AAACTATATT AATGCTACTA CAATGATACT CAAGTTTGAT
AAACTAGACT TTAATAACAG ATATACAATT CGGTTTACAT CATTAAAGGA TATCTCAGGA
ATGTATAAGT CATCAGGGTC TGAATTTAAT CCTGTTCAGG TAATAATGGG TTCGGACAAA
TAG
 
Protein sequence
MYIRRKLSVV LALVICLLSF TQAFAFGNED SMTVDEKATA LNKISLLTGT TTGFNLDGYL 
TRAEATTFIV KLLGKAEYVN QEKDSLRISR FPDVVQSAWY APYVGYCSES GIIGGYPNGN
FGPNDRLNEK AFLTMVLKAI GYSEFTGSEV YTKAYEAGLV TDNSYLDKTD VNSEYKRGDV
VTLLYHTLSI SPNGSQVSVL KTLVSSGAVK INSAISAGLL SEADKLEITE VIPQSAMTLI
VKFNKEIQKL TADDVQINTF GNDSEVLNAS IVKQEGNTLT VKTALQVPQK NYYFKINNVS
DLDDNIKLPY VEAAFPGYIQ KEILSDFFKI SSIKPVSKNA INIYFTQPVN INAEDPTFYS
IFDGDNEIVK GSAGTIAVKA LDSQKNAVSL TLKGISLAPD KEYRIKVSGD LNSEYGVRLG
EAEGDSGTFI SINEDNVEFA VDKIYALNSK TIRIDFNKQV NPTLAQQIYN YSLTSQSDYQ
IQITKAVVAP DVSSNGKSIL LTINGGLDSS QEYKLMINNL NDISRQQSIT ERQYSFSGKY
TDSGVLSVSD IKVIDTGTLD IYFNRELDSE TAAVSNNYTF IGITNAMYSS IPVKAYFDPQ
QPKKVRLFLG SDNQFKYKDN YRLVVQAALK DYMGNPVGTL LYASFPCNTD IKAVKPNISN
AVIIAKDAVK LTFSRELSLE TPNILNSNYV LEYSDGVNTI KKIPISVNYI NATTMILKFD
KLDFNNRYTI RFTSLKDISG MYKSSGSEFN PVQVIMGSDK