Gene Ccel_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1303 
Symbol 
ID7312212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1607558 
End bp1609537 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content34% 
IMG OID643608222 
ProductSpore coat protein CotH 
Protein accessionYP_002505637 
Protein GI220928728 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.844218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAA AAAGGTATAA ATATGGTATA TTCATAATTT TTGCCGTTTG TCTGCTTGTA 
GTGATATTTT TCTTAATGTT TATGGAGGAT AGTAACAGCA TTCCGACACA TGGCGGTTTA
AGTCCCAAGG ATTTATCCAT TAATGATATA AAGTACCCGG GTACATTTTT TGATAATTCG
AAAGTCCATA CCATCGATAT TCAGGTTGAT AACCGTACTT GGGAAAACAT GATAGAAAAT
GCCGCATATG AACAATACAT TCCCTGTACT ATGATTATAG ACGGTGCCAG AATTAATGAA
GTTGGGATAA GGCCAAAAGG AGATACTTCT CTCAAACAGG TAATTGATAT GGATTCTCAG
AACTTTAGCT TCAAGGTTGA GTTTGACCAC TATAGAAATC AGACATTTGA TGGTCTTGAT
AAGATGGTAC TCAACAACTG TTTACAGGAT ACAACTTATA TGAAGGACTA CCTGGCACAG
CATATGATGA ACTACATGGG GATTGCGGCA CCATTAACAA GCTATGTAAA TATAAAATTG
AACGGAAAGG ATTTTGGATT TTATCTGGCC ATAGAGGCGG TTGAAAAATC TTTTTGCCTT
AGGAATTTTG GCAGCATTGA CGGAAAGCTA TATAAGCCGG ATGCACTTGA TTTACCAAAA
TACGATTATA TAAAAATTAT GGGTTATGAG ACAGAAGACG GCCAATCAGC AGTCGAGAAC
TTTATAAACA TTATGTCAGG CAATGCATAT AAAGGTTGTG ACAGAAGTAC AAGGGTCGAC
ATGGTTGGCG ATATGGCAGG TTTGCTTATT GATTCAAACG GAATAAACAC CGATGTAACA
GGTTTGACAT ATATTGATAG TAACCCAAAG AGTTATAAAG CTATTTTGGA GTCCAGTGTG
TTTTCCATAA ACGAAAGTGA TGAAAGCAGG TTGATTAATT CAATAAGAAA GCTTAACAGG
GGTGAAGATC TGGATAACAT AGTAGACACT GATTCTTTAA TAAAGTATTT TGTAGTGCAC
AACTTTGTAA ATAATTATGA TGGATATACA AGTGTTTTTT CACATAACTA TTACCTACAT
GAACGTAATG GTAAATTGTC AATGATACCT TGGGATTATA ACCTGGCTTA CGGTTCCTTT
TCAGTTGAGC CCGGAAACTC ATCTTCCAGT CCTTTCGGCA ATTATATTAA AACAACAGAT
GCCCTGTACG GTATGAGTTC TGCAAAAAGC ATGGTTAATT ATCCTATTAA TACACCTGTA
TTTAATACTG ATTTGGAAAA AAGACCTATG ATAAATCAAA TACTTAAAAA CAGTGAGTAT
TCAGATAAGT ATCATCAGTA TTTTGAAAAA TTTATTACTG ACTATTTTAT TAGCGGTTAC
TTTGACAGAT TTTATGAATC CACTGTTGAT ATGATATCAC CATATGTGAA AAATGATAAA
AAGAACTTTC TTACTTATAG CCAGTTTGTA AACGGAATCA ATGAACTTAA TAAATTCTGC
AAACTGAGAG CAAAAAGTGT ACAGGGACAG TTAACTAACG CTATCCCATC GACATTAAAA
GGACAACAAG AGCATCCCGA AGCTCTTGTT GACACACAGG ATCTGGATAT GACTAAAACA
ATTACAACGT ATTCTATACT GGGAATTACA AATGAAGATA TTGATGGGGT ATTAAAAATA
CTTATAAACT ACATTCCGAA AGATTATAAA ACAGACGGTA AAATAGATAT GTCAAAATTC
AAAGCATCAG ATATAACATA TCTAAAAAAA ATCTTCGGGG TAATGGTGCC CTTAGCTTTT
GAGGTCTCCA AGAGCACCAA GGCATCAGAC AATACTACCG TCAATACAAG ACTATCCCGG
ATATTACTAA TTTTGTCATT AATTGCAATG ATTATATTTA CTATATTGGT AAGCAGGTAT
TCACGTGTAA AGTATAAAAA AAGAAAGGTA AGGAGGGAGA AACTTGAAAT TACGTCATGA
 
Protein sequence
METKRYKYGI FIIFAVCLLV VIFFLMFMED SNSIPTHGGL SPKDLSINDI KYPGTFFDNS 
KVHTIDIQVD NRTWENMIEN AAYEQYIPCT MIIDGARINE VGIRPKGDTS LKQVIDMDSQ
NFSFKVEFDH YRNQTFDGLD KMVLNNCLQD TTYMKDYLAQ HMMNYMGIAA PLTSYVNIKL
NGKDFGFYLA IEAVEKSFCL RNFGSIDGKL YKPDALDLPK YDYIKIMGYE TEDGQSAVEN
FINIMSGNAY KGCDRSTRVD MVGDMAGLLI DSNGINTDVT GLTYIDSNPK SYKAILESSV
FSINESDESR LINSIRKLNR GEDLDNIVDT DSLIKYFVVH NFVNNYDGYT SVFSHNYYLH
ERNGKLSMIP WDYNLAYGSF SVEPGNSSSS PFGNYIKTTD ALYGMSSAKS MVNYPINTPV
FNTDLEKRPM INQILKNSEY SDKYHQYFEK FITDYFISGY FDRFYESTVD MISPYVKNDK
KNFLTYSQFV NGINELNKFC KLRAKSVQGQ LTNAIPSTLK GQQEHPEALV DTQDLDMTKT
ITTYSILGIT NEDIDGVLKI LINYIPKDYK TDGKIDMSKF KASDITYLKK IFGVMVPLAF
EVSKSTKASD NTTVNTRLSR ILLILSLIAM IIFTILVSRY SRVKYKKRKV RREKLEITS