Gene Ccel_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1971 
Symbol 
ID7310685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2331249 
End bp2333027 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content36% 
IMG OID643608905 
ProductGerA spore germination protein 
Protein accessionYP_002506299 
Protein GI220929390 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AGAAAGGTTT TATAAAGACC CTGTTTTCTT ATATAACATA TAAAGAAAAA 
AAACCTGTAA AGCAGTTCTA TATTCCAGAA ATAGATAATG AAGTTACTTC TGAGAATAAA
AACGGTCAAG AAAAAAATAC TTCGATGAAA CGGGATGGCT CAAAAAACAG AAAAATTAAA
AGGCCTGTTC CTGTAGCAGA ATCAAGCAGG GAAAACAAGC CTCAATATGA AAAGGCTGAC
GACGAAAAAA TATCAACAAA CATTGAAGAG AACATACAAT ATATCAAGCA TAAGTTTAAT
TTCCCAAGTA ATAAGGATAT CATTATCCGG GAGTTGACCG TTGCAAAAAA ATACAAGGCA
TTTATTGCAT ACATCGACGG TATGGTGGAC AGGATAACCA TTAATAATTT CATTTTAAGA
GCATTAATGG TTAATGATGA TAAATTCCAA GAAGATTCTG ATGATGAATG CAAACTTGAC
TTTATAATGT CCAATATATT GCAGACGAAC CAGGCCAAAA AGGTGGACAG TCCCGATGAA
TTTTTGTACG AAATATTATC GGGTAATACC CTGTTATACG TTAATGGTTG TAACTTTTAT
ATAACCAACG AAACAAAAGG TTATGACAAA AGGGGAGTAG ATAAGCCCCT TATAGAAGGC
GTGGTTCTTG GATCACAGGA AGCATTCAAT GAAAATCTAA GAACAAATGT AACACTTATT
AGAAAATTAA TAAAGAATAA TAACCTTACT ACTGAATTTA TCAAGGTTGG GAATGTAAAC
AAGCAATTAT GTGCAATTAT GTCTATAAAA GGAATAACCA ACCCTGCAAT AGTAGAAGAA
GTTAAAAGAA GAATAAAAAA CATAAAAAGT GATATGGTAC TTGGTGACGG AGTTCTGGAA
CAATTTATAG AGGACAATCC ATATTCGATT TTCCCAACAA TATTGAGTAC CGAAAGACCT
GACCGAGCCG CAGCACATAT CATGGAAGGA AAGGTTGCAA TACTTGCTGA AGGAGCACCC
TTTGCTAAAA TTGTTCCTGT AACTCTTCTT ACTATTATGC ATAGCCCCGA AGATTCATAT
ATGAGATGGC CATATGGTAC TTTAATAAGG TTAATCAGGT TTGTAGCAGC ATTTATTGCT
ACATTACTAC CAGGCATCTA TGTTGCCATT ACCAATTTCC ATCAGGAAAT GATACCTACA
GAGCTTTTAA TTGCGATAGC AAAGGCTAAA GAAAATGTAC CCTTCCCGAC AATAGTGGAA
GTAGTATTAA TGGAATTGTC CTTCGAACTC ATAAGAGAGG CCGGAATAAG AATTCCGGGT
ATAATAGGTA ACACACTGGG TATTATCGGT GCATTGATTC TCGGACAGGC GGCCGTTCAG
GCCAACATTG TAAGTCCCGT TTTGATAATT GTTGTTTCAG TAACGGGACT TGGCAACTTT
GCAATACCCA ATTACAGCAT GGCATTAGCA GCAAGAGTAT CTCGCTTTTG CTTTATTATT
TTGGGAGCAT TACTTGGATT TTATGGAATA AGCATCGGAA TTGCTTTGTT TGCTATACTG
ATTACTAATA TAAAGTCATT TGGAGTACCG TTTTTTGCCC CTATTGCCCC AAAAACTAAA
GAAAGTAATG ACTTGTTCTT TAAAAAACCG GCATGGCAGC AGATATACAG GCCTGATTAT
GTAAATGCTT TAAAACAGAA AAGACAAGCC AAGGTATCTA GACAATGGAC GGATGAAGAG
CCAAAATACG GTTATGAAAG GGATGAAGAG GATGATTAA
 
Protein sequence
MSKKKGFIKT LFSYITYKEK KPVKQFYIPE IDNEVTSENK NGQEKNTSMK RDGSKNRKIK 
RPVPVAESSR ENKPQYEKAD DEKISTNIEE NIQYIKHKFN FPSNKDIIIR ELTVAKKYKA
FIAYIDGMVD RITINNFILR ALMVNDDKFQ EDSDDECKLD FIMSNILQTN QAKKVDSPDE
FLYEILSGNT LLYVNGCNFY ITNETKGYDK RGVDKPLIEG VVLGSQEAFN ENLRTNVTLI
RKLIKNNNLT TEFIKVGNVN KQLCAIMSIK GITNPAIVEE VKRRIKNIKS DMVLGDGVLE
QFIEDNPYSI FPTILSTERP DRAAAHIMEG KVAILAEGAP FAKIVPVTLL TIMHSPEDSY
MRWPYGTLIR LIRFVAAFIA TLLPGIYVAI TNFHQEMIPT ELLIAIAKAK ENVPFPTIVE
VVLMELSFEL IREAGIRIPG IIGNTLGIIG ALILGQAAVQ ANIVSPVLII VVSVTGLGNF
AIPNYSMALA ARVSRFCFII LGALLGFYGI SIGIALFAIL ITNIKSFGVP FFAPIAPKTK
ESNDLFFKKP AWQQIYRPDY VNALKQKRQA KVSRQWTDEE PKYGYERDEE DD