Gene Ccel_1964 details


Gene Information

Locus tag: Ccel_1964
Symbol:
ID: 7312305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2324270
End bp: 2326048
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 37%
IMG OID: 643608898
Product: Fibronectin-binding A domain protein
Protein accession: YP_002506292
Protein GI: 220929383
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
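
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2324270
end_bp = 2326048
reported_gene_length = 1779      # bp
reported_protein_length = 592    # aa

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported gene length and protein length.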


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATTTG ATGGAATAGT AACAAAATGT ATAGTCAATG AATTAAATGA TTTGCTTTCG 
GGAGGGAGAA TTGACAAGGT ATTTCAACCG GAAAACGATG AAATTGTCAT GATGATTCGT
TCAAAAGGTC AGAATTACAG ACTTGTTGCA AGTGCTAATG CAAGTAATCC AAGACTTCAT
TTGACAACCT TACAGAAGGA AAATCCTGCT GCTCCCCCTG TTTTTTGTAT GCTTATGAGA
AAACATGTTG CAGGCGGAAG GCTTTTGAAT ATAAGCTTTC ATGATTATGA GCGTATCATT
ACATTGAATA TAGAGTCCGT TAATGAGCTT GGGGATCTTA CGGTTAAAAG GCTTGTTGTA
GAAATCATGG GCAAGTACAG TAATATTATC TTACTTAATA GCGAAAATAA AATAATAGAT
TCTGTTAAGC ATGTTGACAG CGATATAAGC AGTGTCAGAG AAATCATGCC TGCCAGAACC
TATCTCCTGC CCCCTGCCCA GAACAAAGAG CTGCCTGAGA ATACGGAAGT TGATAAAATT
TTTAATGAGG AGAACATTAA AGGGGCCAAA CACCCTGAAG GTTTGATTTT AAATACTGTC
AAGGGTTTTA GTCCTTATAC CTGCCGCGAT ATATGTGCTT CCGCAGGGGT TCCTTCTAAA
ACTCCTATAG GTGAGTTGAA CGATTCTGAT AAAGAAAAAA TTAAGGTTGC CCTTGCAAAG
TATATTGATA AAATTAAAAG CAATAATTTC TCGCCCTGTA TTGTATATGA AGATAAAAGC
ATGTTAAGAC CGATTGACTT TTATTGTTTT GAGCCCTCAA AGGAGGTTTT CTACAAAAGC
TATGAACTCT TATCCACAGC CCTCGACCAA TACTACATGC TGCGTGATAC CAATGAGCGT
CTGGGCCAGA AAATGGGTGA TGTTTTAAAG GTTGTTAAAA ACGGTATAGA ACGATGCCAG
AAAAAGACCA CAATGTTTAA TGAAAAGCTT AGGGAAGTAT CAGAAAGAGA CAAGCTTCAA
CTATACGGAG AGTTGATTAC TGCAAATATT TACTGCATTG CTGAAGGTGC AAAATCTGCA
AGAGTACTAA ATTATTACAG TGCAAACGAA GAATATGTTG ATATTCCTTT AAATGAGTAT
AAATCAGCTC AGGATAACGC TCAGAAATAT TTCAAGAAGT ATTCAAAGGC AAAAAGTACT
CACTTAAATG TAACCAAACA GTTGGCTGAA ACCTTGTCAG AGCTTGAATA CCTTCAAAGC
GTTCTTACTA TGCTTGGAAA TTGCAACTCA AGGCAGGAAA TCGATGAAAT AAGGCAGGAA
TTAATCGACC AGGGATACAT TAGACAGTCA TATAAAAATG CCAAAAATAA GCAGGACAAG
CCGTCTTCCC CTTTGGAATT TATATCAAGC GACGGATTTC AGATTTTAGT GGGTAAAAAC
AACAAGCAGA ATGACTTGCT TACGCTAAAG ACAGCAGCCT CTAACGATTT ATGGCTTCAT
ACAAAAAACA TACCCGGTTC ACACGTTATT ATAAGGACTG AGCGTAACAC TGTTCCGGAT
TCAACACTGT TAGAAGCAGC GACCCTTGCT GCGTATCACA GCAGTGCAAA AATGTCTTAC
AATGTTCCTG TAGACTACAC CACTGTAAGA AACGTAAAAA AACCTTCCGG TGCCAAGCCG
GGAATGGTTA TATATGAAAA TTTTAAAACT ATTAATGTTA CACCTGAAGA AGAAAAAGTA
ATGAAAATAA TAAACAATAA AAATATTTTT AGTAAGTAG
 
Protein sequence
MPFDGIVTKC IVNELNDLLS GGRIDKVFQP ENDEIVMMIR SKGQNYRLVA SANASNPRLH 
LTTLQKENPA APPVFCMLMR KHVAGGRLLN ISFHDYERII TLNIESVNEL GDLTVKRLVV
EIMGKYSNII LLNSENKIID SVKHVDSDIS SVREIMPART YLLPPAQNKE LPENTEVDKI
FNEENIKGAK HPEGLILNTV KGFSPYTCRD ICASAGVPSK TPIGELNDSD KEKIKVALAK
YIDKIKSNNF SPCIVYEDKS MLRPIDFYCF EPSKEVFYKS YELLSTALDQ YYMLRDTNER
LGQKMGDVLK VVKNGIERCQ KKTTMFNEKL REVSERDKLQ LYGELITANI YCIAEGAKSA
RVLNYYSANE EYVDIPLNEY KSAQDNAQKY FKKYSKAKST HLNVTKQLAE TLSELEYLQS
VLTMLGNCNS RQEIDEIRQE LIDQGYIRQS YKNAKNKQDK PSSPLEFISS DGFQILVGKN
NKQNDLLTLK TAASNDLWLH TKNIPGSHVI IRTERNTVPD STLLEAATLA AYHSSAKMSY
NVPVDYTTVR NVKKPSGAKP GMVIYENFKT INVTPEEEKV MKIINNKNIF SK
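
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It translates only the first 60 bases of the gene sequence and computes GC content as a percentage. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for sense and stop codons (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical and not taken from any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed above (60 bases).
first_line = "ATGCCATTTG ATGGAATAGT AACAAAATGT ATAGTCAATG AATTAAATGA TTTGCTTTCG"
print(translate(first_line))  # MPFDGIVTKCIVNELNDLLS
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_content` to the full gene sequence would give the value to compare with the reported GC content.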