Gene Ccel_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3000 
Symbol 
ID7312434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3551485 
End bp3552525 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content38% 
IMG OID643609904 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002507274 
Protein GI220930365 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0114429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAAAA AAGTAACAAT GGATGATATT GCAGAGAAGC TTGGTATATC AAAAAATACA 
GTATCTCTAG CACTAAGAGG CATGCCTGGT ATAAGCGAAA GTACAAGAAA AGTAATTGAG
CAGACTGCAA GGGAAATGGG CTACACGTAT AAAGTGTCAG CACGTAAAAA CACTATGGCA
CGAAATCTTT GTCTCATTAT TGCAAAAAGC ACACGTGATT CCATAGGTTT CTTTAGTTAT
GTACAGTTAG GAATAGAAGA TGAAGCAAAG AAAAATAACT TAAACACAAT AATACACTAT
TACGACGAAA ACGTTCAAGG GTTTGAGACT CCCAACTGCG TAAAGGATGG TATGGTTTCA
GGAATAATTA CTCTGGGAAG AATTTCTCGT GAAACAATTA ACTGCATAGT AGGATATAAC
CTTCCTGTTG TTATGGTAGA CAACTATTTC GATAACCTAT CCATGGATTA TATACTCACG
GACAACCATT CAGGCGGATA TGCTGCTACG GAGTATCTTA TAGACTGCGG ACATACCAAA
ATAGGATTTT TAGGTGATAT CTCCGCATCA ATAAGCTTTT ATGACAGGTA TCAGGGGTTT
TTAAAAGCTC TAAGAGATCG GGGAATTGAA ATTAACGAAG GTTATTCGAT AACTGATAAA
AAGCTTGAAG AATTGCCTCA AGAAGATATA ACTGGGCTAG TCAACGAAAT CAGAACCAAG
GCAGGCCTCC CAACCGCTTT TTTTTGCTGC AATGATGCGG AGGCTATTGT AATTATAAAA
GTGCTGAAAA ACATAGGTGT ATTAGTACCG AATAAAATTT CAATCATAGG CTTTGACGAT
ATAGAAAATG CCGCAAATGT TACCCCTGAA TTAACTACAA TGAGAGTGCA GAAGGAGATT
ATGGGTAAAG GAGCAGTTTG CAAGCTTATG GAAAAATTGG AACAAGAAAT TAAGTCCTCT
GAAAAGATAT TGCTGTCAGC CTGTCTTATC AAAAGAAATT CGGTTAATCG TTCGGATATG
GCGTTTCATG GCTCGTGCTG A
 
Protein sequence
MSKKVTMDDI AEKLGISKNT VSLALRGMPG ISESTRKVIE QTAREMGYTY KVSARKNTMA 
RNLCLIIAKS TRDSIGFFSY VQLGIEDEAK KNNLNTIIHY YDENVQGFET PNCVKDGMVS
GIITLGRISR ETINCIVGYN LPVVMVDNYF DNLSMDYILT DNHSGGYAAT EYLIDCGHTK
IGFLGDISAS ISFYDRYQGF LKALRDRGIE INEGYSITDK KLEELPQEDI TGLVNEIRTK
AGLPTAFFCC NDAEAIVIIK VLKNIGVLVP NKISIIGFDD IENAANVTPE LTTMRVQKEI
MGKGAVCKLM EKLEQEIKSS EKILLSACLI KRNSVNRSDM AFHGSC