Gene Ccel_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1045 
Symbol 
ID7309867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1300307 
End bp1301557 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content37% 
IMG OID643607972 
ProductRadical SAM domain protein 
Protein accessionYP_002505387 
Protein GI220928478 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AAGTACGGCT AATATGTAAT TTGAACTGTA GCAGAAGACA GATGGACATG 
GTAAAGCTGG AATCCTATCT TTCTGCAAAC GGCTATGAGG TGGTTGAAGA TGAAAAACAG
GCGGATCAAA TTGTTTATAC AACATGTGGT TTTATAAATG AAACGGCCCA AGTGGCATTT
AATGAAATAG AAAGGCTGAA ATCACTGCCT GCTGAGCTTA TTGTTACAGG CTGCCTGCCT
GATACAGATT CTGAAACGTT CAATAAGATA CATAGCGGTA AAGTAGTCCG CAATACAGAA
TTATATAAAT TTGACGATGT TTTCGGAGGA GATACTAAAT TTCAGGATAT CCCTGACGCA
CATGATATGC CATGGGGAAA GGGTGAATAC TTTTGTGTCG AGGTTAGCCG AGGGTGTCCT
GAAAATTGTT CATATTGTGC AACAAAATGG GCTGTTGGAA AAATGAAGAG TAAGCCTATA
CAAAAGTGTA TAGAGGAAAT TGAAGAATTC AAAAAAAGTA CGTTTAGTAA GGTCGTAATT
AATGGTGACA ATGTGGGGGC TTACGGGCTT GATATAAAAG AAACCTTTGG TACATTAGTT
TCAGCTCTGC CAATAGAGGA TGAAAAATAC AAGGCATATA TTGATTCATT GCATCCAAGA
TGGCTATTGC TATATTATGA TGCAGTACTG GCGGCAATAA GCAAAAACCG CTTTGGTATG
CTTGTATCTG CTATACAGGC AGGTAATGAG AGGGTCTTGG AGCTAATGCG GCGTAAAGCA
GATATGAAAA AATTAAAGGA GGCTTTTATT GAAATAAAGC AAAAAAGCCC GGAAATAGTT
TTAGGAACTG AAGTTATTGT AGGATTTCCA ACAGAAAGCG AAAGTGACTT TATCGAAAGT
GTAGATTTTA TTTTAAGTAC AAAATTAGAT TGGGGTAATA TTTTCGCCTT CTCACCGAAA
AAAGGAACCG AGGCTGCGGC AATCAAAGGT CAGGTTGAAG AAGCCGAAAA AATAAGAAGA
ATCAATTATC TGGTAGAAAA GCTCAAGGAA AATGGATATT TTATTTTTAA GGAAGAGAAA
TCACAAGCTG TTATTTTTAG CAATGCCGAT ATTTGCATTA ATGCAGACAA GAGCCCGAAC
CCATATTGGC AGACCTGTTT TGACACTGTT TGTCTTGACA GAAAGAAACA GAATCAAATT
CGTAGCGATT TAAGGGAAGG AAAAATAAAA GTCAGTGAAG TAAGTTTCTA A
 
Protein sequence
MSKKVRLICN LNCSRRQMDM VKLESYLSAN GYEVVEDEKQ ADQIVYTTCG FINETAQVAF 
NEIERLKSLP AELIVTGCLP DTDSETFNKI HSGKVVRNTE LYKFDDVFGG DTKFQDIPDA
HDMPWGKGEY FCVEVSRGCP ENCSYCATKW AVGKMKSKPI QKCIEEIEEF KKSTFSKVVI
NGDNVGAYGL DIKETFGTLV SALPIEDEKY KAYIDSLHPR WLLLYYDAVL AAISKNRFGM
LVSAIQAGNE RVLELMRRKA DMKKLKEAFI EIKQKSPEIV LGTEVIVGFP TESESDFIES
VDFILSTKLD WGNIFAFSPK KGTEAAAIKG QVEEAEKIRR INYLVEKLKE NGYFIFKEEK
SQAVIFSNAD ICINADKSPN PYWQTCFDTV CLDRKKQNQI RSDLREGKIK VSEVSF