Gene Ccel_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0454 
Symbol 
ID7309333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp521381 
End bp522529 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content40% 
IMG OID643607384 
ProductNusA antitermination factor 
Protein accessionYP_002504816 
Protein GI220927907 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTG AGTTGATATT AGCTCTTGAA CAGCTGGAAA AGGAAAAGGG TATTAAAAAG 
GAAATAATTA TTGAGGCTAT TGAGGCTGCA CTTATTTCTG CATACAAGAA AAACTTTGGT
TCAGCAATGA ACGTTAAAGT AAATATAGAT AGGGTAACAG GTGATGTAAA AGTTTTTGCA
CTCAGGAAAG TTGCTGAAGA CCCAGATGTC GAGGCAATGG ACATATCAAT AGGAGAGGCT
GCCAAGCTTA ATCCTACACT GGACATAGGA GATTATGTAG AATCTGAAGT TACTCCAAGG
TCCTTTGGAA GAATTGCTGC CCAGACTGCC AAACAGGTAG TAGTTCAAAA ACTAAGAGAA
GCAGAAAGAG GAATCATTTA CGATGAGTTC TACAATAAGG AAAGCGACAT TGTAACAGGA
ATCATTCAAA GGATAGAAAA GAGAAATGTA ATAGTTGACC TTGGGAAAAC TGAAGCCGTT
CTTGGGTCTA CCGAGCAGAC TCCCGGAGAG GAATACAGAT TTAACGAACG ACTGAAGTCA
TATATTGTAG AGGTTAAAAA AACTACAAAA GGTCCTCAGA TTATGCTTTC CAGAACACAT
CCGGGCTTGG TAAAAAGGTT ATTTGAACTG GAAGTACCTG AAATTCATGA CGGTACTGTT
GAAATAAAGA GTATTTCAAG GGAACCGGGG TCAAGGACTA AGCTAGCTGT GTACTCTAAA
GATGAAAATG TTGATCCTGT AGGAGCATGT GTTGGGCAGA AGGGTACCAG GGTTCAGGCT
ATTGTTGATG AACTGAGGGG CGAAAAGATT GATATTATCA AATGGAGTAA TGATCCCAAA
GATTATATAT CCAGCAGTTT AAGCCCCGCT AAGGTTGTGA GGGTAGATGT GGACGAAGAA
GAAAAATCTG CAAAGGTAGT GGTTCCTGAC TATCAGCTTT CATTGGCAAT AGGAAAGGAA
GGCCAGAATG CAAGGTTGGC CGCAAAGCTT ACCGGCTGGA AAATTGATAT AAAGAGCGAA
TCCCAGCTAA GACAGTCAAT TGAGAAACAA CTGTTTGATG ATAGCTTAAA CAACGGATAT
TTGGATGAAA CAGATACTGA CAGTATGAAT TATGATAATG ATGACCATGA AAATAATATA
ATTGATTGA
 
Protein sequence
MSAELILALE QLEKEKGIKK EIIIEAIEAA LISAYKKNFG SAMNVKVNID RVTGDVKVFA 
LRKVAEDPDV EAMDISIGEA AKLNPTLDIG DYVESEVTPR SFGRIAAQTA KQVVVQKLRE
AERGIIYDEF YNKESDIVTG IIQRIEKRNV IVDLGKTEAV LGSTEQTPGE EYRFNERLKS
YIVEVKKTTK GPQIMLSRTH PGLVKRLFEL EVPEIHDGTV EIKSISREPG SRTKLAVYSK
DENVDPVGAC VGQKGTRVQA IVDELRGEKI DIIKWSNDPK DYISSSLSPA KVVRVDVDEE
EKSAKVVVPD YQLSLAIGKE GQNARLAAKL TGWKIDIKSE SQLRQSIEKQ LFDDSLNNGY
LDETDTDSMN YDNDDHENNI ID