Gene Ccel_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3050 
Symbol 
ID7311652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3596097 
End bp3597449 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content39% 
IMG OID643609952 
Productputative phage terminase, large subunit 
Protein accessionYP_002507322 
Protein GI220930413 
COG category 
COG ID 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.368791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCCAGG AGAATTTAAT ACTATCCGAT AAGTACAAAG CATTTATAAG GCATTCTGCA 
CCGGTTGAAT TCTTGGAGGG TACAACAGCA GCAGGTAAAA CCACTGTAGG CATATTTAAA
TTCATGCTGC TGGTAGCTGA GAGCAAAAAG AAATATCACA TCATAGCTGC AAAAGATACC
GGTACTGCTG AAAAGAATAT AATAAACAAG GACCTTGGGA TTGTCGATGA TTTTGGAGTG
CTTACAGAGT ATAACGGTAA CGGAACTAAA GATGAAAAGA TACCCCATAT CCTTTACCAT
ACAAGCAAAG GCGATAAGAT AGTTTATGTG ATGGGGTATG GTGATAAAAA GAAATGGCAA
AAGGCTCTTG GAGGTCAGTA TGGATGCCTT TATATAGACG AAATTAACAC TGCTGATATA
GATTTCGTAA GGGAATCATT CATGCGTGCT GATTATGTAA TGGCAACACT TAATCCGGAT
GATCCTAACC TACCGGTGTA TAAAGAATAC ATAAACTGTA GCCGACCGTT GCCAGAATAT
AAAAATGACG CACCGTCTGA AATAAATAAT ATGCTTAAGG AAGAACCAAA ACCCGGTTGG
GTACATTGGT TCTTTTCTTT TGCACATAAC CTTGGATTGA GTAAAGAAAA AATTGACCAG
ATTATCATGA ATGTACCGAA AGGTACAAAG CTGTATAAGA ACAAAATACA GGGATTACGC
GGCAGGGCAA CAGGGCTTAT ATTTGGCAAC TTTACACGGG GAAAAAATGT TATCACAAGG
GAACAGGCTA AGAAATATAA ATACCTTTAC TTTTCAGCAG GTCTTGATAC ATCATACTCT
CAAGAAAGCC CGGATACCTT TGCCTTTATT TACATAGGGA TAACAGACAA GCGAAAAGTT
GTAGTGCTTG ATGAAGAAGT TTACAACAAT GCTAACCTTG ATATACCTCT GGCACCTTCG
GACATTGCTC CGAGGTTTTT TAAGTTTCTG GAGAAAAATA GAAAGGTCTG GGGATTCGCA
AGAGACACTT TTGTTGATAG TGCTGACCAA GCAACTATTA CAGAGCTTAA TAAATATAAG
CGTGAACATG CTTGTTTGTA CAACTTCCTG AACGCATACA AAAAGATAAC CATTATCGAC
CGTATACATT TACAGCTTGG ATGGATTAAC TGTAACGGTA GCGTGTTTTA CGAAGTTGTC
GAGACCTGTA AAAATCATAT CGGAGAGTTA GAGAGTTATA GCTGGAAGGA AGATAAGTAT
GAGCCTGAGG ATGCAAACGA TCACACGATT AATGCAAGTC AATATGCATG GATACCGTTT
AAAACAAAGA TAGGTGATTA CAAGGAGGCA TAA
 
Protein sequence
MSQENLILSD KYKAFIRHSA PVEFLEGTTA AGKTTVGIFK FMLLVAESKK KYHIIAAKDT 
GTAEKNIINK DLGIVDDFGV LTEYNGNGTK DEKIPHILYH TSKGDKIVYV MGYGDKKKWQ
KALGGQYGCL YIDEINTADI DFVRESFMRA DYVMATLNPD DPNLPVYKEY INCSRPLPEY
KNDAPSEINN MLKEEPKPGW VHWFFSFAHN LGLSKEKIDQ IIMNVPKGTK LYKNKIQGLR
GRATGLIFGN FTRGKNVITR EQAKKYKYLY FSAGLDTSYS QESPDTFAFI YIGITDKRKV
VVLDEEVYNN ANLDIPLAPS DIAPRFFKFL EKNRKVWGFA RDTFVDSADQ ATITELNKYK
REHACLYNFL NAYKKITIID RIHLQLGWIN CNGSVFYEVV ETCKNHIGEL ESYSWKEDKY
EPEDANDHTI NASQYAWIPF KTKIGDYKEA