Gene Ccel_3349 details


Gene Information

Locus tag: Ccel_3349
Symbol:
ID: 7311916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3890856
End bp: 3891962
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 41%
IMG OID: 643610252
Product: hypothetical protein
Protein accession: YP_002507618
Protein GI: 220930709
COG category:
COG ID:
TIGRFAM ID:
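
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions that happen to fit the values in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3890856, 3891962)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```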


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACCG TTGTAGTGGA TATCCCGAGC ACGACATTTG TATCTTCTGC ACAGCCGGGT 
ATGAATTTTT CTGTGTATCC AACCATCTAT GCAGGTACCG ATGGACAGTA TCAAAATTGT
ATAAGTTTAA TGCAAATAGT ATTACCATCA TTACCCGTTA ATTTTGTTGA CAGTGCTGTT
CTTCAGCTGG CTGTTATAGC AAAAAGCGGA ACTAATCCCA GCCCTGTTTT AGTAAATACA
GTAATGGAAC CATATAACAG AACTAGTGTG ACCTATGATA CACGGCCGGC TTATACGCCA
ACTTCTTCAC AGATTAATGT AACTACAGCA GATCTTTACA AAACAGTTGA AATTGACATA
ACATCTCTGG TGAACAGCTG GCTTAACGGA ACCGTTGCAA ACAACGGCTT AGCCTTAACC
AATTCTGATG GAAATACAGT TGTACAATTT GGTACAGATA ATATCTCATG GGAGCCGTAT
TTTCCGAAGC TACTTCTTAC ATACTCAGGA ACACACGGAG GAAATTCAGC AACCTGCTTC
TGCTATTCCC AGTTGGCACA CGTTATTCAG CAGATTATAA TGTTTTATCC GGCAAGCACC
ATAACTGTTT TTACAAAAGG CTTAACTGCT TCATCTATAA CCGGTACGCC ACACCAGCTG
TTTTCTTCTT CGGTTAGTTC AAATGGAGCT TTATTTATTG TCATGGACAG CGGGCAGCAG
CAGGTAATCC CGGTTAACTC AATAACAGCA ATATATACAG GCGATGGTAC GGTATACAAT
CCCTCTTTCG ATTATCTGCC GGCGCCGACT TTCCCTGTGG GCTGTGATAC TGATCTCGTA
ACGGCATACT ACGAATACTT AAACGATAAG ACTGATATTG ACATATATAC GGGCTCAAAC
ATACATGCTA CAGGGACAAT ATATAAGAAT GAATGTGGAA TCATAGTATT ATCGGATGGT
AGCGGAAATA CACCTGTATT TATCCCTGTT CTGCCTATAA CTGCTTTAAT TCCTTCAACT
AGCCCGTCCT TAGCCAAGGT GAACAGCGAT AAAAGTCAGG TATCTATTAT TGTAGAAACC
TCTGCACAGC AAGTAAAACC AAAATAA
 
Protein sequence
MSTVVVDIPS TTFVSSAQPG MNFSVYPTIY AGTDGQYQNC ISLMQIVLPS LPVNFVDSAV 
LQLAVIAKSG TNPSPVLVNT VMEPYNRTSV TYDTRPAYTP TSSQINVTTA DLYKTVEIDI
TSLVNSWLNG TVANNGLALT NSDGNTVVQF GTDNISWEPY FPKLLLTYSG THGGNSATCF
CYSQLAHVIQ QIIMFYPAST ITVFTKGLTA SSITGTPHQL FSSSVSSNGA LFIVMDSGQQ
QVIPVNSITA IYTGDGTVYN PSFDYLPAPT FPVGCDTDLV TAYYEYLNDK TDIDIYTGSN
IHATGTIYKN ECGIIVLSDG SGNTPVFIPV LPITALIPST SPSLAKVNSD KSQVSIIVET
SAQQVKPK
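
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which start codons are allowed). The names `translate` and `gc_percent` are hypothetical, and whitespace is stripped because the sequence is displayed in blocks of ten bases.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTCAACCG TTGTAGTGGA TATCCCGAGC"))  # MSTVVVDIPS
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` can be compared against the GC content field of the record.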