Gene Cfla_0242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0242 
Symbol 
ID9144108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp278887 
End bp280230 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content71% 
IMG OID 
Productcellulose-binding family II 
Protein accessionYP_003635360 
Protein GI296128110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACTCC GACACACGTC CCGGCCACGC AGGGCGGTGC TCGTGGCAGC AGCCGCGGCG 
CTCGTCCTCG GCGGCCTCGC AGCCCCCGCG ACGGCCCAGC CCGCCCCCAC CCTGGCGGGC
GGCGTCGCGC CCATGGCCGG CTCGGCCGGC TGTGGCAGCT CACCGCGCCT GAGCACCGGC
AACCAGACCA TCACCAGCGG CGGGCAGCAG CGCTCGTTCC GCCTCGACGT GCCCTCCAAC
TACGACCCGA ACCGGCAGTA CCGCCTGGTG TTCGGCATCC ACTGGTGGCA CGGTACGTCG
CAGGACGTCG TCAACGAGCA GTTCTACGGC CTCAAGCCGC TGGCCAACAA CAGCACGATC
TTCGTCGCGC CGCAAGGCAT CGACAACGCG TGGCCCAACC CCAACGGGCG TGACACCACG
TTCATCGACG ACATCCTGCG CACGGTCCAG AACGCGCTGT GCGTCGACTC GTCGCAGATC
TTCTCGACCG GCTTCAGCTA CGGCGGCGGC ATGAGCAACG CGCTGGCGTG CGCGCGTGCG
AACGTGTTCC GCGCGGTGGC GGTGCTCAAC GGTGCGCAGC TCTCCGGCTG CGACGGCGGC
ACCCAGCCCA TCGCGTACCT CGGCTCGCAC GGCGTCGTCG ACAGCGTCCT CAACATCTCC
CAGGGCCGCG CACTGCGTGA CCGCGCACTG CGGAACAACG GCTGCCAGGC CCAGAACGCT
CCCGAGCCGC AGGGCAACAG CGGGCAGGCG CACACCAAGA CGGTGTACCA GTGCCGCGAC
GGCTACCCCG TGGTCTGGAT CGCCAACGAC AGCGACCACC AGTGGGCCGC TGTCGACCGC
GGCCAGCAGC GCTCGCACGT CCCCGGGGAG ATCTGGTCGT TCTTCACGTC GCTGCCGTCG
ACGAGCGGCC CGACGCCCAC CCCGACGCCC ACCCCGACGC CGACGGTCTC GCCGACGCCG
ACCTTCTCGC CCACGCCCTC CTCGACGCCG ACGGTCTCGA CCACGCCGAG CCCGACGCCG
TCGCCCGCGG GCACCACCCC GCCGCCCGCC TCGGGTGGCT GCACCGCGAC GTACAAGCTC
ATGAACTCGT GGCCCGGCGG CTGGCAGGGT GAGGTGACCG TGAGCGCCGG TTCCTCGATC
CGCGGCTGGA CCGTCTCGTG GAGCTCGAAC GGCGAGCGCA TCGAGCAGCT CTGGAACGGC
GAGCTCTCGC AGGGCGGTCA GGTCCAGGTG AAGAACGTGT CCTGGAACGG CGCGCTGAAC
GCGAGCGGCA GCGCGAGCTT CGGCTTCCTC GGCAGCGGCA ACGCGCCGTC GAGCCTGTCG
AACCTCACCT GCTCGGCCGC CTGA
 
Protein sequence
MRLRHTSRPR RAVLVAAAAA LVLGGLAAPA TAQPAPTLAG GVAPMAGSAG CGSSPRLSTG 
NQTITSGGQQ RSFRLDVPSN YDPNRQYRLV FGIHWWHGTS QDVVNEQFYG LKPLANNSTI
FVAPQGIDNA WPNPNGRDTT FIDDILRTVQ NALCVDSSQI FSTGFSYGGG MSNALACARA
NVFRAVAVLN GAQLSGCDGG TQPIAYLGSH GVVDSVLNIS QGRALRDRAL RNNGCQAQNA
PEPQGNSGQA HTKTVYQCRD GYPVVWIAND SDHQWAAVDR GQQRSHVPGE IWSFFTSLPS
TSGPTPTPTP TPTPTVSPTP TFSPTPSSTP TVSTTPSPTP SPAGTTPPPA SGGCTATYKL
MNSWPGGWQG EVTVSAGSSI RGWTVSWSSN GERIEQLWNG ELSQGGQVQV KNVSWNGALN
ASGSASFGFL GSGNAPSSLS NLTCSAA