Gene Cfla_3043 details


Gene Information

Locus tag: Cfla_3043
Symbol:
ID: 9146955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3384539
End bp: 3386863
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: cellulose-binding family II
Protein accession: YP_003638125
Protein GI: 296130875
COG category:
COG ID:
TIGRFAM ID:
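
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3384539
end_bp = 3386863

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not encode an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2325
print(protein_length)  # 774
```

Under these assumptions the computed values agree with the reported Gene Length (2325 bp) and Protein Length (774 aa).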


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGTC GCACAGCAGC ACTCGCGGCG GCCGCCGCGA CCGCCACCGC CCTCGGCACC 
CTGGCCGCGG CACCGGCCGC GACGGCCGCG GTCGCGTGCT CGGTCACCTA CACGGTCACG
AGCCAGTGGC CCGGCGGGTT CGGTGCGGAC GTGCGCGTCA CCAACCTGGG TGACGCGCTG
GACGGCTGGA CCCTCACGTG GACGTTCGTC GCCGGGCAGA CCGTGCAGCA GGCGTGGAAC
GGCACCGCGA CGCAGTCCGG CGCGCAGGTC AGCGTGACCC ACGCCGGCTG GAACGCGCAG
GTCGGCAGCG GCGGGCAGCT CGCCTTCGGG TTCAACGGCA CGTCCGTCGG CACCACCAAC
CCCGTGCCCA CGTCGTTCGC GCTCAACGGC GCGACGTGCG ACGGGTCGAC CGCTCCCCCG
TCGTCGCCGA CCCCCAGCCC GACGCCCTCC CCCACGCCGA GCGCCACCCC GTCGGCGACG
CCCGGCACGA CGCCGTCGGC GACACCCAGC CCGACCCCCA GCCCGACACC CTCACCGACG
CCCAGCCCCA CGCCCGTGGT CGTCCCGGAC CCGGTGCCGA CGACGCCCGT CGCGGGCGCG
CGCCAGGTCG AGGACCTCGA CCGCGGCCTG GTGTCCGTGC GCTCCGGCAG CGGCAACCTC
GTGCAGTGGC GCCTCCTGGG CTACGAGGAC CGCGGCACGG GTTTCCACGT CTACCGCGAC
GGCGCGCGCA TCACGTCGTC GCCGGTGACG GGATCGACCA ACTACCTCGA CGACGGTGCG
TCCGCCGGCG CGCGGTACAC CGTCCGGGCC GTCACGGCGG GCGGTGAGCA GGCACCCTCC
GCGACGTCGC GGAACCTCCC GAATGGCTAC CTCGACGTGC CCGTCCAGCG GCCGTCGTCG
AACCACGTCA TCAACGACGG GTCCGTCGGC GACCTCGACG GCGACGGCGA CCTCGACGTC
GTCCTCAAGT GGGACCCGAC CGACGCCAAG GACAACAGCC AGGCCGGGTA CACCGGCAAC
GTCTACCTCG ACGGCGTGAC CCTCGAGGGG CAGCGGCTGT GGCGCATCGA CCTGGGCCGC
AACATCCGCG CCGGGGCGCA CTACACGCAG TTCCAGGTGT ACGACTACGA CGGCGACGGC
AAGGCCGAGG TCGTCGTGAA GACGGCCGAC GGCACGCGCT CGGGCACCGG CGAGACCATC
GGCAACGCGT GGGCCGACCA CCGCAACGGC GAGGGTTACG TGCTCGCCGG CCCGGAGTAC
CTCACCGTGT TCCGCGGTGA CACCGGCGCC GTCGCCGCGA CCGTCGACTA CGTCCCGCCG
CGCGGGACCG TCTCCGCGTG GGGCGACAGC TACGGCAACC GCGTCGACCG GTTCCTCGCG
GGCACCGCGT ACGTCGACGG GCAGCGGCCG TCGATCGTGA TGGCGCGCGG CTACTACACG
CGCACCGTGG TGTCCGCGTG GGACTTCCGC AACGGACAGC TCACGCGCCG GTGGACGTTC
GACTCGAACA GCTCCACGCC CGGCAACTCC GCGTACGCCG GGCAGGGCAA CCACTCGCTG
TCCGTCGCGG ACGTCGACGG CGACGGGCGC GACGAGGTCG TGTACGGGGC GTCGGTGATC
GACGACGACG GGCGCGGCCT GTGGGTCAAC GGCACGGGCC ACGGCGACGC CGGGCACGTC
GGCGACCTCG TGCCGTGGCG CTCCGGCCTG GAGTACTTCA AGGTCACCGA GGACAAGTCG
CAGCCCAACA TGTGGGTGGC CGACGCGCGC ACCGGCCAGA CGCTGTGGCG CAGCGGCACG
GGCGCCGACA ACGGCCGCGG GGTGGCCGGT GACGTCTGGG CCGGCAGCGC GGGCGCCGAG
GCATGGTCGT CCGCCGAGAA CGACCTGCGC AGCGCCGCGA CGGGGCAGTC GGTCGGCCGC
AAGCCGTCGT CCGCGAACTT CCTCGCCTGG TGGGACGGCG ACCCGGTCCG CGAGCTGCTC
GACCAGACGA AGATCGACAA GTACGGGCCC AGCGGCGAGA CGCGCCTGCT GACCGGTGCC
GACGTGCGGT CGAACAACGG CACCAAGGCC ACGCCCGTGC TGTCCGGCGA CATCCTCGGC
GACTGGCGCG AGGAGGTGGT CTGGGCGCGG TCCGACGAGG GCGCGCTGCG GATCTACGTG
ACGCCGCACC GCACCGACCT GCGGGTGCCG ACGCTGCTGC ACGACCCGAC GTACCGCGTC
GCGCTGGCCT GGCAGAACAC CGCCTACAAC CAGCCCCCGC ACCCGTCGTT CGCCTTGGGT
GACCCGTTCA CGGCCCCGCC GCAGCAACGC CTGTACGTCC GCTGA
 
Protein sequence
MRRRTAALAA AAATATALGT LAAAPAATAA VACSVTYTVT SQWPGGFGAD VRVTNLGDAL 
DGWTLTWTFV AGQTVQQAWN GTATQSGAQV SVTHAGWNAQ VGSGGQLAFG FNGTSVGTTN
PVPTSFALNG ATCDGSTAPP SSPTPSPTPS PTPSATPSAT PGTTPSATPS PTPSPTPSPT
PSPTPVVVPD PVPTTPVAGA RQVEDLDRGL VSVRSGSGNL VQWRLLGYED RGTGFHVYRD
GARITSSPVT GSTNYLDDGA SAGARYTVRA VTAGGEQAPS ATSRNLPNGY LDVPVQRPSS
NHVINDGSVG DLDGDGDLDV VLKWDPTDAK DNSQAGYTGN VYLDGVTLEG QRLWRIDLGR
NIRAGAHYTQ FQVYDYDGDG KAEVVVKTAD GTRSGTGETI GNAWADHRNG EGYVLAGPEY
LTVFRGDTGA VAATVDYVPP RGTVSAWGDS YGNRVDRFLA GTAYVDGQRP SIVMARGYYT
RTVVSAWDFR NGQLTRRWTF DSNSSTPGNS AYAGQGNHSL SVADVDGDGR DEVVYGASVI
DDDGRGLWVN GTGHGDAGHV GDLVPWRSGL EYFKVTEDKS QPNMWVADAR TGQTLWRSGT
GADNGRGVAG DVWAGSAGAE AWSSAENDLR SAATGQSVGR KPSSANFLAW WDGDPVRELL
DQTKIDKYGP SGETRLLTGA DVRSNNGTKA TPVLSGDILG DWREEVVWAR SDEGALRIYV
TPHRTDLRVP TLLHDPTYRV ALAWQNTAYN QPPHPSFALG DPFTAPPQQR LYVR
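
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The following is a rough sketch of how the GC content field could be recomputed from the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here for brevity, so the printed percentage describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGACGTC GCACAGCAGC ACTCGCGGCG GCCGCCGCGA CCGCCACCGC CCTCGGCACC"

print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line)))    # GC percentage of this line only
```

Passing all lines of the gene sequence to `gc_percent` would give a value comparable with the reported GC content of 74%, and `len(clean_sequence(...))` on the full sequences can be compared with the reported gene and protein lengths.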