Gene Cfla_0842 details

Gene Information

Locus tag: Cfla_0842
Symbol:
ID: 9144715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 916037
End bp: 917506
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003635951
Protein GI: 296128701
COG category:
COG ID:
TIGRFAM ID:
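
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(916037, 917506)
    print(length)                     # 1470
    print(protein_length_aa(length))  # 489
```

Under these assumptions the computed values match the Gene Length (1470 bp) and Protein Length (489 aa) fields.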


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.119521
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTCCGACG TCCTCGGGAT CGTGCCCGCG CCGCTCGTCG TCGAGCCGGC GTCGCAGCCG 
CCGTTCGTCA TCACCCGCTC GACCGTCGTC GTCGTCGACG CCGACGAGGA GCTGTTGCCG
CTGGCGGTGC TCACGGCCGA CCTGCTGGGC AGGCTCACCG GGCGGGCGGT CGAGATCCGG
CACGCCGAGG CCGGCAGCCC GGGCGTGGTG CGGATGCGGC TCGCCGAGGA CGTGCTGCCC
GGGGTCGAGG CGTACCGCGT CGTGGTCGGC TCGGGCAGGG TGCTGCTCGA GGCGCGCAGC
ACGGAGGGGC TGGTCCACGC CGTCGTCACG CTGCGCCAGC TGCTGCGCGA GCGGCCCGAC
GGCTCCGTCG AGGTCGAGGC CGTGCGCGTC GAGGACGCCC CGCGGTACCC GTGGCGCGGG
CTGTCCGTCG ACGTCGCACG CCACTTCGTC AGCGTGCCGG ACCTCAAGGT CGTCATCGGC
CTCATGGGGC ACTACAAGCT CAACGTGCTG CACCTGCACC TCACGGACGA CCAGGGGTGG
CGGCTCGACA TGCCGTCGCG GCCCGAGCTG GTGCGGCGCT CGGCCGCGCG CTCGGTGGAC
GGTGACCCGG GGGGCTACTA CTCGGCGGCG GACTGGGACG GGATCCTCGC GTTCGCGCGG
GCCCGCGGCA TCCGCGTCGT GCCGGAGATC GACGTGCCGG GGCACGTCAA CGCCGCCCTG
CACGCCTACA GCGAGCTCAA CCCCGACGGG GAGCCCGCCG AGGAGTACCT CGGCACCGAG
GTCGGGTTCT CCCGCCTGTA CGACGACCTG CCTGCGACGC ACGCGTTCCT CGCCGACGTC
CTCGGGGACC TCGCCGAGAT GACGCCGGGC GCCTACGTGC ACATCGGCGG TGATGAGGTC
CTCACGATGG AGCACGCCGA GTACGTGCGG CTCGTGCGGG CGGCGTCGGC CGCGGTCACG
GCGCACGGCA AGCGCGTCGT CGGGTGGCAG GAGATCGCGG CCGTGCCGGA TCTGCCGGCC
GGCACGGTGG TCCAGTACTG GGACATGCGC GTGGACCCCG AGCCGTTCGT CGCGGCGGCT
GCGGCCGGCG CGCGGATCCT GCTGTCGCCG GGCGCGAAGG TCTACCTCGA CATGCGGTAC
GAGCCGGGCT TCCCGCTGGG GCAGGAGTGG GCGGGCACGG TGGACCTGCG CGACGCCTAC
GAGTGGGAGC CTGCGACCCT CGTCGAGGGG CTGCCTCCCG AGGCCGTCGT CGGTGTGGGG
GCGGCCGTGT GGACCGAGAC GCTGCGGACG CTCGACGACC TGACGACCAT GCTGCTGCCG
CGGCTCGCGG CCGTCGCCGA GGTCGCGTGG AGCGCGCCCG CGCGGCGCGA CTTCGACGAC
TTCGCGGAGC GGCTGCGCAG CCACGGGCGG CACTGGGACC GCATGGGCCT CGCGTGGCAC
CCGTCCCGGC AGGGTCGCTG GGACGGGTGA
 
Protein sequence
MSDVLGIVPA PLVVEPASQP PFVITRSTVV VVDADEELLP LAVLTADLLG RLTGRAVEIR 
HAEAGSPGVV RMRLAEDVLP GVEAYRVVVG SGRVLLEARS TEGLVHAVVT LRQLLRERPD
GSVEVEAVRV EDAPRYPWRG LSVDVARHFV SVPDLKVVIG LMGHYKLNVL HLHLTDDQGW
RLDMPSRPEL VRRSAARSVD GDPGGYYSAA DWDGILAFAR ARGIRVVPEI DVPGHVNAAL
HAYSELNPDG EPAEEYLGTE VGFSRLYDDL PATHAFLADV LGDLAEMTPG AYVHIGGDEV
LTMEHAEYVR LVRAASAAVT AHGKRVVGWQ EIAAVPDLPA GTVVQYWDMR VDPEPFVAAA
AAGARILLSP GAKVYLDMRY EPGFPLGQEW AGTVDLRDAY EWEPATLVEG LPPEAVVGVG
AAVWTETLRT LDDLTTMLLP RLAAVAEVAW SAPARRDFDD FAERLRSHGR HWDRMGLAWH
PSRQGRWDG
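
The gene sequence begins with GTG while the protein sequence begins with M, which is what one would expect under translation table 11, where GTG can act as an initiation codon. The sketch below shows how the protein could be derived from the gene sequence. It is a minimal illustration, not the pipeline that produced this record: the codon table is the standard one written in the usual TCAG order, the set of alternative start codons is an assumption based on the commonly cited definition of table 11, and the names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these
# elongation assignments (assumption noted in the text above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; a recognised start codon in first position gives M."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = ("GTGTCCGACG TCCTCGGGAT CGTGCCCGCG "
                  "CCGCTCGTCG TCGAGCCGGC GTCGCAGCCG")
    print(translate(first_line))  # MSDVLGIVPAPLVVEPASQP
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way.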