Gene Cfla_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0172 
Symbol 
ID9144038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp206996 
End bp208084 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content70% 
IMG OID 
Productchitin-binding domain 3 protein 
Protein accessionYP_003635290 
Protein GI296128040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.808366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00471086 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATCC CCACCCGGAG CCGGTTCGGC CGACTCGCTC GGCTCGCCCT CGCCGTCCCC 
CTCGCCCTCG CGGCCACCGG CATCGTCGCC ACGTCCGCCT CCGCCCACGG CTCCGTCACC
GACCCGCCGT CGCGCAACTA CGGCTGCTGG GAGCGCGAGG GCGGCACGCA CATGGACCCC
GCCATGGCGC AGCGCGACCC CATGTGCTGG CAGGCCTTCC AGGCCAACCC CAACACCATG
TGGAACTGGA ACGGCAACTT CCGTGAGGGC GTCGGCGGCC GCCACGAGCA GGTCATCCCC
GACGACCAGC TCTGCTCGGC CGGCAAGACG CAGAACGGCC TGTACGCGTC GCTCGACACC
CCCGGCCCGT GGATCATGAA GACGGTCCCG CACAACTTCA CGCTCACGCT GACGGACGGC
GCCATGCACG GTGCCGACTA CATGCGCATC TACGTGTCGA AGGCGGGTTA CGACCCGACG
ACCGACCCGC TGGGCTGGGA CGACATCGAG CTGATCAAGG AGACGGGCCG CTACGGCACG
ACCGGTCTCT ACCAGGCGGA CGTCTCCATC CCGTCCAACC GCACGGGCCG CGCGGTGCTG
TTCACGATCT GGCAGGCCTC GCACCTCGAC CAGCCGTACT ACATCTGCTC GGACATCAAC
ATCAACGGGA CCGCGCCGAC GCAGCAGCCG ACGCAGCAGC CGACGCAGCA GCCCACCCAG
CAGCCGACGC AGCAGCCCAC CCAGCAGCCC ACCCAGCAGC CGACGCAGCA GCCGACGCAG
CAGCCCACGC AGCAGCCCAC GCAGAACCCG GGCACCGGTG CCTGCACCGC GACGGTCAAG
GCCGCCAGCA CGTGGGGCAA CGGCTGGCAG GGTGAGGTCA CCGTGACGGC CGGCTCCAGC
GCGATCAACG GCTGGAAGGT CACCGTCGGT GGCGCGTCGA TCACGCAGGC ATGGAGCGGC
TCCTACAGCG GTGGGACGTT CTCCAACGCC GAGTGGAACG GCAAGCTCGC GGCAGGTGCC
TCGACGACGG CCGGCTTCAT CGCCTCGGGT ACGCCCGGCA CGCTGACGGC CACCTGCACC
GCGGCCTGA
 
Protein sequence
MFIPTRSRFG RLARLALAVP LALAATGIVA TSASAHGSVT DPPSRNYGCW EREGGTHMDP 
AMAQRDPMCW QAFQANPNTM WNWNGNFREG VGGRHEQVIP DDQLCSAGKT QNGLYASLDT
PGPWIMKTVP HNFTLTLTDG AMHGADYMRI YVSKAGYDPT TDPLGWDDIE LIKETGRYGT
TGLYQADVSI PSNRTGRAVL FTIWQASHLD QPYYICSDIN INGTAPTQQP TQQPTQQPTQ
QPTQQPTQQP TQQPTQQPTQ QPTQQPTQNP GTGACTATVK AASTWGNGWQ GEVTVTAGSS
AINGWKVTVG GASITQAWSG SYSGGTFSNA EWNGKLAAGA STTAGFIASG TPGTLTATCT
AA