Gene Cfla_3169 details


Gene Information

Locus tag: Cfla_3169
Symbol:
ID: 9147085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3520806
End bp: 3522443
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: cellulose-binding family II
Protein accession: YP_003638250
Protein GI: 296131000
COG category:
COG ID:
TIGRFAM ID:
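
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3520806, 3522443)
print(length)                  # 1638
print(protein_length(length))  # 545
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.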


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.111807
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACCGTC CCCGACGTCC CGCCGCGGTC GCGCTCACCG CGCTGACCGC GAGCGTCGCG 
CTCGTCCTCA CGGGCGCTGC CACGCAGCTG CCCGCCGCAG CCGCCGACCA GCCCGCCCCG
ATCGCCGACA CCGTCCACGC CGTCGGGCGG GTGAAGGCCG TGTCCGGGGG CCTGGCCTAC
AGCTGGCCGG GCGTGGCCTT CGAGGGCCGG TTCCGCGGCA CGGGCGTGGG CGTCGTGCTC
GACGACGGCA ACGCGGACTA CGACCTGTTC GTCGACGGCC GGCGCAGAGC GCACTGGATC
CTGCCGGGCC AGGGCACGAA GTTCGTGACC GGCCTGGCCG ACGGCGAGCA CACCGTCCGG
CTCGTCAAGC GCAACGAGAG CCCGTGGGCG ACGAGCACGT TCGGCGGGTT CGTGCCCTGG
ACGGGCGGCG AGATCCTCGA GGCGCCGGCG CCCCGACAGG TGCAGCTCGA GTTCTACGGC
GACTCCTACA CCGCGGGCTA CGGCAACGAG TCGCGGACCC GCGAGTGCAC CGGCGACGAG
GTCAACCGCA CCACCAACGC CGACGCCGCG TTCCCCGCGA TCGTGGGCCG GGCCGTCGGC
GCGGACGTGC ACGTCAACGC GTTCTCCGGG CGCGGCATGG TGCGCAACTA CGCCGGCAGC
GACCTGGGCA CGAGCTTCCG CACCTACGCG GACCGCGCCC TGCTCGCGGT GCCCGGGGAC
GCGTGGCAGC GCCCGGCGGA CTGGCAGCCG CAGGTCGTCG TCGTGGGCCT CGGCATCAAC
GACTTCTCGA CGGCCGTGGG GGCCGGCGAG CCGTGGAGCG AGCAGACGCT GCGCACGCAG
TTCCGCGCCG CGTACGACGG CTTCGTCGAC AGCCTGCGCC GCAGCTACGG CCCGGACACG
TTCATCGTCC TCAGCGCACC CGACCACACG CCGGACATCC GCACCACCAC GGTCGCGATC
GCGCAGGACC GGCTGGCCGC GGGCGACGAC CGCGTGATCC CCTGGGCCTT CGGCGGCCTC
GACCTCACCG GCTGCCACTG GCACCCGTCC ACGGCCGACC ACGTGACGAT CGCGGCGAGC
CTCAGCCAGC TCGTCACGTC GGTCCTCAGC GTCGACGGCA TCACACCGAC CCCGGAGCCG
GAGCCGTTCC GGCCGATCCA GCCCGACGCC AGCCCGGAGC CCGCGCCGAC CGGGAACCCC
TCGGCCACCC CGAGCCCGCC GACCGGCTAC CCGACGCCCA CGCCGACCGG GCCGACCCAC
CCGACCGACC CGCCCTCGCC GACCGCACCG CCGTCCCCGA CACCGACGCC GACCGACGTG
CCCACGGCCC CGCCCGGCGC GTCGTGCACC GCCGCCCTCG CGGTGACGGC GACGTGGCCC
GGCGGCTACC AGGCGAAGGT CGACGTGACC GCGGGGTCCC GCCCGCTCGG CGGCTGGAGC
GTGACGTTCA CGCTGCCCGG CAGCCTCACG CAGGGCTGGT CCGGTGAGTT CGCCGCGTCC
GGGAGCGCCG TGACGGTGAG CAACGCGTCG TGGAACGGCG CGCTCGGCGG CGGGACCACG
ACCTCGGCCG GGTTCATCGG CAGCGGGACG CCGCCGACGA GCGGCGCGGT GCCCTGCACG
GGCGTCCCGG CGACCTGA
 
Protein sequence
MHRPRRPAAV ALTALTASVA LVLTGAATQL PAAAADQPAP IADTVHAVGR VKAVSGGLAY 
SWPGVAFEGR FRGTGVGVVL DDGNADYDLF VDGRRRAHWI LPGQGTKFVT GLADGEHTVR
LVKRNESPWA TSTFGGFVPW TGGEILEAPA PRQVQLEFYG DSYTAGYGNE SRTRECTGDE
VNRTTNADAA FPAIVGRAVG ADVHVNAFSG RGMVRNYAGS DLGTSFRTYA DRALLAVPGD
AWQRPADWQP QVVVVGLGIN DFSTAVGAGE PWSEQTLRTQ FRAAYDGFVD SLRRSYGPDT
FIVLSAPDHT PDIRTTTVAI AQDRLAAGDD RVIPWAFGGL DLTGCHWHPS TADHVTIAAS
LSQLVTSVLS VDGITPTPEP EPFRPIQPDA SPEPAPTGNP SATPSPPTGY PTPTPTGPTH
PTDPPSPTAP PSPTPTPTDV PTAPPGASCT AALAVTATWP GGYQAKVDVT AGSRPLGGWS
VTFTLPGSLT QGWSGEFAAS GSAVTVSNAS WNGALGGGTT TSAGFIGSGT PPTSGAVPCT
GVPAT
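
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The `translate` helper and its handling of the first codon are assumptions made for illustration. The codon table is built from the usual TCAG ordering.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codon -> amino acid; table 11 uses the same assignments as the standard code.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon codes for methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "GTGCACCGTC CCCGACGTCC CGCCGCGGTC GCGCTCACCG CGCTGACCGC GAGCGTCGCG"
print(translate(first_line))  # MHRPRRPAAVALTALTASVA
```

Applied to the first 60 bases of the gene sequence, this reproduces the first 20 residues of the protein sequence. The last 18 bases (GGCGTCCCGG CGACCTGA) give GVPAT followed by the TGA stop codon.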