Gene Cfla_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3024 
Symbol 
ID9146936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3354156 
End bp3356645 
Gene Length2490 bp 
Protein Length829 aa 
Translation table11 
GC content71% 
IMG OID 
Productglycoside hydrolase family 62 
Protein accessionYP_003638106 
Protein GI296130856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGGA CATCGCGACG CGCCCGTCGG CTGCTCGTCG CCACGACGGC CGCCGCCGCC 
GTGGCGCTCA CGGGCGCGCT GACGGCGGCG CCCGCGCTCG CGGCGGAGAG CACGCTCGGC
GCCGCCGCCG CGCAGACCGG CCGCTACCTC GGCGTCGCCG TGGCCGCCGG CCGGATGAAC
GACGGCACGT ACATCGGGAT CGTCGAGCGC GAGTTCAACT CGATCGTGGC CGAGAACGAG
ATGAAGATGG ACGCCACGGA GCCGAACCAG AACCAGTTCA GCTACGGCAA CGGCGACCGC
ATCGTCGACT GGGCGCGCGC CCGGGGCAAG AAGGTGCGCG GCCACACCCT GGCGTGGCAC
TCGCAGCAGC CCGGCTGGAT GCAGCGCATG GAGGGTCAGC AGCTGCGCAA CGCCCTGCTC
AACCACGTCA CGCAGGTCGC CACGCACTAC CGCGGCAAGA TCGACAGCTG GGACGTCGTC
AACGAGGCGT TCGCCGACGA CGGCCGCGGC AGCCGCCGGG ACTCCAACCT GCAGCGCACC
GGCAACGACT GGATCGAGGC CGCGTTCCGC GCCGCCCGCG CCGCGGACCC GGGCGCCAAG
CTCTGCTACA ACGACTACAA CACCGACGGC GTCAACGCGA AGTCCACGGG CATCTACAAC
ATGGTCCGCG ACTTCAAGGC CCGCGGCGTC CCGATCGACT GCGTCGGCTT CCAGTCCCAC
CTGGGCACGG GCGTGCCGAG CGACTACCGG GCCAACCTGG AGCGCTTCGC CGCGCTCGGC
GTCGACGTGC AGATCACCGA GCTCGACATC GAGCAGGGCG GCAACCAGGC CAACGCGTAC
CGGCAGGTCA CCGAGGCGTG CCTCGCGGTG CCGCGGTGCA ACGGCATCAC CGTGTGGGGC
GTGCGGGACA GCGACTCGTG GCGCACGGGC GCCAACCCGC TGCTGTTCGA CGGCTCGGGC
AACAAGAAGG CCGCCTACAC GTCGGTGCTG AACGCGCTGA ACGCGGCGAC GCCGTCCCAG
TCGCCGACGC CCTCGCCCAC GCCGTCGTCG TCCCCGAGCG CGTCGCCGTC CGCGTCCCCG
AGCGCGTCGC CGTCCGCGTC GCCCAGCCCG ACGCCCACGC AGGGCACCGG GACGGGCGCG
TGCACGGTCG TCTACACCGT CGGCGCGCAG TGGCCCGGCG GCTTCACCGC GGACGTCAAG
GTGACCAACC ACGGTGCGGC GCTCACCGGG TGGACGTTGA CGTTCACCTT CCCCGGCAGC
CAGACCGTGC AGCAGGCGTG GAACGGTGTC GCGTCGCAGA GCGGCTCGCA GGTGACAGTC
CGCAACGCCG ACTACAACGG CTCGGTACCC GCGGGTGGCA CCGTGGGCTT CGGGTTCAAC
GGCGGGTTCT CGGGCAGCAA CCCGGTGCCG ACCGCGTTCG CGCTCAACGG CGTCGGGTGC
AACGGCGCCG TGACGCCGAC GCAGCAGCCG ACGGCGACGC CCTCGCCGAC ACCGTCGGTG
ACGCCGAGCC CGACGCCGTC CCCGACACCG TCGGTGACGC CGAGCCCGAC CCCCTCGCCG
ACGCCGAGCC CGACGGCCTC GCCGACGTGC AACCTGCCGT CGTCGTACCG GTGGCGCGAC
TCCGGGGTGC TGGCGCAGCC GCGCCAGGGC TGGGTGTCGC TGAAGGACTT CTCGGTCGCG
CCGTACAACG GCCAGCAGCT GGTGTACGCC ACGACCAACA CCGGCACCGC GTGGCAGTCG
ACGATGTTCA GCCCGTTCTC GAGCTGGAAC CAGATGGGCT CGGCCCAGCA GCAGACGATG
CCGTTCACGG CCGTCGCCCC GTCGCTGTTC TACTTCGCAC CCAAGAACAT CTGGGTGCTG
GCGTACCAGT GGGGCGGCCC GGCGTTCTCC TACCGCACGT CGACCAACCC GTCGAACGTC
AACGGCTGGA GCGCGCACCA GACGCTGTTC ACCGGCTCGA TCTCCAACTC CGGCACCGGC
CCCATCGACC AGGCCCTGAT CGGTGACGAC CGGAACATGT ACCTGTTCTT CGCCGGTGAC
AACGGGCGGA TCTACCGCGC GTCGATGCCG ATCGGGAACT TCCCCGGGTC GTTCGGGTCG
AACTACACGA CGATCATGCA GGACTCGACC AACAACCTGT TCGAGGCCGT GCAGGTCTAC
AAGCTGGCGG GCCAGCAGCG GTACCTCATG ATCGTCGAGG CCATCGGCAG CCAGGGCCGG
TACTTCCGGT CCTTCACGGC GACCGACCTG GGCGGCTCCT GGACCCCGCA GGCGACGTCC
GAGTCCAACC CGTTCGCCGG CAAGGCCAAC TCCGGTGCGA CGTGGACGAA CGACATCAGC
CACGGCGAGC TGCTGCGCAC CAGCGCGGAC CAGACCATGA CGGTCGACCC CTGCAACATG
CAGCTGCTCT ACCAGGGCCG CTCCCCGCAG TCCGGCGGCG ACTACGGCGC CCTGCCCTAC
CGCCCCGGCC TGCTGACGCT GCAGCGGTAG
 
Protein sequence
MDRTSRRARR LLVATTAAAA VALTGALTAA PALAAESTLG AAAAQTGRYL GVAVAAGRMN 
DGTYIGIVER EFNSIVAENE MKMDATEPNQ NQFSYGNGDR IVDWARARGK KVRGHTLAWH
SQQPGWMQRM EGQQLRNALL NHVTQVATHY RGKIDSWDVV NEAFADDGRG SRRDSNLQRT
GNDWIEAAFR AARAADPGAK LCYNDYNTDG VNAKSTGIYN MVRDFKARGV PIDCVGFQSH
LGTGVPSDYR ANLERFAALG VDVQITELDI EQGGNQANAY RQVTEACLAV PRCNGITVWG
VRDSDSWRTG ANPLLFDGSG NKKAAYTSVL NALNAATPSQ SPTPSPTPSS SPSASPSASP
SASPSASPSP TPTQGTGTGA CTVVYTVGAQ WPGGFTADVK VTNHGAALTG WTLTFTFPGS
QTVQQAWNGV ASQSGSQVTV RNADYNGSVP AGGTVGFGFN GGFSGSNPVP TAFALNGVGC
NGAVTPTQQP TATPSPTPSV TPSPTPSPTP SVTPSPTPSP TPSPTASPTC NLPSSYRWRD
SGVLAQPRQG WVSLKDFSVA PYNGQQLVYA TTNTGTAWQS TMFSPFSSWN QMGSAQQQTM
PFTAVAPSLF YFAPKNIWVL AYQWGGPAFS YRTSTNPSNV NGWSAHQTLF TGSISNSGTG
PIDQALIGDD RNMYLFFAGD NGRIYRASMP IGNFPGSFGS NYTTIMQDST NNLFEAVQVY
KLAGQQRYLM IVEAIGSQGR YFRSFTATDL GGSWTPQATS ESNPFAGKAN SGATWTNDIS
HGELLRTSAD QTMTVDPCNM QLLYQGRSPQ SGGDYGALPY RPGLLTLQR