Gene Cfla_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1897 
Symbol 
ID9145790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2111228 
End bp2113021 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycoside hydrolase family 5 
Protein accessionYP_003636993 
Protein GI296129743 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.754177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCT CACGTGTGCG CAGGCTGCGC ACCGCGCTCG GCAGCGTCCT CGCCGCCGGC 
GCCCTGACGC TCCCCCTCGC CCTGTCCGCG GCGCCTGCAC AGGCGGCCGA CACGCCCGAC
TGGCTGCACG TCCAGGGCAA CAAGATCGTC GACGCCTCCG GCAAGGAGGT GTGGCTCACC
GGCGTGAACT GGTTCGGGTT CAACGCCGAC GAGCGCGTCT TCCACGGCCT GTGGTCCGCG
AACATGCGGA CGCTCACCAA GGGCATGGCC GACCGCGGCC TCAACGTCGT GCGCGTGCCG
ATCTCCGCCG AGCTGATGCT CGAGTGGAAG GCGGGCACGT TCACCAAGCC GAACGTCAAC
GAGTTCGCCA ACCCCGAGCT CGCCGGGCTC AACAGCCTGC AGATCTTCGA GAAGTTCCTC
GAGATGAGTG ACGAGTACGG CCTCAAGGTC TTCCTCGACG TCCACTCCGC CGAGGCGGAC
AACTCGGGGC ACGTCTACCC CGTGTGGTGG AAGGGCGACA TCACCACCGA GCACGTCTAC
GAGGCGTGGG AGTGGGCGGC GGCCCGCTGG AAGACGAACG ACACGCTCAT CGGCGCCGAC
CTGAAGAACG AGCCGCACGG CACCCAGGGC CAGACCGAGC GCGCCAAGTG GGACGGGTCG
ACCGACAAGG ACAACTTCAA GCACTTCGCC GAGACGGCGG CGAAGAAGGT CCTGGCGATC
AACCCGAACT GGCTGATCTT CGTCGAGGGC ATCGAGATCT ACCCGAAGGA CGGCGTGTCC
TGGTCCTCGA CAGGTCTGAC CGACTACCAC AACATGTGGT GGGGCGGGAA CCTGCGCGGC
GTGCGGGACT TCCCGATCGA CCTCGGTGCG AACCAGGACC AGCTCGTGTA CTCCCCGCAC
GACTACGGGC CGCTCGTGCA CCTGCAGCCG TGGTTCAAGG GTGAGTGGAG CCGCGAGACG
CTCGAGCGCG ACGTGTGGGA CCCGAACTGG CTGTACCTGC ACAAGGAGAA CACCGCACCG
CTGCTCATCG GCGAGTGGGG CGGCTTCATG GACGGCGGGC CCAACGAGAA GTGGATGGTC
GCGCTGCGCG ACCTCATCGT CGACCGGCGC CTGAGCCACA CGTTCTGGGT GCTCAACCCG
AACTCGGGTG ACACCGGGGG CCTGCTCGGC TACGACTGGG CCACGTGGGA CGAGGAGAAG
TACGCGCTGC TCAAGCCGGC GCTGTGGCAG GACGGCGGCA AGTTCGTCGG CCTCGACCAC
GACGTCCCGC TGGGCGGCGT GGGCAGCACG ACGGGCAAGT CGCTGTCCCA GGTCAACGGC
ACGTTCCCGT CGCCCACCGC GACGGCGACC CCGACGCCGA CCCCCTCGGT CACGCCGACG
CCCACGCCGT CGGTGACGCC GACCCCGACC CCCTCGGTCA CGCCGACGCC GACGCCCAGT
GCCACGCCGA CACCCACGGC GACGCCGACG AGCGCCGCAG GTGCGTGCAC GGTGACGTTC
AGCGGCAACG CCTGGAACTC GGGCATGACC GGCGCCGTGC GGCTGACGAA CACCGGCAGC
ACGCTGTCGG GCTGGACCTT GACGTTCACG GCGCCCGCGG GTGTGACGGT GACCCAGGGC
TGGGGCGGCA CGTGGTCGCA GTCCGGCTCC ACGGTGACGG TCCGCAACGC GGACTGGAAC
GGCACGCTCG CGACGGGCGG CACGGTCGAG ATCGGGTTCA ACGCCTCGCA CGGCGGCACG
ACGGGCACGC CCTCGGGCTT CGCCGTCAAC GGGGCGTCCT GCGCCACGGC CTGA
 
Protein sequence
MTTSRVRRLR TALGSVLAAG ALTLPLALSA APAQAADTPD WLHVQGNKIV DASGKEVWLT 
GVNWFGFNAD ERVFHGLWSA NMRTLTKGMA DRGLNVVRVP ISAELMLEWK AGTFTKPNVN
EFANPELAGL NSLQIFEKFL EMSDEYGLKV FLDVHSAEAD NSGHVYPVWW KGDITTEHVY
EAWEWAAARW KTNDTLIGAD LKNEPHGTQG QTERAKWDGS TDKDNFKHFA ETAAKKVLAI
NPNWLIFVEG IEIYPKDGVS WSSTGLTDYH NMWWGGNLRG VRDFPIDLGA NQDQLVYSPH
DYGPLVHLQP WFKGEWSRET LERDVWDPNW LYLHKENTAP LLIGEWGGFM DGGPNEKWMV
ALRDLIVDRR LSHTFWVLNP NSGDTGGLLG YDWATWDEEK YALLKPALWQ DGGKFVGLDH
DVPLGGVGST TGKSLSQVNG TFPSPTATAT PTPTPSVTPT PTPSVTPTPT PSVTPTPTPS
ATPTPTATPT SAAGACTVTF SGNAWNSGMT GAVRLTNTGS TLSGWTLTFT APAGVTVTQG
WGGTWSQSGS TVTVRNADWN GTLATGGTVE IGFNASHGGT TGTPSGFAVN GASCATA