Gene Cfla_3401 details


Gene Information

Locus tag: Cfla_3401
Symbol:
ID: 9147317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3782367
End bp: 3784922
Gene Length: 2556 bp
Protein Length: 851 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Glycosyl hydrolase family 32 domain protein
Protein accession: YP_003638477
Protein GI: 296131227
COG category:
COG ID:
TIGRFAM ID:
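
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3782367
end_bp = 3784922

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 2556 bp
print(protein_length, "aa")   # 851 aa
```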


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.18387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGACGAC GCATCCGCGC CCTGGGCGCG ACGGCCGCGG TGGCGCTCAT GACGGGGCTG 
GCGGCCCCGG TGCAGGGCCA GGACGCAGCC GACCCGCACC GGCCGGTGGT CCACTTCGCC
CCGCAGCGCC ACTGGGTCAA CGACCCCAAC GGTCCCGTGT GGTACGACGG GCAGTACCAC
CTGTTCTTCC AGCACAACCC GCTCGGGGAC ACCTGGGGCC ATATGTCCTG GGGCCACGCC
GTCAGCACGG ACCTCGTGAC GTGGGAGGAG CGGCCCCTCG CGATCCCGTG GTCCGAGCGG
GAGCACATCT TCTCCGGCAC CGTCGTCGTG GACGAGGGGA ACACGAGCGG CCTCGGCACG
CCCGGCACCA CCCCGCTCGT GGCCGCCTAC ACCTCGTGGG ACCCGCTCAC GGGCATCCAG
TCGCAGTCGG TCGCCTCCAG CCTCGACGCC GGGGAGACGT GGACCGCGTA CGAGGGCAAC
CCGGTCCTCG ACATCGGGTC GCGCGAGTTC CGCGACCCCA AGGTGTTCCG GTACGAGGCC
GGCGGGTACT GGGTGATGGC GGTGGCGCTC GCGGAGGAGC GGATCATCCG GTTCTACCGC
TCGCACGACC TGATCCGCTG GACGCACCTC AGCGACTTCG GGCCGGCCGG CGCCGTCGGC
GGCGTCTGGG AGATGCCCGA CCTGTTCGAG CTGCCGGTCG ACGGCGACCC GGCACGCACC
CGGTGGGTGC TGGTCGTGAG CCTCAACCCC GGAGCCGTCG CCGGGGGGTC GGGCGCCCAG
TACTTCGTCG GCGAGTTCGA CGGCACCCGC TTCGTCGCCG ACCCGCCCCC CGCGCCGGGT
CCGGACGGCG ACGTCCTCGC CGACTTCGAG GGCGGCACGT ACGGCGAGGG CTGGACCACC
ACCGGCGACG CCTTCGGCGA CGGACCGGTG TCCGGCACCC TGCCGGGTCA GCACCCCGTC
ACGGACTTCC GGGGCGAGGG ACTCGTCAAC AGCTTCCGTG GCGGAGACGC CGCCACCGGC
ACCCTCACGT CACCGCCGTT CACCGTCGAG CGCCGTCACC TCACGATGCT CGTCGGCGGC
GGCCGGCACC CCCACCGCCC GGGCACCGGC GACGGCGCAG CGCCCGCGGG CACGGTGCTC
GCGGACTTCG AGGGGACCGA CTTCGGCGAC TGGACGGTCG AGGGGACCGC CTTCGGGCCC
GGTCCGCTCG CCGGCGACGC CCCCTGCCAG ACGGGTGTGC GCGGATACCT CGGCACGCGC
CTGGCCAACT CCTACCAGAA CGGCCGAAGC GACCCGTGCA CGCCTCCGCC CGACTCCGCC
ACCGGCCGGC TCGTGTCCCC GACGTTCACC GTCGACCGCC CGTGGATCAG CATGCTCGTC
GGTGGTGGTG CGGGGCCCGG CACCGCCGTG CGGCTCGTCG TCGACGGGCA GGTCGTGCGC
ACCGCCTCCG GACGGGAGAG CGGGACGCTC GACTGGGCGA CCTGGGACGT CGCCGAGCTC
GCCGGGCGGC GTGCCCACCT CGAGGTCGTC GACGAGGTCA CCGGGGGGTG GGGCCACATC
ATGGTCGACC ACATCGTGCT CTCGGACGAG CCGGCGCGTC CGCGGTCGGA CGAGGCGACG
GTCAACCTCG TGGTGGACGG GCGGGTCGTG CGCACTGCCA CCGGCCGCGA CTCCGAGCAA
CTCGCCCCCG TCACCTGGGA CGTCGGTGAC CTCGTCGGGC GCACCGCGCA GCTCGTGGTG
ACGGACACCA GCACGGGCGG GTGGGGTCAC ATCCTGCTCG ACCACGTCGT CGCCACGGAC
GCGCCGGTGC CCACCCCGCT CGAGCGCCAC GCCTGGGTCG ACCACGGCAC GGACTTCTAC
GCGCCGCTGA CCTTCGAGAA CACCCCGGAC GGCGAGCGGG TCGCCATCGC GTGGATGAAC
AACTGGGAGT ACGCCACGTC GACGCCGACC ACCGGGTGGC GGGGATCGAT GACGTTCCCG
CGCACCCTCG CGCTGCGCAC GGTCGACGGC CGGGTCGTGC TGACCTTCCA CCCCGTCGAC
GTGCCCGGTG CGGAACCCAT CCGCCTGCGG GACCATGAGG GGGTCCTCGA CGAGGGCACC
GTGCGCGTCC CCGAGGCGAC GCACGACGGC GCCGCGATCG TCACGGTCGA GCTCGAGGTC
GGCGACGCCG AGCGGCTCGG GCTGCACGTG CGCGTCGGGG ACGACGAGCG CACGGTCGTC
GGCTACGACG TCGCGGCCGG ACGGATGTAC GTCGACCGGA CACGCTCCGG CACGGTCGAC
TTCCATCCGG GCTTCGCCGG CGTGCACACC GCGCTGCTGC CCACCCGCGA CGGACGCGTG
CGGCTGCAGG TCGTCGTCGA CCGGTCGTCC GTCGAGGTCT TCGGCAACGA CGGGGAGGCC
ACGATCACGG ACCTCGTCTA CCCGGGCGGG GGTAGCAACG GGGTGGCACT GTTCGCCGAG
GGCGGCCGTG CGACCGTGAC CTCGCTCGAG GTGCGTCCGC TCGGTCCGGG GACGTCGTGG
GCCGTCGTGG CACGCGGGCA CGCCGCCCGC CCGTGA
 
Protein sequence
MRRRIRALGA TAAVALMTGL AAPVQGQDAA DPHRPVVHFA PQRHWVNDPN GPVWYDGQYH 
LFFQHNPLGD TWGHMSWGHA VSTDLVTWEE RPLAIPWSER EHIFSGTVVV DEGNTSGLGT
PGTTPLVAAY TSWDPLTGIQ SQSVASSLDA GETWTAYEGN PVLDIGSREF RDPKVFRYEA
GGYWVMAVAL AEERIIRFYR SHDLIRWTHL SDFGPAGAVG GVWEMPDLFE LPVDGDPART
RWVLVVSLNP GAVAGGSGAQ YFVGEFDGTR FVADPPPAPG PDGDVLADFE GGTYGEGWTT
TGDAFGDGPV SGTLPGQHPV TDFRGEGLVN SFRGGDAATG TLTSPPFTVE RRHLTMLVGG
GRHPHRPGTG DGAAPAGTVL ADFEGTDFGD WTVEGTAFGP GPLAGDAPCQ TGVRGYLGTR
LANSYQNGRS DPCTPPPDSA TGRLVSPTFT VDRPWISMLV GGGAGPGTAV RLVVDGQVVR
TASGRESGTL DWATWDVAEL AGRRAHLEVV DEVTGGWGHI MVDHIVLSDE PARPRSDEAT
VNLVVDGRVV RTATGRDSEQ LAPVTWDVGD LVGRTAQLVV TDTSTGGWGH ILLDHVVATD
APVPTPLERH AWVDHGTDFY APLTFENTPD GERVAIAWMN NWEYATSTPT TGWRGSMTFP
RTLALRTVDG RVVLTFHPVD VPGAEPIRLR DHEGVLDEGT VRVPEATHDG AAIVTVELEV
GDAERLGLHV RVGDDERTVV GYDVAAGRMY VDRTRSGTVD FHPGFAGVHT ALLPTRDGRV
RLQVVVDRSS VEVFGNDGEA TITDLVYPGG GSNGVALFAE GGRATVTSLE VRPLGPGTSW
AVVARGHAAR P
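
The protein sequence follows from the gene sequence by codon-wise translation. The gene begins with GTG, yet the protein begins with M: under translation table 11 (the bacterial, archaeal and plant plastid code), alternative start codons such as GTG are read as methionine at initiation. The sketch below is a minimal illustration, not the method the database used. The function `translate` and its `first_is_start` parameter are hypothetical names, and the function simply assumes the first codon is a valid start rather than checking it against the table 11 start-codon list.

```python
# Codon table for elongation, built from the NCBI base ordering T, C, A, G.
# Table 11 uses the same elongation assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, first_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    If first_is_start is True, the first codon is emitted as 'M' regardless of
    its sequence (assumption: it is a valid table 11 start codon).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_is_start:
            amino_acid = "M"
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "GTGCGACGAC GCATCCGCGC CCTGGGCGCG ACGGCCGCGG TGGCGCTCAT GACGGGGCTG"
last_line = "GCCGTCGTGG CACGCGGGCA CGCCGCCCGC CCGTGA"

print(translate(first_line))                       # MRRRIRALGATAAVALMTGL
print(translate(last_line, first_is_start=False))  # AVVARGHAARP
```

Both results match the start and end of the protein sequence listed above, and the final TGA codon is the stop.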