Gene Cfla_3531 details


Gene Information

Locus tag: Cfla_3531
Symbol:
ID: 9147447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3921058
End bp: 3922476
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Endo-1,4-beta-xylanase
Protein accession: YP_003638602
Protein GI: 296131352
COG category:
COG ID:
TIGRFAM ID:
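
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3921058
end_bp = 3922476

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1419
print(protein_length)  # 472
```

Under these assumptions the computed values agree with the listed Gene Length (1419 bp) and Protein Length (472 aa).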


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCC CACACCACTC GCGCCGCCGT GCCCGGATCG CGGCCGTCGG CGGGCTCAGC 
GCCGCCGCGC TGATCGTGAC GCTCGCCGTG CCGGCCCAGG CCGCGGGCAG CACGCTGCAG
GCCGCCGCTG CCGAGACCAA CCGGTACTTC GGCACGGCCA TGGCAGGGCA CTACTTCAAC
AACTCCGGGA CGATGACGAT CACCAACCGT GAGTTCAACA TGATCACCGC CGAGAACGAG
ATGAAGATGG ACGCGACGGA GCCGTCGCAG AACCAGTTCA GCTACGCGGC CGGCGACCAG
ATCGTCAACT GGGCCCGCCA GAACGGCAAG CAGGTCCGTG GCCACGCGCT GGCGTGGCAC
TCCCAGCAGC CCGGGTGGAT GCAGAACATG TCCGGCACGA CCCTGCGCAA CGCCATGCTC
AACCACGTCA CGAAGGTGGC GACGTACTAC AAGGGCAAGA TCTACGCCTG GGACGTGGTG
AACGAGGCCT ACGCCGACGG CTCGTCCGGC GGGCGACGTG ACTCCAACCT GCAGCGCACC
GGCAACGACT GGATCGAGGC GGCGTTCCGC GCCGCTCGCG CCGCCGACCC GCAGGCCAAG
CTCTGCTACA ACGACTACAA CACCGACAAC TGGTCGCACG CCAAGACGCA GGGCGTCTAC
AACATGGTGC GCGACTTCAA GGCCCGCGGT GTCCCGATCG ACTGCGTCGG CTTCCAGGCG
CACTTCAACT CGGGCAACCC CGTGCCGTCG AACTACCACA CCACGCTCGG CAACTTCGCC
GCGCTGGGTG TCGACGTGCA GATCACGGAG CTCGACATCG AGGGCTCCGG CACCTCGCAG
GCCGAGCAGT TCCGGGGCAT CGTCCAGGCG TGCCTCTCGG TCGCCCGCTG CACCGGCATC
ACCGTGTGGG GCGTGAAGGA CTCCGACTCG TGGCGCGCGT CGGGCACCCC GCTGCTGTTC
GACGGCTCGG GCAACAAGAA GGCCGCGTAC ACGTACACGC TGAACGCGCT CAACGCCGGC
GGCACCACCG CGACCCCGGG TGGCGGCACG TCCAGCCCGG CGCCGCAGCC GACATCGAGC
CCGTCTCCCA CGGCCAACCC GACGACGCCC CCGCCCACCA GCGGCACCGG CACCTGCACG
GCGACGTACT CCGAGGGCCA GAAGTGGAGC GACCGCTTCA ACGGCAACGT GACCGTCCGT
GCCAACGGCA ACATCAGCGG CTGGACGACG ACCGTCACGC TGAGCTACCC GCAGTACATC
ACCGCGGCCT GGGGCGGCAC AGCCAGCTGG CCCCAGTCCA ACGTCATGGT GATGCGCGGC
AACGGTGGCC TCGCCAACGG CCAGACCACG ACCTTCGGGT TCACGGTCCA GCACGGCGGC
AACTGGACCT GGCCGACGGT CAGCTGCACG GCCTCCTGA
 
Protein sequence
MTTPHHSRRR ARIAAVGGLS AAALIVTLAV PAQAAGSTLQ AAAAETNRYF GTAMAGHYFN 
NSGTMTITNR EFNMITAENE MKMDATEPSQ NQFSYAAGDQ IVNWARQNGK QVRGHALAWH
SQQPGWMQNM SGTTLRNAML NHVTKVATYY KGKIYAWDVV NEAYADGSSG GRRDSNLQRT
GNDWIEAAFR AARAADPQAK LCYNDYNTDN WSHAKTQGVY NMVRDFKARG VPIDCVGFQA
HFNSGNPVPS NYHTTLGNFA ALGVDVQITE LDIEGSGTSQ AEQFRGIVQA CLSVARCTGI
TVWGVKDSDS WRASGTPLLF DGSGNKKAAY TYTLNALNAG GTTATPGGGT SSPAPQPTSS
PSPTANPTTP PPTSGTGTCT ATYSEGQKWS DRFNGNVTVR ANGNISGWTT TVTLSYPQYI
TAAWGGTASW PQSNVMVMRG NGGLANGQTT TFGFTVQHGG NWTWPTVSCT AS
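
The protein sequence is the translation of the gene sequence, which can be spot-checked in a few lines. The following is a minimal sketch, not the tool that produced this record: it builds the codon table by hand (the amino-acid assignments of translation table 11 are the same as those of the standard code), and the helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
# Codon table in the conventional TCAG ordering. The amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACGACCC CACACCACTC GCGCCGCCGT GCCCGGATCG CGGCCGTCGG CGGGCTCAGC"
last_line = "AACTGGACCT GGCCGACGGT CAGCTGCACG GCCTCCTGA"

print(translate(first_line))  # MTTPHHSRRRARIAAVGGLS
print(translate(last_line))   # NWTWPTVSCTAS
```

The two printed fragments match the start and end of the protein sequence above. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full protein sequence and the listed GC content to be checked in the same way.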