Gene Cfla_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3547 
Symbol 
ID9147463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3936694 
End bp3937824 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content72% 
IMG OID 
ProductEndo-1,4-beta-xylanase 
Protein accessionYP_003638618 
Protein GI296131368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0132087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAC CGAGGCTCGT CCGGGACGGG GCCGCGCTGG GCGTCGCCGC CGCCCTCGCC 
GTCGGCGGTT GCACCGCCGG CGAGACGTCC CCCACGCCCA GCCCCACGCC GACCCCCGCG
TCGACAGCCG ACGTGGCGCT GCGCGACGTC GCCCGGGACG GCCTGGCCGT CGGTGTGGCC
GTCGCGGGCG GCGGGCACTA CGCGGCGTCC GGCTACCCCG ACCCGTTCGG TGCGGACGAG
GCGTACCGCG ACGTCATCGC CGAGCAGTTC TCGTCCGTGA CCCACGAGAA CCAGCTGAAG
TGGGAGTTCG TCCGGCCCAC GCGCGACGAG TTCCGGTTCG AGGGCGCCGA CGCGGTGATC
GAGTTCGCCG AGGAGAACGG CCAGGTGGTG CGCGGGCACA CGCTGCTGTG GCACTCGCAG
AACCCGCGCT GGCTGACGAG CGGCGAGTTC ACCGACGACG AGATGCGGGC CCTGCTGCAG
GAGCACATCG CCACCGTCGT CGGCCGGTAC AAGGGCCGGA TCGTGCACTG GGACGTCGCC
AACGAGATCT TCGACGACTC CGGCGTGCTG CGCACCGAGG AGAACCCGTT CCTCGCGCGG
TTCGGCACGG ACATCGTCGC CGACGCCCTG CGCTGGGCCC ACGAGGCCGA CCCCGACGCG
GTGCTGTACC TCAACGACTT CAACGTCGAG TCGATCGGCC GCAAGTCCGA CGCGTACTAC
GCACTCGCCC AGGAGCTGCT GGCGCAGGGC GTCCCGCTGC ACGGGTTCGG CGTGCAGGGG
CACCTGTCGA CGCAGTACCC GTTCCCGGAC GACCTCGAGG ACAACCTGCG ACGGTTCACC
GACCTGGGCC TGGAGGTCGC GATCACCGAG CTCGACGTGC GCGTGCCCGT CGACGCCGAG
GGCAAGCCCG ACGACGTCGA CGTCGACAAG CAGGTCGACT ACTACCGGCG GGCCGTCGGG
GCGTGCGTCG CGGTCGAGCG CTGCACGTCG CTGACCCTGT GGGGCGTGAC CGACGCCTAC
TCGTGGGTGC CCGGCTTCTT CACCGGCGAG GGCTCCGCCC TGGTCCTCGA CGAGGACTTC
CACGCCAAGC CCGCGTTCAC GGCCGTCGCG GAGGCGCTGG CCGGCGAGTA A
 
Protein sequence
MRAPRLVRDG AALGVAAALA VGGCTAGETS PTPSPTPTPA STADVALRDV ARDGLAVGVA 
VAGGGHYAAS GYPDPFGADE AYRDVIAEQF SSVTHENQLK WEFVRPTRDE FRFEGADAVI
EFAEENGQVV RGHTLLWHSQ NPRWLTSGEF TDDEMRALLQ EHIATVVGRY KGRIVHWDVA
NEIFDDSGVL RTEENPFLAR FGTDIVADAL RWAHEADPDA VLYLNDFNVE SIGRKSDAYY
ALAQELLAQG VPLHGFGVQG HLSTQYPFPD DLEDNLRRFT DLGLEVAITE LDVRVPVDAE
GKPDDVDVDK QVDYYRRAVG ACVAVERCTS LTLWGVTDAY SWVPGFFTGE GSALVLDEDF
HAKPAFTAVA EALAGE