Gene Cfla_0141 details

Gene Information

Locus tag: Cfla_0141
Symbol:
ID: 9144006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 172225
End bp: 173799
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: peptidase M28
Protein accession: YP_003635259
Protein GI: 296128009
COG category:
COG ID:
TIGRFAM ID:
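
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record states neither convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 172225
end_bp = 173799
gene_length_bp = 1575
protein_length_aa = 524

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not appear in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, 1575 bp corresponds to 525 codons, that is, 524 residues plus one stop codon.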


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.481582
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGCTC GACGTACCAC CGTGCTCGCC GGCACGCTCG CGACCCTCGT GCTCGGGACC 
AGTGCACTGC TCGCCCCACC CGCCCTCGCC CACGGGCCCG GTGGCAGGCC CGGGCCGGGG
CACGGCCCGG GCCACGCGGT CGACGCGGAG AGGTTCGCCC GGCAGGTGAC GACGCGCGGC
GTCTGGCGGC ACCTCGAGGA GCTCCAGCGC ATCGCGGACC GGCACGACGG CAACCGTGCT
GCCCTCACCG AGGGCTACGA GGCGAGTGCG CGCTACGTCG AGCGGACGCT GAAGCGCGCC
GGCTACGAGG TCACGCGCGA CCCGTTCACG TTCGGCTTCG AGGTCATCGA CGCCGAGGCG
CTCACGCTGG GCACGGGCGA GACGTTCGCG GTCGACCAGA TGCAGTACGC GCCGAGCACG
GCGGAGGGCG GCGTGACGGC GCCCGGGTCG GTGCCGAACG ACGTCACGGG CTGCACGGCC
GACTCGTGGG CGGGCGTCGA GGCGACCGGG ACGATCGCGG TCATCAGCCG CGGCGCGTGC
TCGTTCGCCG AGAAGGCGAT CGCGGCGCAG GCCGCGGGTG CGATCGGCGC CGTCGTCTAC
AACAACGTCG AGGAGATGCT GTTCGGCACG CTCGGCGAGG AGGGGCTCGT CGACATCCCC
GTCGCGGGCG CCGGCCAGGC CGACGGGGCG GCGATCGTCG CGGCCGTCGC CGCCGGGACG
CCGCTGACGC TCGAGGTCCG CTACCACGTC GAGGAGGAGG AGAGCTTCAA CGTCATCGCG
GAGACGAAGG CCGGGCGCGA CGACAACGTC GTCGTGCTGG GCGCGCACCT CGACGGCGTG
GAGGACGGCC CCGGCCTCAA CGACAACGGG TCGGGCTCGG CGGTGCTGCT GGAGGTCGCC
GTCCAGCTCG CGAAGCAGAA GAAGCTCAAC AACACCGTGC GGTTCGCGTG GTGGGGCGCC
GAGGAGCTCG GGCTGATCGG CTCGACCGCG TACGTCGAGG AGCTCGCGGG CCAGGAGGGC
GAGCTCGACC GCATCGCCAC GTACCTCAAC TTCGACATGG TCGGCTCGCC GAACTACGTC
ATCGGCGTGT ACGACGCGGA CCAGTCGACG TACGAGGCGC CGGTGGACGT CCCGCCGGGC
TCGGCGGAGA CGGAGGCGGT CTTCACCGGC TACTTCGACT CCCGCGACCA GGCGTGGGTC
GACACCGAGT TCTCCGGCCG GTCCGACTAC CAGGCGTTCA TCCTCAACGG CGTCCCCGCG
TCCGGCCTCT TCACGGGCGC GGACGACATC AAGACCGACG AGGAGGTCGC GCTGTTCGGC
GGCACGGCCG GCATCCGGCA CGACCCGAAC TACCACACGC CGGCGGACGA CCTGTCCAAC
GTGAGCCGCG AGGCGATCGG GATCATGGCG CCGGCGGTCG CGTTCGCGAC GGCGAGCCTC
GCGACGGACA CGTCGGCGAT CAACGGGGTC TCGGGCCCGG GCGACCAGGG GCACCACCAC
GGGCCGTCGC ACCACGGCAA GGGCCGTGCG CCGCACCACG GGCACGAGCA CGGAGGCCTG
CTGAAGGCGT CGTGA
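
The GC content reported in the record can be recomputed from the gene sequence listing. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the listing is embedded here. Pasting the full listing into `first_line` would give the figure for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to lay the sequence out in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above.
first_line = "GTGGCAGCTC GACGTACCAC CGTGCTCGCC GGCACGCTCG CGACCCTCGT GCTCGGGACC"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction", f"{gc_fraction(seq):.1%}")
```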
 
Protein sequence
MAARRTTVLA GTLATLVLGT SALLAPPALA HGPGGRPGPG HGPGHAVDAE RFARQVTTRG 
VWRHLEELQR IADRHDGNRA ALTEGYEASA RYVERTLKRA GYEVTRDPFT FGFEVIDAEA
LTLGTGETFA VDQMQYAPST AEGGVTAPGS VPNDVTGCTA DSWAGVEATG TIAVISRGAC
SFAEKAIAAQ AAGAIGAVVY NNVEEMLFGT LGEEGLVDIP VAGAGQADGA AIVAAVAAGT
PLTLEVRYHV EEEESFNVIA ETKAGRDDNV VVLGAHLDGV EDGPGLNDNG SGSAVLLEVA
VQLAKQKKLN NTVRFAWWGA EELGLIGSTA YVEELAGQEG ELDRIATYLN FDMVGSPNYV
IGVYDADQST YEAPVDVPPG SAETEAVFTG YFDSRDQAWV DTEFSGRSDY QAFILNGVPA
SGLFTGADDI KTDEEVALFG GTAGIRHDPN YHTPADDLSN VSREAIGIMA PAVAFATASL
ATDTSAINGV SGPGDQGHHH GPSHHGKGRA PHHGHEHGGL LKAS
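
The protein sequence follows from the gene sequence under translation table 11, where an alternative initiation codon such as the leading GTG is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. `translate_cds` is a hypothetical helper written for this example, and it assumes the input is a complete reading frame starting at the initiation codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons permitted by table 11; any of them is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_30 = "GTGGCAGCTC GACGTACCAC CGTGCTCGCC"
print(translate_cds(first_30))
```

The printed residues can be compared against the first block of the protein sequence listing, MAARRTTVLA.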