Gene Cfla_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1246 
Symbol 
ID9145125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp1390785 
End bp1392125 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content75% 
IMG OID 
ProductPeptidase M23 
Protein accessionYP_003636345 
Protein GI296129095 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCCC TCCTGGCGGC GCTGTGCGCA CTGGCCGTCG CGCTGCCCGC GCTCGTCGCC 
GGCTCGGCGC CCGCGACGGC CGACGACCTG TCGAACCGGC GCGCGGCCGC CGAGCAGCGG
GCCGCAGCGG CCGAGGAGCG GGCGGCGGAC CTCGCCGAGG CGATCGAGGA GCTCAGCGGT
GAGCTCGCGG CCGCGCTGAC GCGGCTCGCG GAGGTCGAGG CGCAGCTGCC CGTCGCGCAG
GCGGCGCTGG ACGCCGCGGT CGCGGCGGCG CAGGCCGCGC AGCGTGAGGC GGAGATCCTC
GCCGCGCGGC TGCAGGCGGC GCGCGACGAG GAGGCCGCGA TCACCGAGCA GATCGCCGTG
GACGACGCCC GCGAGCAGGA GATCCGCGGT GCCATCGGAC AGATGGCGCG CGAGGCGTAC
AAGGGCGGCC GGGACGTCTC GGGCATGAGC GTCATGCTCG ACGCCGAGAG CTCGCAGGAC
TTCGTCGAGA AGTACGGGCT GGTGTCCACC GCGCTGCGCA CGCAGACGCA GGTGCTCGAC
GAGCTCACGG CCCTCGCGGC GAAGAACCGC AACGCGCAGG CGCGGCAGAC GGCCGTGCGC
GCCAAGGTCG ACGAGCTGAA GGTGGCGGCG GACGCCAAGC TCGCGGAGGC GCGTGAGGCA
CAACGACAGG CCGAGGCGGC GAAGGCCGAG GTCGAGCGGC TCGTCGCGGA GCAGCAGCAG
CGCACGGCGG ACATCGAGTC ACGCAAGGCC GAGGCGCAGG CGCAGGTCGC CGCGAACGAC
GCCGAGCGCG CGGCGGTCGC CGGGGAGCTC GCGGCGATCA TCGAGGCGCA GCGCGTCCAG
CGCGAGAAGG AGGAGGCCGC ACGGCGGGCC GCGGGGCAGA CCGGCGGTGG GGGCGGGGGT
GGCAGTGCGC AGCCGGGCGG TGCGCGGCCC GGGGCGCTGT TCGCGAACCC CACGGCGCAC
AACCCGATCG TCGTGACGTC CGAGTACGGC AACCGCCTGC ACCCCGTCCT GGGGTACTGG
CGCCTGCACG CGGGCATCGA CCTGCGCGAC CGCTGCGGTG AGCCGGTCTA CGCGGGCCGG
GACGGCACCG TGCAGTGGGC GCGCCACCGC TCGGGGTACG GCGGCCAGGT GATGATCGAC
CACGGCTGGG TCAACGGCTC GTCGCTCATG TCGAGCTACA ACCACATGTC GTCCTTCGCG
GTCGGTGGCG GGGCGAACGT CCGTGCCGGT CAGCTCCTCG GGTACGCGGG CAACACGGGC
ACGTCGGCGG CGTGCCACCT GCACTTCGAG GTGTACGTCA ACGGAGCGAC GGTGAACCCG
CGGTCCTACC TGGGGCTGTG A
 
Protein sequence
MRALLAALCA LAVALPALVA GSAPATADDL SNRRAAAEQR AAAAEERAAD LAEAIEELSG 
ELAAALTRLA EVEAQLPVAQ AALDAAVAAA QAAQREAEIL AARLQAARDE EAAITEQIAV
DDAREQEIRG AIGQMAREAY KGGRDVSGMS VMLDAESSQD FVEKYGLVST ALRTQTQVLD
ELTALAAKNR NAQARQTAVR AKVDELKVAA DAKLAEAREA QRQAEAAKAE VERLVAEQQQ
RTADIESRKA EAQAQVAAND AERAAVAGEL AAIIEAQRVQ REKEEAARRA AGQTGGGGGG
GSAQPGGARP GALFANPTAH NPIVVTSEYG NRLHPVLGYW RLHAGIDLRD RCGEPVYAGR
DGTVQWARHR SGYGGQVMID HGWVNGSSLM SSYNHMSSFA VGGGANVRAG QLLGYAGNTG
TSAACHLHFE VYVNGATVNP RSYLGL