Gene Cfla_1914 details


Gene Information

Locus tag: Cfla_1914
Symbol:
ID: 9145807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2128043
End bp: 2129344
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003637009
Protein GI: 296129759
COG category:
COG ID:
TIGRFAM ID:
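The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(2128043, 2129344)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both results agree with the Gene Length and Protein Length values listed in the record.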


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.29836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGATC CCGCGCATGA CGTCCGACTG CGGAGCGACG GCCCCGACGA GCTGTTCGCG 
GCCGCGCGCC GGCGCACGGT CGCGGTGCTC GCCTGCGCCG GGGTGCTCGG TGGGCTCGCC
GCGGGCACGG TCGTCTCGGT CGGCTCGCTG CTGGCCGTCG AGCTGTCCGG CGCGGACCGC
TGGGCGGGGT CGGTGACGAC GGCGGCGACC CTGGGGGCGG CCGTCGCCTC GATCGGGCTC
GCGCGTCTGG CGGTGGCGCA CGGTCGGCGT CGTGCGCTGT CGACGGGCCT CGCGCTCGCC
GCGACGGGCG CGACCGGCGT CGTCGTCGCG GCCGTCGTGG CGAGCTTTCC GCTGCTGCTG
GTCGCCGGCG TGCTCCTGGG CGTCGGGTCG GCCGTCAACC TGCAGTCGCG CTTCGCCGCC
ACGGACCTGT CGACGCCGTC CACGCGCGCG CGTGACCTGT CGCTCGTCGT GTGGGCGGGC
ACCGTCGGGG CCGTCGTCGG GCCGAACCTC GTCGGGCTGA ACGAGCCGGT GTCCCGCCTC
ACGGGCCTGC CGGAGCACGC GCCCGTCTTC GTGGTCTCGA CGCTCGGCAT GCTGGCCGCG
CTGATCGTCG TGCAGGCCGG CCTGCGCCCT GACCCGCTCG ACGTCGCGGG GCGTACGGGT
GCCGCGACCG GCCGGCGGCG CCACGTCCCG CTGCGTGTCG CGGTCGCCGT GCTACGGCGG
CACCCCCAGG CCCTGGGTGC GCTGCTCGGC GTCCTGGTGG CGCACGGCGT GATGATCGCG
GTGATGTCCA CGACCCCGGT CCACATGGAG GGCCACGGCG CGTCGATCTC GCTCGTCGGG
CTCACGGTGA GCCTGCACCT CGCCGCGATG TTCGCGTTGT CACCGCTGAT CGGCCTCCTC
GCCGACCGGG TCGGCGCCGG ACGCGCGCTG CTCGCGGGTC TCGCCGTCGT CGTCGCCGCG
TGCGCGGTGT GCGCCACGGC CCATGGTGAC CACGTGCGCG TGACCGTCGG CCTCGTGCTG
CTCGGCCTGG GGTGGTCGGT CGCGACGGTC GCGGGCTCGA GCCTCGTCGC GGCCGCGGTG
CCCGGCGCCG AGCGCGTGGC GGTCCAGGGG CTGTCCGACG CGGCGATGTC TCTCGCCGGT
GCCGGCGGCG GCGCGCTGGC CGGGGTGTGG CTGGAGGTGG TCGGCTACGG CGGCCTGGCC
GCGGTGTCCG GTGCCGTCAC CGTGGCGGGG GTGCTGGCGG TCCTCGTCGT CGTCCGCGGC
CGGGTGCCGG TCGCGGACGC GAGCGCACTG CTGCCGCGCT GA
 
Protein sequence
MSDPAHDVRL RSDGPDELFA AARRRTVAVL ACAGVLGGLA AGTVVSVGSL LAVELSGADR 
WAGSVTTAAT LGAAVASIGL ARLAVAHGRR RALSTGLALA ATGATGVVVA AVVASFPLLL
VAGVLLGVGS AVNLQSRFAA TDLSTPSTRA RDLSLVVWAG TVGAVVGPNL VGLNEPVSRL
TGLPEHAPVF VVSTLGMLAA LIVVQAGLRP DPLDVAGRTG AATGRRRHVP LRVAVAVLRR
HPQALGALLG VLVAHGVMIA VMSTTPVHME GHGASISLVG LTVSLHLAAM FALSPLIGLL
ADRVGAGRAL LAGLAVVVAA CAVCATAHGD HVRVTVGLVL LGLGWSVATV AGSSLVAAAV
PGAERVAVQG LSDAAMSLAG AGGGALAGVW LEVVGYGGLA AVSGAVTVAG VLAVLVVVRG
RVPVADASAL LPR
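
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a minimal sketch rather than the method the database itself uses: it assumes NCBI translation table 11, which shares the standard codon-to-amino-acid assignments and allows alternative start codons such as GTG to be read as methionine at the first position of a CDS. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; at the start of a CDS they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # alternative start codons are read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "GTGAGCGATC CCGCGCATGA CGTCCGACTG CGGAGCGACG GCCCCGACGA GCTGTTCGCG"
print(translate(first_row))  # MSDPAHDVRLRSDGPDELFA
```

The printed residues match the first two blocks of the protein sequence above (MSDPAHDVRL RSDGPDELFA). Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field of the record.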