Gene Cfla_3195 details


Gene Information

Locus tag: Cfla_3195
Symbol:
ID: 9147111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3556230
End bp: 3557570
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003638276
Protein GI: 296131026
COG category:
COG ID:
TIGRFAM ID:
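
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based inclusive coordinates and the CDS is taken to end in a stop codon. Both are assumptions, as the record does not state them. A minimal sketch of that arithmetic, with hypothetical function names that are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3556230, 3557570)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Both results match the Gene Length and Protein Length fields of this record.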


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000039587
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCTGCAGC CCTACCGGGA CGTCCTCGCG CGTCCTGGCG CACTCGCGTT CTCCGGCACG 
GGACTCGTCG CCCGGCTCCC GATGTCCATG GTCGGCATCG GGATCGTCCT GCTGATCTCC
GCCCAGTACG GCTCCTACGG GCTGGCAGGC CGCGTCTCCG CGGCGCTCGT GCTCGCGCAG
GCCGTCTGCG GGCCGCAGCT CGCGCGGCTG ATCGACCGGC ACGGCCAGGC GCGCGTCATG
CGGCCCGCGC TCGTCGTGTC CGCCATGAGC CTCACCGCGC TCGTCGTCGC CGCGTCGCAG
CACGCGCCGT CGGCCTGGCT CTACCTCCCC GCCGTGCTCA CCGGCGCGAC CATCGGCTCG
TTCGGGTCGC TCGTGCGCGC CCGCTGGAAC CACGCCCTCG GCACCGACCC GCGGCGCATC
CACACCGCGT ACTCGCTGGA GTCCGCGTTC GACGAGCTCG TGTTCGTCGT CGGGCCCGTC
GCCGCGACCC TGCTGGCCAC CAGCGTGTCA CCCGTCGCCG GGCTCGTCGT GCCGGTCGTC
GCGATGGTCG TCGGGGGGCT CGCGTTCCTC TCGCTGCGCG GCACCGAGCC CCCGCCGACC
TCGGCGGCCG GGCCGCGCCC CCGCGGCAGC GTGCTCGCCC TGCCGGGCAT GGTCGCGATC
GTCCTCGTCT TCGTCGCGAT CGGCTCGATC TTCGGCGCCA CCGACGTCGC GACCGTCGCC
TTCGCCGAGG AGTCCGGTCG CCAGGAGCTG GCCGGCGTGA TCCTCGCGGT CTTCGCCCTC
GGCTCGCTCA TCTCCGGCCT GCTGTACGGC GCGCGGCACT GGGTGTCCGC GCTGCACCGG
CGCTTCGCCA TCGGCGTCGT CGCGCTCGCC GTCGGGGTGT GCGCGTTCTT CCTCGCCCAG
TCCCTGTGGG TGCTCGCGGC CGCGATGTTC GTCGTCGGGT TCGCGATCGC GCCGTCGATC
ATCAACGGCA ACGCGCTGGT CGCGGAGCTC GTGCCGAGCG GTCGGCTCAC CGAGGGCCTC
ACCTGGGTCA GCACCGGGCT GAGCATCGGC GTGTCCGTCG GTGCGTCGGT CGCCGGGACG
CGCATCGACG CGGACGGCTC GCACGGTGGG TTCCTCGTCG TCGTCGTCTC CGGGGCCGCC
GCGCTCGTCG CGACCTTCGG CGCGCTGCCG TCCCTCGGGA GGCACGCGGC CCGCCACGCC
GACCGCGTCC GCACCGCGCA CCACGCCGAC GCCCGCGCGA CCGAGGACGA GCCGCCCAGC
CCCACCGCCG GCGCCACCGT CGCGGCGTGC GAGCTCGCGA CGGACCGCAC CGCCGACCTG
CCGGACGACG CCCGGAGCTG A
 
Protein sequence
MLQPYRDVLA RPGALAFSGT GLVARLPMSM VGIGIVLLIS AQYGSYGLAG RVSAALVLAQ 
AVCGPQLARL IDRHGQARVM RPALVVSAMS LTALVVAASQ HAPSAWLYLP AVLTGATIGS
FGSLVRARWN HALGTDPRRI HTAYSLESAF DELVFVVGPV AATLLATSVS PVAGLVVPVV
AMVVGGLAFL SLRGTEPPPT SAAGPRPRGS VLALPGMVAI VLVFVAIGSI FGATDVATVA
FAEESGRQEL AGVILAVFAL GSLISGLLYG ARHWVSALHR RFAIGVVALA VGVCAFFLAQ
SLWVLAAAMF VVGFAIAPSI INGNALVAEL VPSGRLTEGL TWVSTGLSIG VSVGASVAGT
RIDADGSHGG FLVVVVSGAA ALVATFGALP SLGRHAARHA DRVRTAHHAD ARATEDEPPS
PTAGATVAAC ELATDRTADL PDDARS
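
The protein sequence can be reproduced from the gene sequence by removing the whitespace between the 10-base blocks and translating codon by codon. The sketch below assumes the codon assignments of NCBI translation table 11, which match the standard table for every codon and differ only in the set of permitted start codons. The helper names are illustrative, and the GC calculation is a plain G+C count that may not match how the database rounds its reported value.

```python
from itertools import product

# Codon-to-residue assignments shared by NCBI tables 1 and 11,
# enumerated in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and newlines used for 10-base block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons and strip a trailing stop, if present.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    residues = [CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3)]
    return "".join(residues).rstrip("*")


first_line = "ATGCTGCAGC CCTACCGGGA CGTCCTCGCG CGTCCTGGCG CACTCGCGTT CTCCGGCACG"
last_line = "CCGGACGACG CCCGGAGCTG A"
print(translate(first_line))  # MLQPYRDVLARPGALAFSGT
print(translate(last_line))   # PDDARS
```

The two printed fragments correspond to the first 20 and last 6 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.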