Gene Cfla_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0501 
Symbol 
ID9144368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp533260 
End bp534561 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003635614 
Protein GI296128364 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000488029 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000184856 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGCTTC GCCACCTCGC AGCAGCGTCC GTCCCCGTCC TCCTGCTCGC GGCCTGCTCG 
AGCGGCGGCG ACGCCGCGGA CGACGGGCCC GTCACGCTCA CGTACTGGGC CAGCAACCAG
GGCACGAGCC TCGACCACGA CAAGGAGGTC CTCACGCCCG TCCTGGAGGA CTTCACGGAG
CGCACGGGCG TCGAGGTCGA CCTGGAGGTC ATCGGCTGGA GCGACCTGCA GACGCGCATC
CAGACGGCCG TCACGTCCGG CCAGGCGCCC GACGTGGTCA ACATCGGCAA CACGTGGGCG
GTGTCCCTGC AGGCCACGGG CGCGTTCCTG CCGCTCGACG ACGCGGCGAT GGACGCGATC
GGTGGCGCCG ACAAGTTCGT GGCGACCGCG CTGGAGACCG GCGGTGCGCC GGGGACCGAC
CCGACGTCGG TGCCGCTGTA CGGGCTGGCG TACGGGCTCT ACTACAACAC GGCGATGTTC
GCCGACGCAG GGCTGCAGCC GCCGACGACG TGGGAGGAGA TGGTCGCCGC CGCGCAGGCG
CTCACCGACC CCGCGGCGGG CGTGTACGGC ATGGCGCTGG CGGCCGGCTC GTACACGGAG
AACAACCACT TCGCGTTCAT CAACGCCACG CAGAACGGCG CCGAGCTGTT CGACGCCGAC
GGCAACCCGA CGTTCACGGG CGACGGCGTC GTCGACGGCA TCGTGCGCTA CCTCGACCTC
ATGCAGGACG CCGGCGCGGT GAACCCCGCG AACGCGCAGT ACGACAACGC GTCGTTCGCG
GCGGCGGACT TCGCGAACGG CAAGGCCGCG ATGATCCTCA ACCAGAGCAA CGCGGGCGCG
ACCATCGAGG CGAACGGCAT GGCGCCCGAC GCGTACGGCG TCGTCCCGTT CCCGGCACCG
CAGGACGCCG TGAGCGACGT CGCGAGCCAC GTCGCGGGCA TCAACGTGTC GGTCTTCGGC
AACACCGAGC ACCCCGACGA GGCGCTGCAG CTCGTCGAGC ACCTGACGAG CGCGGACGTG
CAGACCACGC TGGGCAGGCC GTTCTCGTCG CTCCCGGTGC TGAAGGACGC GACGGCGGCG
TTCACGGACG ACGCCGAGCT GGCCGCGATC TTCACGGAGA TCTACAACGA GCGCTCCGCA
CCGCTGCCCC TGGTGCCCGC GGAGGACCAG TTCGAGACGA CGGTCGGCAA GGCGATGAAC
GCGATGTTCG CGACCATCGC CACGGGCGGC ACGGTCACCG CGGACGACGT GCGTGAGGCG
ATGCAGACCG CGCAGGACCA GGTGCAGGCG TCGGTCGGCT GA
 
Protein sequence
MKLRHLAAAS VPVLLLAACS SGGDAADDGP VTLTYWASNQ GTSLDHDKEV LTPVLEDFTE 
RTGVEVDLEV IGWSDLQTRI QTAVTSGQAP DVVNIGNTWA VSLQATGAFL PLDDAAMDAI
GGADKFVATA LETGGAPGTD PTSVPLYGLA YGLYYNTAMF ADAGLQPPTT WEEMVAAAQA
LTDPAAGVYG MALAAGSYTE NNHFAFINAT QNGAELFDAD GNPTFTGDGV VDGIVRYLDL
MQDAGAVNPA NAQYDNASFA AADFANGKAA MILNQSNAGA TIEANGMAPD AYGVVPFPAP
QDAVSDVASH VAGINVSVFG NTEHPDEALQ LVEHLTSADV QTTLGRPFSS LPVLKDATAA
FTDDAELAAI FTEIYNERSA PLPLVPAEDQ FETTVGKAMN AMFATIATGG TVTADDVREA
MQTAQDQVQA SVG