Gene Cfla_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3052 
Symbol 
ID9146964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3397280 
End bp3398563 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content70% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003638134 
Protein GI296130884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.854825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.208902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCAA GGAAGGCCCT CGCGGCCTGC GCGACGGTGC TGATGAGCGC GCTCGCGCTG 
ACCGCGTGCG CGGGTTCTGA CGACGGCGGC AGCTCGGAGG GCGCGAGCGA GCTGGTGTTC
TGGCACAACT CCACCACCGG TGACGGCAAG GCGTACTGGG AGGAGGTCGG CGCGGCGTTC
GAGGAGGAGA CCGGCATCAA GGTCGCCATC CAGTCCATCC AGAACGAGGA CATGGACGGC
AAGCTGCAGA CGGCCCTCAA CGGCGGCGAC GCCCCGGACG TCTTCATGTC CCGCGGCGGC
GGCAAGCTGG CCGCGGTCGT CGAGGCGGGC CAGGCCATGG ACCTCACCGA CCTCATCGAC
GACGACGTGC GCGCGGCGGC CGGTGGCTCG CTCGACGCGT TCTCGGTCGA CGGCAAGGTC
TACGGCATGC CCACCGCCGT GCTCCCCGGC GGCATCTGGT ACTCGAAGGA CCTCTTCGAG
CAGGCCGGCA TCACCGAGAC GCCCACGACC ATGGGTGACC TCGAGGACGC GGTCGGCAAG
CTCAAGGACG CCGGCATCCA GCCGATCGCG CTCGGTGCCA AGGACGCGTG GCCCGCGGCC
CACTGGTACT ACTTCTTCGC GCTGCGCGCG TGCGCCCAGG ACACCATCAC GGACGCCGCC
GCCGAGATGA ACTTCGACGA CCCGTGCTGG GTCAAGGCCG GCGAGGCCTT CGAGGAGTTC
GCCTCGATCG AGCCGTTCAA CAACGGCTTC CTCACCACGA CCGCCCAGCA GGGCGCCGGC
TCCTCGGCCG GTCTCCTCGC CAACAAGCAG GCGGCGATGG AGCTCATGGG TGCCTGGAAC
CCGGGCGTCA TCGCGGGCCT GACGCCCGAC GGCGAGCCGC TCGCGGACCT CGGCTGGTTC
CCCTTCCCGG CGGTCGACGG CGGTGACGGC GACCCCACGG CCATGATGGG CGGCGTCGAC
GGCTACAGCT GCTTCGTGGA CGCCCCGAAG GAGTGCGCCG ACTTCCTCAA CTTCTACATG
AAGAAGGAGT GGCAGGAGGG CTACGCGGAG GCGTTCGTCA CCATCCCGGC CAGCAAGGAC
GCGCAGGCGG CCGTCACCGA CCCGGCCCTC ACGCAGGTCC TCGAGGCGTA CAACGGTGCG
GCCTACGTGT CGGTGTGGCT GGACACGCTG TTCGGCAACA ACGTCGGCAA CGCCCTGAAC
ACGTCGGTCG TCGAGATGCT CGCGGGCAGC GGCGACGCCG AGAGCATCGT CGCCACGGTC
AAGTCCGCGG CAGCCAAGGA GTAA
 
Protein sequence
MKARKALAAC ATVLMSALAL TACAGSDDGG SSEGASELVF WHNSTTGDGK AYWEEVGAAF 
EEETGIKVAI QSIQNEDMDG KLQTALNGGD APDVFMSRGG GKLAAVVEAG QAMDLTDLID
DDVRAAAGGS LDAFSVDGKV YGMPTAVLPG GIWYSKDLFE QAGITETPTT MGDLEDAVGK
LKDAGIQPIA LGAKDAWPAA HWYYFFALRA CAQDTITDAA AEMNFDDPCW VKAGEAFEEF
ASIEPFNNGF LTTTAQQGAG SSAGLLANKQ AAMELMGAWN PGVIAGLTPD GEPLADLGWF
PFPAVDGGDG DPTAMMGGVD GYSCFVDAPK ECADFLNFYM KKEWQEGYAE AFVTIPASKD
AQAAVTDPAL TQVLEAYNGA AYVSVWLDTL FGNNVGNALN TSVVEMLAGS GDAESIVATV
KSAAAKE