Gene Cfla_0981 details

Gene Information

Locus tag: Cfla_0981
Symbol:
ID: 9144856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 1085570
End bp: 1087426
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003636086
Protein GI: 296128836
COG category:
COG ID:
TIGRFAM ID:
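
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (a common convention that this record does not state explicitly) and assuming the final codon of the CDS is a stop codon. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


n = gene_length(1085570, 1087426)
print(n, protein_length(n))  # 1857 618
```

Under those assumptions, the 1857 bp gene corresponds to 619 codons, one of which is the stop codon, giving the 618 aa listed above.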


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0454363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGGATCA CACGGAAGGC TGCCCTCGCG TCGGTCGCGA CGGTCAGCGT CCTCGCCCTC 
GCGGCCTGCA CCGGTAGCGG CGACGACACG TCGGGTGACG ACGACAACGC GGGGATCAAC
ACCGACACCG CGATCAACAT CGCCTGGAAC CAGCCGTTCT CGTCCTTCAA CGGCGAGAGC
ATCACGGGCA ACGCGACGGC CAACAACATC ATCACGTACA TGGCCAACTC GCGCTTCAAC
GACTACAACG CCGACCTCGA GGTCGTGCCG GACGAGTCGT GGGGCACCTA CGAGAAGGTC
TCCGACGACC CGCTGACCGT CGCGTACACG TACGCCGACA CGGCGAAGTG GTCGGACGGC
GTCTCGGCCG GCCCGGCCGA CCTCCTCCTG GAGTGGGCCG CCCAGAGCGG CAAGTTCAAC
AACGTCGAGC CCGAGTACGA CGACGAGGGC AACGTCACCA ACCAGGACGC GCTCGACGCG
GGTGTCTACT TCGACGCCGC CAGCCCGTCC GTGGCGCTGA TCACGGAGAC CCCGGAGATC
GACGGCGACA CGATCACCCT CGTGTACTCC AAGCCGTTCG CCGACTGGGA GGTCGCGATC
GACAACAACC TCCCCGCGCA CATCGTGGCG CAGCGGGCCC TCGGCATCGA GGACCCCGAG
GAGGCGACCG AGGCCCTGAT CGCCGCGATC ACCGACGAGA ACCTCGAGGA CCTGTCGAAG
ATCGCGAAGG TCTGGAGCGA CGACTGGAAC TTCGCGTCCC TGCCCGACGA CCCGCAGCTG
CTCGTCACCT CGGGCCCGTA CACGATCACG GAGTTCGTCG AGCAGCAGTA CCTGACGCTC
ACGGCGAACC CCGACTACGA GGGCGACAAG AAGGCCGTCT TCGAGAAGGT CACCGTCCGC
TACAACGGCG ACCCGATGGG TCAGGTCCAG GCGCTGCAGA ACGGTGAGGT CGACCTCATC
AGCCCGCAGT CGACGGCGGA CGTCCTCAAG GCGCTCGAGG CGATCGACGG CCTGACCGTC
GAGACGAACG TCGAGGGCAC GTACGAGCAC GTCGACCTGC AGCAGGGCAA CGGCGGTCCG
TTCGACGCCG CGACCTACGG CGGCGACACC GAGAAGGCCC ACAAGATCCG CCAGGCCTTC
CTCAAGACGA TCCCCCGCGA GAAGATCGTC ACCGACCTCA TCCAGCCGCT GAACCCCGAC
GCCGAGGTCC GCAACTCCTT CACGCAGGTG CCCGGCTCGC CGATGTACGA CGGCATCGTC
GAGGCGAACG GCCAGCAGGA CGCCTACGGC GAGGTCGACA TCGAGGGTGC CAAGGCGCTG
CTCGCCGAGG CCGGTGTGCC CAGCGTCCAG GTGCGTCTGC TCTTCGACCC GGACAACACG
CGCCGTGTGA ACCAGTACGA GCTCATCAAG GGCTCGGCCG CCGAGGCCGG CTTCGACGTC
GTCCCCTACA CGGTCCAGAC GGACTGGGGT ACGGACCTGT CGAACGCGCG GTCGTTCTAC
GACGCGGCGC TCTTCGGGTG GCAGTCGACC TCGACCGCCG TCACCGAGTC CGACGCGAAC
TACCGCACCG GCGCGACGAA CAACTACTAC GGGTACTCCA ACCCCGAGGT GGACGCGCTG
TACGACGCGC TGCAGACCGA GACCGACGCC GCCGAGCAGG AGCGGATCCT CGGTGAGGTC
GAGAAGCACC TGGTCGACGA CGCGTTCGGC GTGACGATCT TCCAGCACCC GGGCGTCACC
GCCTGGAACC CGGAGAAGAT CGGCAACGTC CAGAAGCTGG GGATCGCGCC GACGATCTTC
TACGGCTTCT GGGAGTGGAC CGCAGGCGAC GCGGCCACCG AGGGCGCCTC CGAGTGA
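
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The sketch below assumes that GC content means the fraction of G and C bases over all A, C, G and T bases, and that the spaces separating the 10-base groups should be ignored; how the source database rounds the value is not stated. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so the grouped
    layout used in this record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the value reported for the whole gene.
first_line = "TTGAGGATCA CACGGAAGGC TGCCCTCGCG TCGGTCGCGA CGGTCAGCGT CCTCGCCCTC"
print(f"{gc_content(first_line):.0%}")
```

A single line need not match the whole-gene figure, since base composition varies along the sequence.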
 
Protein sequence
MRITRKAALA SVATVSVLAL AACTGSGDDT SGDDDNAGIN TDTAINIAWN QPFSSFNGES 
ITGNATANNI ITYMANSRFN DYNADLEVVP DESWGTYEKV SDDPLTVAYT YADTAKWSDG
VSAGPADLLL EWAAQSGKFN NVEPEYDDEG NVTNQDALDA GVYFDAASPS VALITETPEI
DGDTITLVYS KPFADWEVAI DNNLPAHIVA QRALGIEDPE EATEALIAAI TDENLEDLSK
IAKVWSDDWN FASLPDDPQL LVTSGPYTIT EFVEQQYLTL TANPDYEGDK KAVFEKVTVR
YNGDPMGQVQ ALQNGEVDLI SPQSTADVLK ALEAIDGLTV ETNVEGTYEH VDLQQGNGGP
FDAATYGGDT EKAHKIRQAF LKTIPREKIV TDLIQPLNPD AEVRNSFTQV PGSPMYDGIV
EANGQQDAYG EVDIEGAKAL LAEAGVPSVQ VRLLFDPDNT RRVNQYELIK GSAAEAGFDV
VPYTVQTDWG TDLSNARSFY DAALFGWQST STAVTESDAN YRTGATNNYY GYSNPEVDAL
YDALQTETDA AEQERILGEV EKHLVDDAFG VTIFQHPGVT AWNPEKIGNV QKLGIAPTIF
YGFWEWTAGD AATEGASE
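
The protein sequence can be related to the gene sequence by translating it with translation table 11. A minimal sketch follows, assuming the gene sequence is a complete CDS read in frame from its first base. In table 11 the codon-to-amino-acid assignments are the same as in the standard code, and TTG is one of the permitted start codons, so the initiator codon is read as methionine. The names `CODON_TABLE` and `translate_cds` are illustrative, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which table 11 shares); "*" marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; whitespace in seq is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues:
        # Assumption: the first codon is a valid start codon, read as Met.
        residues[0] = "M"
    if residues and residues[-1] == "*":
        residues.pop()  # drop the terminal stop codon
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGGATCA CACGGAAGGC TGCCCTCGCG"))  # MRITRKAALA
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above. Any internal stop codon would show up as `*` in the result.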