Gene Cfla_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2998 
Symbol 
ID9146910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3323634 
End bp3325292 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003638080 
Protein GI296130830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.58673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.432395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACGA CGAAGAAGCG GCCCCTCGCG CTCGCCGCGA CCGCCGCGAC CCTCGCGCTG 
GCCCTGGCGG CCTGCTCGGG CGGGTCGGAC GACGAGACGG ACGACGCCCC CGAGGCGAGC
GAGCTCGGTC AGGTCGGTGC GATGGAGGAC TACGGCGTCG GCACGACGTT CGTGGCCACC
GAGCCGGTGA GCTTCGGCCT GATGTACCGC GACCACCCGA ACTACCCGCT CAAGGAGGAC
TGGGACATCC TCACGAAGCT CGAGGAGAAC CAGAAGGTCA CCTTCGAGAT GCAGACCGCC
CCGCTGTCCG ACTGGCAGCA GGCGCAGTCG ATCGCGATCG GCGCGGGCAA CGCCCCGGAC
ATCATCTCCG TGACCTACCC CGGGCAGGAG GTGGCCTTCG TCGCCGGCGG TGCGATCCTG
CCCGTGAGCG ACTACGTCGA GCACATGCCG AACTACCTCG ACAAGGTCGA GAAGTGGGGC
CTGGAGGCCG ACATCGACCG GATGCGCCAG CAGGACGGCA AGTACTACGT GCTGCCCGGC
CTGCGCGAGT CGGTCCGTCC CTCGTACACG TACGCGGTGC GCAAGGACGT CTGGGAGCAG
CTCGGCCTGA GCCTGGAGCC GGAGACCTTC GAGGACTTCG CCGCCGACCT GGCGAAGGTC
AAGGCCGCGT ACCCCGACCT GTACCCCCTG TCCGACCGCT GGTCGGCCAA CGGTCCGCTC
GAGGCCACTC TCAACGTCGC CGCGTCGAAC TTCGGCACGG CCGCCGGCTG GGGCTACGGC
GAGGGCACCT GGTGGGACGA GGACGCGGGC GAGTTCGTCT ACACCGGCGC CATGGACGAG
TACCGCGAGC TGCTCGAGTA CTACCACGGC CTCATCGCCG ACGGGCTCAT GGACCCCGAG
AGCCTCACGC AGGAGGACGA CCAGGCCATC CAGAAGATGG CGTCGGGCCA GACCTTCGCC
CAGCTGACGA ACGACCAGGA GATCCTCAAG GTCCGGACCG CCATGACCGA GGTCGGCACG
CAGGGCGAGG TCGCCATGAT CCGCGTCCCC GCCGGCCCCG CCGGTGACGT CCTGGCCGGT
TCGCGCCTCG TCAGCGGTCT CATGCTGTCC TCGTCGGCCG CCGAGGAGGA CGACTTCCTC
GCGATGCTGC AGTTCATCGA CTGGCTGTAC TACTCCGACG AGGGCCTGGA GTTCGCCAAG
TGGGGTGTCG AGGGTGAGAC CTTCACGCGC GAGGGCGACA AGCGCGTGCT CATGCCGGAC
ATCGACCAGA ACGGCCTGAA CCCGGGCGCG CCGAAGGCGC TCAACGTCGA CTACGGCTAC
CACAACGGCG TGTGGATGCT CGAGCACGGC TCGTCGGACG AGCTGGACCG GTCGATGCTG
CGTGACGAGG TCGTCGAGTT CGTCGAGTCC ATGAGCGACA AGGAGCTCGC CCCGGTCTCG
CCGCCCGCAC CGCTGGACGA GCTCGAGCGT GAGCAGGTCT CGCTCTGGCA GACCGCGCTG
CGCGACCACG TGCTGCAGAA CACCGCCGCG TTCATCCTCG GCCAGCGCGA CCTGTCCGAG
TGGGACGCGT ACGTCGCCGA GCTCGAGGGC AAGAACATGC AGCAGTACCT CGACGTGGTG
AACGCCGCGC AGGAGCGGTT CGCCGAGCAG AACGGCTGA
 
Protein sequence
MRTTKKRPLA LAATAATLAL ALAACSGGSD DETDDAPEAS ELGQVGAMED YGVGTTFVAT 
EPVSFGLMYR DHPNYPLKED WDILTKLEEN QKVTFEMQTA PLSDWQQAQS IAIGAGNAPD
IISVTYPGQE VAFVAGGAIL PVSDYVEHMP NYLDKVEKWG LEADIDRMRQ QDGKYYVLPG
LRESVRPSYT YAVRKDVWEQ LGLSLEPETF EDFAADLAKV KAAYPDLYPL SDRWSANGPL
EATLNVAASN FGTAAGWGYG EGTWWDEDAG EFVYTGAMDE YRELLEYYHG LIADGLMDPE
SLTQEDDQAI QKMASGQTFA QLTNDQEILK VRTAMTEVGT QGEVAMIRVP AGPAGDVLAG
SRLVSGLMLS SSAAEEDDFL AMLQFIDWLY YSDEGLEFAK WGVEGETFTR EGDKRVLMPD
IDQNGLNPGA PKALNVDYGY HNGVWMLEHG SSDELDRSML RDEVVEFVES MSDKELAPVS
PPAPLDELER EQVSLWQTAL RDHVLQNTAA FILGQRDLSE WDAYVAELEG KNMQQYLDVV
NAAQERFAEQ NG