Gene Cfla_2399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2399 
Symbol 
ID9146302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2690367 
End bp2691668 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003637488 
Protein GI296130238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00275508 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGCAAGA CCACACGCAA GGCGTGGGCG CTCGCAGCGG GTGTGACGAG CATTGCGCTC 
GTCGCGACCG CCTGCTCCTC GAGCGACGAC CCGGGCGAGG GTGACGAGAC CGCGGACGGC
GGCAACATCA CCCTCACGGT CGCCACCTTC AACGAGTTCG GCTACACGGA CGAGATGTTC
GACCGGTACG AGGCCGAGCA CCCCGGCGTC ACGATCGAGC AGAAGGTCGC CGCCACCTCG
AACGAGGCGC GCGAGAACCT CAACACGCGT CTGGCGGCCG GCTCCGGCAC CGCCGACATC
GAGGCGATCG AGGTCGACTG GCTGCCCGAG CTCCTGCAGT ACCCGGACTA CTTCGAGGAC
CTGTCCTCCC CCGAGGTCGA GGGTCGCTGG CTCGACTGGA AGGTCCAGCA GGCCACGACG
GCCGACGGCA AGCTCATCGG CTACGGCACG GACATCGGCC CGGAGGGCAT CGCCTACCGC
GCCGACCTGT TCCAGGCCGC CGGCCTGCCG GCCGACCGCG AGGCCGTCGC CGAGCTCTTC
GGTGGCGAGA GCGCGACGTG GGAGAAGTTC TTCGAGGTGG GCAAGACCTA CACCTCCGCG
ACGGGCAAGC CCTTCTTCGA CTCCGCGGCC GCCATCTACC AGGGCATGGT CAACCAGGAG
GAGGCGGCCT ACGAGGACCC CGACTCGGGT GACGTCATCG CGCTGGAGAA CCCGCGCGTC
AAGGAGATGT ACGAGCAGGT CACCACGGCC GCCGTGGGCG ACAACCTGTC CGCCCACTTC
GAGCAGTGGC AGCCGGACTG GCAGAACGCC TTCCAGAACG ACGGCTTCGC CGTCATGCTG
GCGCCGGGCT GGATGCTGGG CGTCATCGCG GGCAACGCGG CCGGCGTCAC CGGCTGGGAC
CTCGCCGACG TGTTCCCCGG CGGTGCCGGC AACTGGGGCG GCTCGTTCCT CACGGTCCCG
TCGCAGGGTG CCAACGTCGA GGCCGCCAAG GAGCTGGCCG CGTGGCTGAC GGCCCCCGAG
CAGCAGATCG AGGCGTTCCA GAACAAGGGC ACGTTCCCGA GCCAGGTCGA GGCGCTCGAG
TCGGACGAGA TCAAGTCCGC CACCAACGAG TTCTTCAACA ACGCTCCGGT CGGCGAGATC
CTCGCCAACC GCGCGCAGGG CGTCGTGGTG CCCTTCAAGG GCCCGCAGTA CTTCACCGTG
CAGGACGCGA TCAACAACGC GATCACGCAG GTGGACGTCA ACGGTGCCGA CGCGGCCGCC
GAGTGGGCGA CCTTCGAGGG CGTGGTCCAG GGTCTCGGCT GA
 
Protein sequence
MRKTTRKAWA LAAGVTSIAL VATACSSSDD PGEGDETADG GNITLTVATF NEFGYTDEMF 
DRYEAEHPGV TIEQKVAATS NEARENLNTR LAAGSGTADI EAIEVDWLPE LLQYPDYFED
LSSPEVEGRW LDWKVQQATT ADGKLIGYGT DIGPEGIAYR ADLFQAAGLP ADREAVAELF
GGESATWEKF FEVGKTYTSA TGKPFFDSAA AIYQGMVNQE EAAYEDPDSG DVIALENPRV
KEMYEQVTTA AVGDNLSAHF EQWQPDWQNA FQNDGFAVML APGWMLGVIA GNAAGVTGWD
LADVFPGGAG NWGGSFLTVP SQGANVEAAK ELAAWLTAPE QQIEAFQNKG TFPSQVEALE
SDEIKSATNE FFNNAPVGEI LANRAQGVVV PFKGPQYFTV QDAINNAITQ VDVNGADAAA
EWATFEGVVQ GLG