Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1965 |
Symbol | |
ID | 9145859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2186361 |
End bp | 2188031 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003637059 |
Protein GI | 296129809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGACCA GATGGCAGAT CGTCGCGGGC GGCCTCGCGG CGGGGCTCGC CCTGACCGCG TGCGCGGCCA GCGACCGGGA CCCCGACGCG AGCTCCGACG GCACGAGCGG CGGTGGCCGC AGCACGTTCA TCTTCGCGGC GTCCGGAGAC CCGGCGTCGC TCGACCCGGC GTTCGCGAGC GACGGGGAGT CGTTCCGCGT GGCGCGCCAG ATCTTCGAGG GACTCGTCGG CGTCGAGCCC GGCACGGCCG ACCCCGCGCC GCTGCTCGCC GAGTCGTGGG AGGTCAGCGA CGACGGTCTC GAGTACACCT TCGACCTCAA GGAGGGCGTG ACGTTCCACG ACGGCACCCC GTTCGACGCC GAGGCCGTGT GCTTCAACTT CGAGCGGTGG AACAACTTCA CGGGCGTCCT GCAGTCCGAG TCGCTGTCCT ACTACTGGCA GAAGGTCAAC GGCGGGTTCG CGAGCAGCGA CGTCGCCGGC CTCGACGGCA CGGGCAAGTA CGAGTCGTGC GAGGCACCCG ACGCCGGCAC CGCGGTGATC CGCCTGCGTT CCCCGCTGCC GGAGCTGGTC TCGGCGCTGT CGCTGCCGGC CTTCTCCATG CAGTCACCCA CGGCGCTGCA GGAGTACGAC GCGGACGGCG TGACGGGCGA CGGCGAGGCG CCGGTGCTGC CCGAGTACGC CACGAGCCAC CCGACAGGCA CCGGGCCCTA CGTCTTCGAG TCGTGGAGCC CGGGCGAGGA GGTCGTGCTC TCGGCCTACG ACGAGTACTG GGGCGAGCAG GGCCAGATCA CGCGCATCGT GTTCCCGATC ATCTCGGACG CGACCGCGCG CCGGCAGGCC CTGCAGGCCG GTGACATCGA CGGCTACGAC CTGGTCGGCC CGGCCGACGT CGTCGCCCTC GAGGAGGCCG GCTTCCAGAT CGTCGACCGC GAGCCGTTCA ACGTGCTGTA CCTCGGCATG AACCAGGCGA ACCCCGACCT GGCCGACCTC AAGGTCCGCC AGGCCATCGC GCACGCGATC AACAAGGAGG CGCTCGTCGC GCAGACGCTG CCGGAGGGCA CCGAGGTCGC GACGAACTTC GTGCCGCCGT CGGTCGCGGG CTGGAACGCC GACGTGACGA CGTACGAGTA CGACCCCGAC AAGGCGCGCG CGCTGCTCGC GGAGGCCGGG AAGCCGGACC TCACGATCGA CTTCAACTAC CCGACCAACG TGTCACGGCC GTACATGCCG ACGCCCGAGC AGGTGTTCAC CGCGATCACC GCCGACCTCG AGGCCGTGGG CATCACCGTG AACGCCGTGC CGGACCCGTG GAACCCGGAG TACCTCGACA AGATCCAGGG CGGGTCGGAC CACGGGCTGC ACCTGCTCGG GTGGACCGGG GACTACAACG ACACCTACAA CTTCATCGGG GTGTTCTTCG GCGGTCCGTC GAACGAGTGG GGCTTCGACA ACGCCGAGCT GTTCCAGGCG ATCAACGACG CCCGGTACGT CGCGGACATC GACGCGCAGC AGGAGGCCTA CGAGGCGGCG AACGCCGCGA TCCTCGACTT CCTGCCGGGC GTCCCGCTGG CGCACCCCGT GCCGTCGCTC GCGTTCAAGG CCGACGTCCA GGGCTACCCG GCCTCGCCGG TGCAGGACGA GGTCTACAAC GTGATCACCC TCGGCAGCTG A
|
Protein sequence | MRTRWQIVAG GLAAGLALTA CAASDRDPDA SSDGTSGGGR STFIFAASGD PASLDPAFAS DGESFRVARQ IFEGLVGVEP GTADPAPLLA ESWEVSDDGL EYTFDLKEGV TFHDGTPFDA EAVCFNFERW NNFTGVLQSE SLSYYWQKVN GGFASSDVAG LDGTGKYESC EAPDAGTAVI RLRSPLPELV SALSLPAFSM QSPTALQEYD ADGVTGDGEA PVLPEYATSH PTGTGPYVFE SWSPGEEVVL SAYDEYWGEQ GQITRIVFPI ISDATARRQA LQAGDIDGYD LVGPADVVAL EEAGFQIVDR EPFNVLYLGM NQANPDLADL KVRQAIAHAI NKEALVAQTL PEGTEVATNF VPPSVAGWNA DVTTYEYDPD KARALLAEAG KPDLTIDFNY PTNVSRPYMP TPEQVFTAIT ADLEAVGITV NAVPDPWNPE YLDKIQGGSD HGLHLLGWTG DYNDTYNFIG VFFGGPSNEW GFDNAELFQA INDARYVADI DAQQEAYEAA NAAILDFLPG VPLAHPVPSL AFKADVQGYP ASPVQDEVYN VITLGS
|
| |