Gene Information |
Locus tag | Cfla_0986 |
Symbol | |
ID | 9144861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1093054 |
End bp | 1095105 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003636091 |
Protein GI | 296128841 |
COG category | |
COG ID | |
TIGRFAM ID | |
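The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1093054, 1095105)
print(length)                     # 2052
print(protein_length_aa(length))  # 683
```

Both results agree with the Gene Length and Protein Length values listed in this record.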
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0589275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.732347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGTGG CGAGCGGGCG GACGACGGGG GACACGCAGG ACACGCGGTC GCACGGGGCG GGCCGGCGGG CCGGGCGCCG CGCGGCGACG CTCCTGGTGC CGGTGCTCGC GGTGACCACC CTCGCGGCGT GCACCGCCGA GGAGGTCGAC CCGGCGCGTG CCGGGACCGT GGTCGTCTCG GTGGACCTGC CCTTCGCCTC GCTCAACGGC GCGACGGCCG CCGGTCGTGC GCCGGGCAGC GTGCTGGTGC GCGGTCTGGT GCAGTCCGGG TTCTCGGCGA TCGAGCCCGA CGGCACGGTC CGCCCGGACG AGTCGTTCGG CACCGTCGAG AAGGTCGGCG ACGACCCGCT GACGGTGCGC TACACGATCG CGCCGACCGC GCGGTGGTCC GACGGCGTGC CCGTCACCCC CGCGGACCTG CTGCTCGAGT GGGCGGCGCG CAGCGGTCAG CTGGACGAGG TCGTGCCCGA GCTCGACGCG GACGGCGTGG TCGCGCACAC CCGGGACGAC GTCGTCGTCT TCGGTGCCGC GTCCGCCGCG CTCGCGCGCG CCGCGTCGGT CCCGACCGTC GAGGGCGACA CCGTCACGGT CGTCTACGAC GCCCCTGTCG CCGACTGGCG CACCGCGCTG GACGTCAACC TGCCCGCGCA CGTCCTGGGG CGCCTCGCGC TCGATCCCGA CGCGCCCGCG CTGCCGATCC CGGGGGCGAC TCCGTCGGCC ACGGCCACGG GCACCACGTC GCCGTCGGCG CAGGCCGCGG CGGACGCCAC CGCGACCACG TCGCCCTCGC CCGTCGCAGC GGCGTCCGCG TCGCCCTCCG GCGACGCGGA TCCCGACGCC ACTCCCGAGC CTGCGGGCGA GCGCGACGAC GCCGGGGCCG GCGAGGAGCT CGAGGAGGCC GCCGGCTGGG CGCAGGCGGT GGTGACGGCC GTGCAGCAGC AGGACCGGTC GGCGCTCGTG CCGATCTCGC GGGTCTGGCG TGCGGCCGGG CGCGCGGGGG ACGTCACGGC GGACCCGACG CTCACGACGA CGACCGGACC GTACGTGCTC GCGGACGTGG GGGGAGCGGG CGTCGAGATG GTCCGCAACG AGCGGTACGC GGGGGAGGCG CCGGCGGCGT ACGACCGGGT GCGGGTGCGC ACGGACCTCG ATCCCCTCGC CCAGCTCGAC GCGCTCGCCG CGGACGAGGT CGACGTGGCA GCACCGGTGA GCACGTCCGA CGTCCTGGCA GCGGCGGAGG GCCTCGAGGA CGTCGCGATC GCCACGGGCG GGGACGCCGT GCTCCAGCTC GTGCTGCAGC AGGACGCCGG TGGCGTCTTC GATCCCGGCT CGTACCAGGA CGCGCCCGAC CCGGCGGCGA CGGTCGCCGC ACTGCGCGCG GCGTTCCTCG TGAGCGTGCC GCGCGAGGAG GTCGTGGTCG ACGCGGTGCG CCCGCTGTGG GCGCGTGCGC AGGTGTCGGA GGTGGTCGCG GCGCAGGTGG CGCCGGCGGC GACGCCCACG CCGGTGGCGT CGGCCACCGC CGCGGCGGCG GCCGACGGGC CCGTGGAGGT GCGGGTGCTG ACGAACACCG CCGACCCCCT GCGCGCGGCG GTGCTCGACG CGCTGACGAC GGCGGCCGCC GAGCAGGGCT TCGAGGTGGT CCCCGTGGCG ACGGCGGACG CGGCGCAGAG CCTGCGCACG CGCCCGGAGG ACTGGGACGC CGCGCTCGTA CCCGTCGCCC AGGAGGACCT GCCCGTCGCC GCCTTCGCGG CGCGCTGGCG CAGTGGGGGC GCCACCAACG TCACGGGCCA CGCGGACCCC 
GCGCTCGACG AGGTGCTCGA CGCGCTGGTC GCGCAGCCGG ACCCGGACGC CGCGGGTGCG CAGGTCGCGG ACGCGTCGGC CGCCCTGCGC ACGTGGGGCG CCGTGCTCCC GGTGGTGCGC ACACCCGTCC TGACGGTGTC CGCCACGCGT GACGCGGCGG AGGACCGCGG GCTGCCGGTG GTCGCGGACG TCCCGGTGCT CACACCTGCT GCGGCGGACC TCACATGGTG GTGGAACTGG ACACGACGGT AG
|
Protein sequence | MSVASGRTTG DTQDTRSHGA GRRAGRRAAT LLVPVLAVTT LAACTAEEVD PARAGTVVVS VDLPFASLNG ATAAGRAPGS VLVRGLVQSG FSAIEPDGTV RPDESFGTVE KVGDDPLTVR YTIAPTARWS DGVPVTPADL LLEWAARSGQ LDEVVPELDA DGVVAHTRDD VVVFGAASAA LARAASVPTV EGDTVTVVYD APVADWRTAL DVNLPAHVLG RLALDPDAPA LPIPGATPSA TATGTTSPSA QAAADATATT SPSPVAAASA SPSGDADPDA TPEPAGERDD AGAGEELEEA AGWAQAVVTA VQQQDRSALV PISRVWRAAG RAGDVTADPT LTTTTGPYVL ADVGGAGVEM VRNERYAGEA PAAYDRVRVR TDLDPLAQLD ALAADEVDVA APVSTSDVLA AAEGLEDVAI ATGGDAVLQL VLQQDAGGVF DPGSYQDAPD PAATVAALRA AFLVSVPREE VVVDAVRPLW ARAQVSEVVA AQVAPAATPT PVASATAAAA ADGPVEVRVL TNTADPLRAA VLDALTTAAA EQGFEVVPVA TADAAQSLRT RPEDWDAALV PVAQEDLPVA AFAARWRSGG ATNVTGHADP ALDEVLDALV AQPDPDAAGA QVADASAALR TWGAVLPVVR TPVLTVSATR DAAEDRGLPV VADVPVLTPA AADLTWWWNW TRR
|
| |
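The GC content field can in principle be recomputed from the gene sequence. The sketch below is one hedged way to do so in Python, assuming the sequence is pasted as shown in this record, in space-separated blocks of 10 bases. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between sequence blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three blocks of the gene sequence in this record, pasted with spaces.
fragment = "GTGAGCGTGG CGAGCGGGCG GACGACGGGG"
print(f"{gc_content(fragment):.0%}")  # 80%
```

Passing the full gene sequence string instead of `fragment` gives the value for the whole gene. The 80% printed here applies only to the 30-base fragment.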