Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1803 |
Symbol | |
ID | 9145696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2009618 |
End bp | 2011327 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF349 |
Protein accession | YP_003636899 |
Protein GI | 296129649 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0809623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0235589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGAGC ACCCGACGAC GTCGTCCACG GACGACGCCG AGCAGGCTCC CCCCCTCGCG GGCGACGCCG GGGACACCGT CGCAGAGGCC GTCGCCGACG CGCCCGACGC GACGCCGGAG GCGCTCCCGA CGCCGGAGAC CGAGGCCGCC CCGGCAACCG ACGCGACCGC CGAGGCCGCC CCGGCAACCG ACGCGACCGC CGAGGCCGCC CCGGAAGCCG ACGCGGCCAC CGCAGACGCG ACCGCCGACG ACGCGACCAC CGCAGACGCG ACCGCCGACG GCACACCCGC CGACGAGACC ACGGACGGCA CACCGGCACC CGCCCGCCCG GACCGGCCCG GGCCCCGTCC CTCACCGGCG TCGGTGCGTC CGCACCCCCG CCCGGGTCGG CCCGCCGCTG CCGCTCCCGT CGTGCCCGTC ACCCCGGTGC CCACGCCCGA CGAGGAGGCC GCAGCCCAGC ACGCCGCGAC CTTCGGACGC GTCGACGAGG ACGGCACGGT GCACGTCGTC GAGGCCGCCG GCGAGCGTGC GGTCGGGCAG TTCCCCGGTG CGAGCGCCCC GGAGGCGCTC GCGCTCTACG TGCGCCGGTT CCTGGACCTG CAGGCGAAGG TCGCGCTCTT CGAGGCCCGG CTCTCCGCGA CGGACCTGTC CGTCAAGGAG ATCGACCAGA CCCTCACGCG CCTCTCCGAG GAGCTCGCCG AGCCGGCCGC CGTCGGCGAC CTCGACGGGC TCCGCGCCCG CCTCGAGGGC CTGCGCGCCC GTGCGGGAGA GCGTCGTGCG CAGGCGGAGG CCGAGCGCGC GGCGGCCCGT GAGGCCGCCG TCGCCGCCCG CACCGCGATC GTCGAGCAGG CGGAGAGGAT CGCCTCGACC GACCCGTCCC GCATCCAGTG GCGGCCCGCG GGTGAGCAGC TCCGCGGGCT GCTGGACCAG TGGAAGGACG CCCAGCGCTC CGGGCCGCGC ATCGACCGAC CCACCGAGGA GTCGCTCTGG AAGCGGTTCA GCCACGCGCG CACCACCTTC GACCGTGAGC GGCGGCACTT CTTCGCCGAG CTCGAGGCAC GCAACTCCGA GGCGAAGGCC GTCAAGGAGC AGCTCGTCGC GGAGGCCGAG CGCCTCGCCT CGAGCACCGA CTGGGGTGGC ACCTCTGCGG CGTTCCGTGA CCTCATGACC CGCTGGAAGG CGGCCGGCCG CGCGAACCGC CAGGTCGACG ACGCGCTCTG GGCGCGGTTC CGGACGGCGC AGGACACGTT CTTCGCGGCA CGCGACGCCG CGAACCAGGC GATCGACGAG GAGTTCCGCG CGAACCTCGT CGTCAAGGAG GCGCTGCTCG TCGAGGCGGA GGCCCTGCTG CCGATCACCG ACCTCGGTGC CGCGAAGGCG GCGCTGCGCT CGATCCAGGA CCGCTGGGAC GCCGCGGGCA AGGTCCCGCG CGCCGACGTC CAGCGTGTCG AGGCCCGCAT GCGGGCGGTC GAGAGCGCCG TGCGCGAGGC CGACTCCGCG CAGTGGCGTC GCACGAACCC CGAGACGCGA GCACGCGCCG AGGGCGCGGC CGCTCAGCTC GAGCAGGCGA TCGCGGGTCT CGAGGCCGAT CTGGCCGCCG CCCAGGCCAA GGGCGACAAG CGCAGGGTCG CCGAGGCGCA GGCTGCGCTC GACGCCCGGC GCGCGTGGCT CGAGCAGGTC CAGCGCGCCG CGCAGGACGC GCGCGGCTGA
|
Protein sequence | MTEHPTTSST DDAEQAPPLA GDAGDTVAEA VADAPDATPE ALPTPETEAA PATDATAEAA PATDATAEAA PEADAATADA TADDATTADA TADGTPADET TDGTPAPARP DRPGPRPSPA SVRPHPRPGR PAAAAPVVPV TPVPTPDEEA AAQHAATFGR VDEDGTVHVV EAAGERAVGQ FPGASAPEAL ALYVRRFLDL QAKVALFEAR LSATDLSVKE IDQTLTRLSE ELAEPAAVGD LDGLRARLEG LRARAGERRA QAEAERAAAR EAAVAARTAI VEQAERIAST DPSRIQWRPA GEQLRGLLDQ WKDAQRSGPR IDRPTEESLW KRFSHARTTF DRERRHFFAE LEARNSEAKA VKEQLVAEAE RLASSTDWGG TSAAFRDLMT RWKAAGRANR QVDDALWARF RTAQDTFFAA RDAANQAIDE EFRANLVVKE ALLVEAEALL PITDLGAAKA ALRSIQDRWD AAGKVPRADV QRVEARMRAV ESAVREADSA QWRRTNPETR ARAEGAAAQL EQAIAGLEAD LAAAQAKGDK RRVAEAQAAL DARRAWLEQV QRAAQDARG
|
| |