Gene Information |
Locus tag | Cfla_1872 |
Symbol | |
ID | 9145765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2083663 |
End bp | 2086521 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | DSH domain protein |
Protein accession | YP_003636968 |
Protein GI | 296129718 |
COG category | |
COG ID | |
TIGRFAM ID | |
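The length fields above follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, though it is consistent with the listed values. A minimal Python sketch of the arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the terminal stop codon encodes no residue


length = gene_length(2083663, 2086521)
print(length, protein_length(length))  # 2859 952
```

The result matches the Gene Length (2859 bp) and Protein Length (952 aa) fields of this record.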
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0864183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGTCCC GCCGCCGCAG CACCCGCTCC GCCCCCACGA CCCCCGCCGC GCGCCCCGGT GCCGTCGAGC CCTCGCCCGC GGAGCGCTAC GCCGCGTCCC GCCGCCGCGC CGCCGCCGAG CACGGGGAGC TCGCCGTCTT CCGCGAGCGC CTCGACTTCC CGCTCGACGA CTTCCAGGTC GAGGCGTGCG CCGCCCTCGA ACGTGGCAGC GGCGTCCTCG TCGCCGCACC CACCGGCGCA GGAAAGACCG TCGTCGGCGA GTTCGCGGTG CACCTGGCCC TCGTCTCGGG TCGCAAGGCG TTCTACACGA CGCCCATCAA GGCGCTGTCC AACCAGAAGT ACGCCGACCT CGCCCGCGTC CACGGCACCG AACGCGTCGG GCTGCTCACC GGCGACACCT CCGTCAACGG CGACGCCGAC GTCGTCGTCA TGACCACCGA GGTGCTGCGC AACATGCTCT ACGCCGGCTC GTCGGCGCTC GACGGGCTCG GGTACGTCGT CATGGACGAG GTCCACTACC TCGCCGACCG CTTCCGCGGC CCCGTCTGGG AGGAGGTCAT CATCCACCTC CCCGACGACG TGCAGCTCGT GTCGCTGTCC GCCACCGTCT CCAACGCCGA GGAGTTCGGC GACTGGCTCG CGACCGTGCG CGGCGACACC ACCGTCGTCG TCAGCGAGCA CCGCCCGGTC CCGCTGGGTC AGCACGTGCT CGTCGGCGAC CAGCTCCTCG ACCTGTACGC CGGGCACGTC GACCCCACCG ATCCCGGCGT CGACCCACCG ATCAACCCCG ACCTCACCCA TCTGCTGCGC GGGCGCACCG GCGGCGACCG CGACCGCCGC GCACCGCGCG GCCGCCCCGG GCGAGCCCGC GACCACCGCC CTGCCGGCGG CGTGCGGCCC GTCCCGCGCT TCGTCATGGT CGACGAGCTC GCCGAGGCGC GCCTGCTACC CGCGATCGTC TTCATCTTCT CCCGGGTGGG CTGCGAGGCT GCTGTGCAGC AGTGCCTGTC GGCCGGTGTG CGGCTCACGA CACCGGCCGA ACGCGCCGAC ATCCGCCGCG TCGCCGAGGA GCGCTGCGCC GCGATCCCGC CGGAGGACCT CGAGGTGCTC GGCTACGACG CCTTCGTCGA GGGCCTGGTG CGTGGCGTCG CCGCCCACCA CGCGGGCATG CTGCCGCTGT TCAAGGAGAC CGTGGAGGAC CTGTTCTCCC GTGGCTGGGT CAAGGTCGTG TTCGCCACCG AGACGCTCGC GCTCGGCATC AACATGCCCG CGCGCTGCGT GGTCCTCGAG AAGCTCGTCA AGTGGGACGG CTCCGCGCAC GTCGACGTCA CGCCGGGGGA GTACACCCAG CTGACCGGCC GGGCGGGGCG CCGCGGCATC GACACCGAGG GGCATGCCGT CGTCGTCGCG CACGGCGCTC TCGACCCCGT GCAGCTCGCC GGGCTGGCGT CCCGCCGCCT CTACCCGCTG CGCTCGTCGT TCCGTCCCAC GTACAACATG GCGGTCAACC TCGTGGCCCA GGTAGGCCGC GAGCGCGCCC GTGACGTCCT CGAGACGTCG TTCGCGCAGT TCCAGGCGGA CCGGGGGGTG GTGGGTCTGG CCCGCCAGGC GCAGACCCAC GCGGAGGCCC TGGAGGGCTA CGCCCAGGCC ATGACGTGCC ACCTCGGGGA CTTCGCCCAG TACATGGGGC TGCGGCGGGC GATCACGGAC CGCGAGCGCG GTCTCGCCAA GGAGCAGGCC GCCACACGCC GCGCCGACGT CGCCCGCACG CTCGAGGGGC TGCACGTCGG CGACGTCGTC 
GAGATCCCCG GGGGCCGACG CGCCGAGCAC GCGGTCGTCG TGGATCCCGG CGGTCCGGGC GGGTTCGACG GGCCACGCCC CACCGTGCTG ACCACCGACC GCCAGGTGCG CCGGCTCACG GTGGCCGACG CCGGCTCGGG GCTGCGCACG GTCGGGCGCC TGGCCGTGCC GGCGAGGTTC GAGCTGCGGG TCCCGGCGGC CCGCCGCGAC CTGGCTGCAC GGCTGCGGGC GCGGCTCGAC GAGCTCGACG TCGTCCCGGC GCCTGCGCGC GCCCCGGGCC GGCGCGCCGA GCGGCGTTCG GCTGCCGCCG ACGACGCCGA GCTCGCCGCA CTTCGCCGTG AGCTGCGTTC GCACCCCTGC CACCGGTGCC CGGAGCGGGA GGACCACGCG CGCTGGGCCG AGCGCTGGGA GCGGCTGCGC TCGGAGCACG CCGCCCTCGT GCGGCGCATC GCGGGGCGCA CGGGCTCGAT CGCGGCGGTC TTCGACCGGA TCTGCGACGT GCTCGGGACG CTCGGGTACC TCACGAGCGA CGACTCCGGT GCGCTACGCG TCACCGACGA CGGCCGCTGG TTGCGGCGGC TGTACGCCGA GAACGACCTC GTGCTCGCCG AGTGCCTGCG TCGGGGGGTG TGGGACGAGC TCGACGCCCC CGGGCTCGCG GCCGCGGTCT CGACCCTCGT GTACCGCTCA CGGCGCGACG ACGAGGGCGA CGCACGGGTG CCCGGCGGGC CCGACGGCCG GCTCGGCCGG GCGCTCGACG CGGCGGTGCG GGCGTGGTCG CAGCTCGACG ACCTCGAGCG GGAGGCACGC CTCGAGACGA TCCAGCCGCT CGACCTCGGT CTCGTGCAGC CCGTCCACCG GTGGGCTGCC GGCCGCAGCC TGGACGCCGT GCTGCGGGGC TCGGACCTGG CCGCCGGCGA CTTCGTCCGC TGGTGCAAGC AGGTCATCGA CGTGCTCGAC CAGGTCTCCG GCGCCGCCCC GACCGCGCGC CTGCGCACCA CCGCGGCGAA GGCGGTCACG GCGATGCGCC GCGGTGTCGT GGCGTACTCG ACCGTCTGA
|
Protein sequence | MPSRRRSTRS APTTPAARPG AVEPSPAERY AASRRRAAAE HGELAVFRER LDFPLDDFQV EACAALERGS GVLVAAPTGA GKTVVGEFAV HLALVSGRKA FYTTPIKALS NQKYADLARV HGTERVGLLT GDTSVNGDAD VVVMTTEVLR NMLYAGSSAL DGLGYVVMDE VHYLADRFRG PVWEEVIIHL PDDVQLVSLS ATVSNAEEFG DWLATVRGDT TVVVSEHRPV PLGQHVLVGD QLLDLYAGHV DPTDPGVDPP INPDLTHLLR GRTGGDRDRR APRGRPGRAR DHRPAGGVRP VPRFVMVDEL AEARLLPAIV FIFSRVGCEA AVQQCLSAGV RLTTPAERAD IRRVAEERCA AIPPEDLEVL GYDAFVEGLV RGVAAHHAGM LPLFKETVED LFSRGWVKVV FATETLALGI NMPARCVVLE KLVKWDGSAH VDVTPGEYTQ LTGRAGRRGI DTEGHAVVVA HGALDPVQLA GLASRRLYPL RSSFRPTYNM AVNLVAQVGR ERARDVLETS FAQFQADRGV VGLARQAQTH AEALEGYAQA MTCHLGDFAQ YMGLRRAITD RERGLAKEQA ATRRADVART LEGLHVGDVV EIPGGRRAEH AVVVDPGGPG GFDGPRPTVL TTDRQVRRLT VADAGSGLRT VGRLAVPARF ELRVPAARRD LAARLRARLD ELDVVPAPAR APGRRAERRS AAADDAELAA LRRELRSHPC HRCPEREDHA RWAERWERLR SEHAALVRRI AGRTGSIAAV FDRICDVLGT LGYLTSDDSG ALRVTDDGRW LRRLYAENDL VLAECLRRGV WDELDAPGLA AAVSTLVYRS RRDDEGDARV PGGPDGRLGR ALDAAVRAWS QLDDLEREAR LETIQPLDLG LVQPVHRWAA GRSLDAVLRG SDLAAGDFVR WCKQVIDVLD QVSGAAPTAR LRTTAAKAVT AMRRGVVAYS TV
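The gene and protein sequences above can be cross-checked with a short script. The following is a sketch under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built from the standard amino-acid assignments (which translation table 11 shares), and the first codon is assumed to be a valid initiator and is therefore read as M.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same assignments as the standard code; it differs in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Assumption: the first codon is an initiator (e.g. GTG under table 11),
    so it is reported as M regardless of its usual amino acid.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("GTGCCGTCCC GCCGCCGCAG CACCCGCTCC"))  # MPSRRRSTRS
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed in this record, and `gc_fraction` can be compared against the reported GC content.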