Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2771 |
Symbol | |
ID | 9146679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3082660 |
End bp | 3085260 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Protein of unknown function DUF1998 |
Protein accession | YP_003637855 |
Protein GI | 296130605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00683804 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000025981 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGTCCCG GTGAGCTGCT CGAGGTGCTC CTCGCCGGTG GGCGACGGGC CGACCGGGCG ACCCACGTGC GCGAGCTCCC GGCGCGGCCG GGGGTCCGCG CGGACTGGCC CGCGTGGGCC GACGCGGACC TGGTGCGCGG GTACCGGGCG CTGGGCGTCG AACGGCCCTG GGAACACCAG GTCGACGCGG CCGAGGCAGC GTGGTCGGGC CGGCACACCG TGCTCGCGAC GTCCACCGGG TCCGGCAAGT CGCTGGCCTT CTGGCTGCCC GCGGTCTCGG CCGTGCGGCG TGGTGCCGTG GGGGCGCTGC TGGACCCGGG ACGCATCGAG TCGGCGACGC GGCGCCCGAC GGTGCTGTAC CTGGGCCCGA CGAAGGCGCT CGCAGCGGAC CAGCTCGCGG GGCTCGAGCG TCTGCTGGCA GCGGCGGGGA CGCGCGACGT GCGGGTCGCG ACGTGCGACG GCGACACGTC GCGCGACGAG CGCCGCTGGG TGCGCGAGCA CGCGGACGTC GTCCTCACCA ACCCCGACTT CCTGCACTTC GCGCTGCTGC CCGCGCACAC GTCGTGGTCG CGCGTGCTGT CGTCGCTGGC CTTCGTGGTC GTCGACGAGT GCCACGCGTT CCGCGGTGTG TTCGGCGCGC ACGTCGCGTT GGTGCTGCGC CGGCTGCGGC GCCTGGCCGC GGCGTACGAT GCCGCGCCGG TGGTCGTGCT GGCGTCGGCC ACGACCTCCG ACCCGGCCGC GAGCGCCGCC CGTCTGCTGG GCGTCGAGCC CGGTGACGTG CACGCCGTCA CGGCGGACAC GTCGCCGGCC GGGCGCAGGA CCGTCGTGCT GTGGCAGCCG CCGGAGCTGC CCAGTGGCGA CGGGCCGTGG GCGTCGCTGC TGCCCGACGA GGACCCGTGG TCGACGGTGA TCACCGTGCC GGCGAGCGAC GGCGCGGCCC GCGCGCTCGC CGGGGCGGAC GACAGCGGTA CGCAGGACGC GGCCGGCGCC ACGGACGGTG CCGGAGCGCC GGCGGGGCCG GCACCTGTCG CCGAGGGGTC GGGGCCGGTG GGCACGGGTG AGCGCCTCGT CGCGGTGCCC CGGGACCGTC CGCGACGCAC GGCGACGGCG GAGGTCGCCG ACCTGCTGGC GGACCTGGTC GCGGCCGGGG CCCGCGTGCT GGCGTTCACG CGGTCGCGGC GCGGCGCGGA GTCGGTCGCC GCGACGACGC GGGCCCACCT GGCGGAGGTC GACCCGACGC TGCCGTCGCT CGTGTCGTCG TACCGCGGCG GCTACCTGCC CGAGGAGCGG CGTGCCCTGG AGCGTGCCAT CCGCGCCGGG CACCTGCGGG CCCTCGCGAC GACGAACGCG CTGGAGCTGG GCGTCGACAT CTCGGGGCTC GACGCGGTGC TCATCGCCGG CTGGCCGGGA ACACGTGTCT CGCTGTGGCA GCAGGCGGGC CGCGCCGGGC GGGCGGGCGC GGACGGCCTC GTCGTGCTCG TCTCGCGCGA GGACCCGCTC GACACGTACC TCGTGCACCA CCCGGAGGCG GCGCTGGACG TGCCCGTCGA GGCCACCGTG TTCGACCCGG GCAACCCGTA CGTGCTCGCG CCGCACCTGT GCGCGGCCGC CGCGGAGCGT CCGCTGCGCG CGGACGAGCT CGACCTGTTC GGCCCGCGGG CGCCCGAGCT GCTGGCCGAG CTCACCGCGC GCGGCATCCT GCGTCGCCGC TCGTCCGGCT GGTACTGGAC GCACGCCGAG CCGGCGAGCC GCATGACGGA CCTGCGCGGC GCCGGCGGCG ACCCCGTGCG CGTCGTCGAG ACGGCGACGG GCCGTCTGCT GGGCACCGTG GACGCCGCGT CGGCCGACGC GACCGTGCAC CCCGGGGCCG TGTACGTGCA CCTGGGCACG ACGTACGTGG TCGACGAGCT GCACCTGCAG GACGGTGTCG CCCTGGCGAC GCGACGGGCC GTCGACCACG GCACGTGGGC GCGCTGGGTG ACCTCGACGA CCGTCGTCGA CGTCGAGCGC GAGGTCGCGT GGGGTCCGCT GACGTGGTCC TACGGACAGG TGGACGTGAC GACGCAGGTG ATCGGGTACC AGCGGCGACG GCTCCCGGAC CTGCAGGTGC TCTCGACGCA CGACCTCGAC CTGCCCGCGC GCACGCTGCG CACGACCGCC GTGTGGTGGA CGACCCCGCC CGAGGTGCTG GCCGAGGCCG GTGTCACGCT CGAGGTCGCC CCGGGTGCGC TGCACGCCGC CGAGCACGCG TCGATCGGTC TGCTGCCCCT GCTCGCGACG TGCGACCGCT GGGACCTCGG CGGACTGTCG ACGCTTCAGC ACCCGGACAC CGGGCAGGCG ACGGTGTTCG TCCACGACGG GCACCCCGGC GGTGCGGGGT TCGCCGAGCG CGGCTTCGAG CTGGGGCCGG TGTGGCTCAC CGCGACGCGG GACGCGATCG CCGCCTGCCC GTGCGCGACG GGCTGCCCGG CGTGCGTCCA GTCCCCCAAG TGCGGCAACG GGAACGAGCC GCTCGACAAG GCCGGCGCCC TGCGCCTGCT GTCCACCGTC CTGCGCCACG CCGCGGACGC GCCGACGCCG GACGACCCCG CCGCGCGCTG A
|
Protein sequence | MGPGELLEVL LAGGRRADRA THVRELPARP GVRADWPAWA DADLVRGYRA LGVERPWEHQ VDAAEAAWSG RHTVLATSTG SGKSLAFWLP AVSAVRRGAV GALLDPGRIE SATRRPTVLY LGPTKALAAD QLAGLERLLA AAGTRDVRVA TCDGDTSRDE RRWVREHADV VLTNPDFLHF ALLPAHTSWS RVLSSLAFVV VDECHAFRGV FGAHVALVLR RLRRLAAAYD AAPVVVLASA TTSDPAASAA RLLGVEPGDV HAVTADTSPA GRRTVVLWQP PELPSGDGPW ASLLPDEDPW STVITVPASD GAARALAGAD DSGTQDAAGA TDGAGAPAGP APVAEGSGPV GTGERLVAVP RDRPRRTATA EVADLLADLV AAGARVLAFT RSRRGAESVA ATTRAHLAEV DPTLPSLVSS YRGGYLPEER RALERAIRAG HLRALATTNA LELGVDISGL DAVLIAGWPG TRVSLWQQAG RAGRAGADGL VVLVSREDPL DTYLVHHPEA ALDVPVEATV FDPGNPYVLA PHLCAAAAER PLRADELDLF GPRAPELLAE LTARGILRRR SSGWYWTHAE PASRMTDLRG AGGDPVRVVE TATGRLLGTV DAASADATVH PGAVYVHLGT TYVVDELHLQ DGVALATRRA VDHGTWARWV TSTTVVDVER EVAWGPLTWS YGQVDVTTQV IGYQRRRLPD LQVLSTHDLD LPARTLRTTA VWWTTPPEVL AEAGVTLEVA PGALHAAEHA SIGLLPLLAT CDRWDLGGLS TLQHPDTGQA TVFVHDGHPG GAGFAERGFE LGPVWLTATR DAIAACPCAT GCPACVQSPK CGNGNEPLDK AGALRLLSTV LRHAADAPTP DDPAAR
|
| |