Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3454 |
Symbol | |
ID | 9147370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3845906 |
End bp | 3848527 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003638527 |
Protein GI | 296131277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0261651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGCCT CGTCGACGCC AGCACCCGAC GGTGCCGACG TCGACGCCGC CCCCGGACCC GCCACCGCGC GCGAGGCGCC CACGACCAGC GCGACCCCCG CCCGCACCGT CCCGGACGGC ACGAACACGT CACCGCCGAC CTCGCCCGTG CGCGCCGTGC GCTCCCGTGT GGACGTGCAG GACGTGCGCC CACCGCGCGT CCACCACCCC AGCGACCTGC TGGGCGCGAC GCTGACGACG CTGCTCGCCG TCGTCGTCGT CGTCCTCGCG ACGTACGCGC AGAACACCAC CACCGGCGTC GCCGAGGACG TGCAGGGCTT CGCGACGCTG CTGCGGCGCA TCCTGTTCGT CCCCGTCAAC GTGCTCGTCG GCTTCACCAC CGTCGTCGTG CCCATCGCGG TGCTCACCGA GCTCGCGCTG CGACGCCTGG GCCGCCAGCT CCTGCAGGCG GTCCTCGCGG CCGCCGGCGC CATGGTCGTC GTCGCCGCGC TGCACTGGAC GTTCCTCACG TTCGGCTCCG CCGAGCTCGT GCAGGGCCTG TCGGTACGGC TCGGCGGCCA GTGGCGCCTG ACGATCCCCG AGTACACCGC GATGCTCACG GGCCTGCTCA CCGTCGCCGG GCCGCGCGGC CGCCGGCGCA GCGTGGCGTG GTCCTGGAAC CTGCTGTGGC TGACGACGGG CGTCGTGCTC ATCACCGCCG CCGGCTCCCT GCCCGGGCTG GGGCTGTCCC TGCTCGTCGG GCGCGTCGCC GGGCTCGGCG TGCGCTACCT CGGCGGCGTC GACCCCGAGC GGGCCTACGG TGACGCGCTG CTCGCGGGCA TCCGACGCGC CGGCGTCGAA CCCGCGGACG TCGTACGCGT CCCCGACCCC GCGCTCGACG CCGACCGCAC CCTCGCCACG CAGGACGCGC TCGCGATGCT GCCGCCGGCG ACACCCGCGC AGCTCGCGCT CGTGCGGGCC TCCGGCGACC GCGTCTACGA CGTCACCACC GACGACGGCC GGCACCTCGA CCTCATCGTC TTCGACGGCG ACCGGCAGGT CGTCGGCATG CTCACCCGGC TGTGGCGCAG CCTGCGGCTC CGCGGCCTCG AGGGCCGCTC CGCGCTGTCG CTGCGGCGCG CCACCGAACG TGCCGCGTTG CTCTCCTACG CCGCGCGCGC CGCCGGCGTG CGCACCCCGC AGCTGCTGAG CATCGCCGAG GCCGAGGACT CGATGCTCCT GCTGCAGGAG AGCACCGACG CGGCCGTGCC GTTGTCGGAC CTCACGCGTG AGGACATCGG GGACGACGAC CTGCAGGCGA TCTGGGAACA GCTGCGCCTC GCGCACGCCG CGGGCATCGC GCACCGCGCC CTCACCGCCG ACGTGGTCCT CGTCGACCAG CGACCCGGCG CGCCCCGCGT CTGGATCACG GCGTGGGAGC AGGGGGACGT CGCGTCGTCG GAGCTCGCGC GACGCATGGA CACCATGCAG CTGCTCGCGC TGCTCGCGCT GCGTGTGGGC GCGGCCCGCG CCGTCGCGTC GGCCGCCGCG ATGCTCCCGG ACGACGACAT CGACGGCATC GGTCCGCTCC TGCAGACCGT CGCGCTCCCC CGCCGCACGC GTGAGGAGAT GCGCGCCCAC AAGGAGATCC TCGCCGAGCT GCGCTCCGCG CTCGTGGCGC GCATCCCCGA GGCGGACGTG CAGCCCGCGC AGATCGTGCG CTTCGGCGCC CGCACGCTGC TCACCATCGT GCTCACGGTC GTCGCGGTCT TCGTCGTCCT CGCGTCCGTC AACGTCGCGC AGATCGGGCC CGTGCTCGCG CGCAGCGACT GGCGGTTCTC GGTGCTCGCG TTCGGTCTCG GGCTCCTCAC CCTCGTGGGC GCGGCGCTCG CGTTCGTCGC GTTCTCGCCC GTGCGCCTGT CCGTGTGGCG CGCGACGCTC GTGCAGTCCG CCGCGACCTT CGTCGCGCTC GCCGCGCCGG CCGGCATCGG CCCCGCCGCG CTCAACCTGC GCATGCTCAC GCGCCGCGGC GTCAGCGCGT CCCTCGCCGG CGCGACCGTG GCGCTCGTGC AGGTCAGCCA GTTCGTCACC ACGCTGCTGC TGCTGCTCGT GCTCACCGTG ACGTCGGGCG TGCAGTCCCC CACGCCGTTC TCCGTGCCCC CGGCCGTGCT CATCGTCATC GCGGTCGTCG CCGCCGCCGT CGGCGTGGCG CTGCTGTTCC CCGGCGTGCG CACCTGGGTG CAGCGCACGG TCGGACCGAC GTTCCGGCAG ACGCTGCCGC GCCTCATCGA GGTCGTCGGG CAGCCGTGGC GGCTCGCGCT CGCGGTCTTC GGCAACGTGC TCATGACGAT GGGCTACGTG CTCGCGTTCG ACGCCGCGCT CGTCGCGCTC GGCCAGGAGG CCTCGCTGGT GCAGGTGGCG CTCGTGTACC TCACGGGCAA CACCGCCGGC GCGCTCATCC CCACCCCGGG CGGCATGGGC ACCGTCGAGA CGGCGCTGGC CGCCGGCCTG TCCGGGTTCA CCGGCACCAA CATCGGTGTC GCCTACACGG TGGCGCTGCT GTTCCGCCTG CTGACCTTCT GGCTGCGCAT CCCCCTCGGC TGGGTCGCGA TGCGCTACCT GCAGCGCGTC GGCGAGCTCT GA
|
Protein sequence | MHASSTPAPD GADVDAAPGP ATAREAPTTS ATPARTVPDG TNTSPPTSPV RAVRSRVDVQ DVRPPRVHHP SDLLGATLTT LLAVVVVVLA TYAQNTTTGV AEDVQGFATL LRRILFVPVN VLVGFTTVVV PIAVLTELAL RRLGRQLLQA VLAAAGAMVV VAALHWTFLT FGSAELVQGL SVRLGGQWRL TIPEYTAMLT GLLTVAGPRG RRRSVAWSWN LLWLTTGVVL ITAAGSLPGL GLSLLVGRVA GLGVRYLGGV DPERAYGDAL LAGIRRAGVE PADVVRVPDP ALDADRTLAT QDALAMLPPA TPAQLALVRA SGDRVYDVTT DDGRHLDLIV FDGDRQVVGM LTRLWRSLRL RGLEGRSALS LRRATERAAL LSYAARAAGV RTPQLLSIAE AEDSMLLLQE STDAAVPLSD LTREDIGDDD LQAIWEQLRL AHAAGIAHRA LTADVVLVDQ RPGAPRVWIT AWEQGDVASS ELARRMDTMQ LLALLALRVG AARAVASAAA MLPDDDIDGI GPLLQTVALP RRTREEMRAH KEILAELRSA LVARIPEADV QPAQIVRFGA RTLLTIVLTV VAVFVVLASV NVAQIGPVLA RSDWRFSVLA FGLGLLTLVG AALAFVAFSP VRLSVWRATL VQSAATFVAL AAPAGIGPAA LNLRMLTRRG VSASLAGATV ALVQVSQFVT TLLLLLVLTV TSGVQSPTPF SVPPAVLIVI AVVAAAVGVA LLFPGVRTWV QRTVGPTFRQ TLPRLIEVVG QPWRLALAVF GNVLMTMGYV LAFDAALVAL GQEASLVQVA LVYLTGNTAG ALIPTPGGMG TVETALAAGL SGFTGTNIGV AYTVALLFRL LTFWLRIPLG WVAMRYLQRV GEL
|
| |