Gene Information |
Locus tag | Cfla_0516 |
Symbol | |
ID | 9144383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 549899 |
End bp | 551020 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003635629 |
Protein GI | 296128379 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00149183 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000250398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGGACG TTCTCGACGA CCTGTACGGG GGGTCGCTCG ACGACTTCGT CGCGCGCCGG GACGCCGCCG TGCGGGCCGC GAAGGCCGCG AAGGACAGGG ACCTCGCGGG ACGGATCGGA TCGCTGCGCA AGCCGACCGT CGCCGCCTGG GCCGTGAACC TGCTGGTGAG GGACGACCTG TCGCTGGCCG GTGCGCTGCG CGACCTCGGC GAGGGCATGC GCGACGCGGA GCGGTCGCTC GACGGGCCGG CGCTGCGTGA GCTCGGCAAG CAGCGGCGCG CGCTCGTGTC CGGGCTGGTC GCGCGCGCAC GTCGGCTCGC CTCGGACGCC GGGCAGAAGC TCTCGACGGC CGCGGCGCAG GAGGTCGAGC ACACCCTGAC GGCGGCGCTC GCGGACCCGA AGGTCGCTGA CGCCGTCGCC GCGGGCACCC TCACGCACGG CACCGAGCAC GTCGGCTTCG GCTCCGCGGA CGCCGCCGAG GGAGGACGTG GAGCGAACGG GACGTCGGGC GGCCGGAGTG CTTCCCGCCC TCCCGAGCGC ACGGCGCAGT CGGTGGGCAC GCGCGGACCG GCCGACGCGG GGAGCGGGAC GTCCGACGCC GCCGACGGCG AGGGGCAGGG CGACGCCCCA GCGGGGGGCG ACCCGTCGAC GGGCCGAGCG TCGGCCGCGG GCCGCACGTC AGCCGGGAAC CGCGCCGAGG ACGCCGAGAG GCGTGAGCGT GAACGCCAGG AGCGCGAGCG CGAGAAGCAG CAGCGCCGGC GCGAGCAGGC GGCCGCCGCG CGCGAGGCCG CCGAGGCCGA CCTCGACCGC GCCCGCACCC GGCACGACGA CGCGGCGACC GCTGAGCAGC AGGCCGCGAC CGCGCTCGAG GACGCCGAGC GCGCGTCCGC TGAGGCCTAC GAGGCCGAGG AGGAGCTGGT CCGGGCGCTC GCGGACGTGC GGCGGCGCCT CGCCGAGGCC CGTGCCGTCC TGCCTCGCGT CGACGAGGCC CTCGTCGCCG CGCGCTCGGC CGCCCGTGAC CGCCAGAAGG CCGCCCGGGC CGCTGCCCGG GAGCTGGAGC GGACGACGAA CGCCGCGGTG CGCGCCCGCG AGCGGGAGGA ACGCCTCGCT GCGGAGACCT GA
|
Protein sequence | MEDVLDDLYG GSLDDFVARR DAAVRAAKAA KDRDLAGRIG SLRKPTVAAW AVNLLVRDDL SLAGALRDLG EGMRDAERSL DGPALRELGK QRRALVSGLV ARARRLASDA GQKLSTAAAQ EVEHTLTAAL ADPKVADAVA AGTLTHGTEH VGFGSADAAE GGRGANGTSG GRSASRPPER TAQSVGTRGP ADAGSGTSDA ADGEGQGDAP AGGDPSTGRA SAAGRTSAGN RAEDAERRER ERQEREREKQ QRRREQAAAA REAAEADLDR ARTRHDDAAT AEQQAATALE DAERASAEAY EAEEELVRAL ADVRRRLAEA RAVLPRVDEA LVAARSAARD RQKAARAAAR ELERTTNAAV RAREREERLA AET
|
| |
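The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length (the gene sequence above ends in TGA). The function names `cds_lengths` and `gc_percent` are hypothetical helpers introduced here for illustration only.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS whose final
    codon is a stop codon (so it is excluded from the protein).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


def gc_percent(grouped_seq: str) -> float:
    """Return the GC percentage of a sequence written in spaced blocks."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(grouped_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(549899, 551020)
print(gene_len, protein_len)
```

With the coordinates from this record, the result agrees with the Gene Length (1122 bp) and Protein Length (373 aa) fields. Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content.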