Gene Information |
Locus tag | Cfla_3044 |
Symbol | |
ID | 9146956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3387023 |
End bp | 3388171 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003638126 |
Protein GI | 296130876 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
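The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; neither assumption is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3387023
end_bp = 3388171
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp)        # 1149
print(span_bp % 3)    # 0, the span is a whole number of codons
print(coding_codons)  # 382
```

Under these assumptions the 1149 bp span corresponds to 383 codons, of which 382 encode residues, matching the listed protein length.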
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.755391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGACA TCGACCCTGC ACGGGCGGCG TACGCGTTCG CGAACAGGTT CGTCAACGTG GTCGTCACGC TGCCGAACGG CAAGCCGCTG CGCACGGCCG GCCGCTGCGG CGGCATGTCG TTCTCGGTGC TCGACCACCT CGCCGCCGGC CAGGACGTGC CGGTGTGGCC GCCGGCGCTC TTCGCACCCG GCACGGTGCC GCCCGACGGC CACGTCGTCG CCGACCATCT CAAGGCGCGG CAGCTCGACT CGTTCCGGAC GCTGTCGGCG CTGCGCTTCC TCGCGTGGTC GGTGCTGCCG GACCGCGACC TCGGGCCGGT CCGCGGCGTG CGGACGCTCA CGCGCCGCCA GCTGCCGGGC GTGCTGCGCG CGCTCGCGCA GGGGCGCCCG GTGGTGCTCG GGCTCGTGGT CGCGACGAAC CTGCTGCGCG CGGGCGACAA CCACCAGGTC GTCGCGACGG GGTTCCGGCG CGATGCCGAG GGCGGGCTCG TCCTCACGCT GCTCGACCCC AACACTCCGC GTCGCGAGGT CACGCTCGTC GAGACGCCGG AGGGCTGGCG GGCGAGCAAC GGGCGGCTGT GGCGGGGCTT CTTCCACCAC TCCTACGCCC GGCGCACCCC GCCACCGCTG CCCAGCGCGT CACGGCGCCC CACGGCGCGC GTCCGGTCCG GGGCACCGCT GGCTCTCGCG CACGTCGCGA CGGGCAGCCT GCTGGTCGTC GCCGGCGGCG TCCCGGTGCT GGCGGCGGCG GGTACGACGA CGTGGGCGCC GGAGTCCGGC CGGGGAGGGC TGCTGACCGA CGGGGCGGTC GTGCGGCTCG TGCCAATCGG CACGCCGGGG CCCGACGGGG ACGACGGGAC GGCGCCCCTG TCGCTCGGCG TCGTCGACGG CACACCGCGC CTCGTCGACA GCCCACCGCC CCGCGTCGAC GGGACGCCGG TCCCCGTCGA CGGCCCGTCG ACCCTCGGCA CCACCGAGGC GCCCGACGCC TGGCGCGTCG TCGTCGACGG CGCCGGCCCG TGGCGGGAGG GGTCGCGCGT GCGCCTGGTC CACACGACGA CGGGCGCGCT GCTCGCCGGG AGCCCGTCGG GCGTCCGCAC GACGTTCGAC CCGGACCCAT CGACCTGGTG GACGACGGCC GCGCACTGA
|
Protein sequence | MGDIDPARAA YAFANRFVNV VVTLPNGKPL RTAGRCGGMS FSVLDHLAAG QDVPVWPPAL FAPGTVPPDG HVVADHLKAR QLDSFRTLSA LRFLAWSVLP DRDLGPVRGV RTLTRRQLPG VLRALAQGRP VVLGLVVATN LLRAGDNHQV VATGFRRDAE GGLVLTLLDP NTPRREVTLV ETPEGWRASN GRLWRGFFHH SYARRTPPPL PSASRRPTAR VRSGAPLALA HVATGSLLVV AGGVPVLAAA GTTTWAPESG RGGLLTDGAV VRLVPIGTPG PDGDDGTAPL SLGVVDGTPR LVDSPPPRVD GTPVPVDGPS TLGTTEAPDA WRVVVDGAGP WREGSRVRLV HTTTGALLAG SPSGVRTTFD PDPSTWWTTA AH
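The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: GTG is an accepted alternative start codon there, and the initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python, not the database's own pipeline. The helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, and the code assumes the input starts exactly at the initiation codon. Table 11 shares its codon-to-residue assignments with the standard genetic code, so the standard assignments are used here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading the first codon as M and stopping at a stop codon.

    Assumption: the sequence starts at the initiation codon, so the first
    codon is emitted as methionine whatever it normally encodes.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append("M" if pos == 0 else residue)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
fragment = "GTGGGCGACA TCGACCCTGC ACGGGCGGCG"
print(translate_cds(fragment))  # MGDIDPARAA
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the listed protein sequence and GC content to be compared against the record in the same way.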