Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2204 |
Symbol | |
ID | 9146104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2460491 |
End bp | 2461861 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | CBS domain containing protein |
Protein accession | YP_003637294 |
Protein GI | 296130044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00693904 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGCG TCCCCGTCGG GCTGCTCGTC GCCGTCGCCG TCGTCGGGAT CCTGCTGGCC GCGGCGCTGT CCGCGGGCGA GGTCGCGGTG CTGCGCGTCA CGCGCGCCCG CGTCACCGAG CTCGAGGCCG AGCGTCCCGG TGCGGCCGCG CGGGTGCGCC GGCTCGTCGA CGACCCGGCA CGCGTCACGG CCGCGGCGTC GTTCGTGCGG CTCCTCGGCG AGATGACGGC CACCGTGTGC CTCACGCTCG CCATCAGCGC CGGCAGCCTG TCGTGGTGGG CCACGGCGCT GCTCGCGATC GCCGCGTGCG CCGTCGTCGC GTTCCTGCTC GTGCGCGTCA GCCCGCGCAG CATGGGCCGG CGCCACCCCG TCGGCGTGCT CGCGAACCTG TCGCGTCTGC TGCTCGCCGT CACCGCGCTC GCGGGCGGCG TGGGCCGCGG TGGCGAGGCC GCGACCACCA CGAGCGAGCA GGACGACGCC GAGCTGCGCG ACATGGTCGA GCGCGTCAGC GAGTCCGACG CGATCGAGGA GAACGAGCGC GAGATGTTCC GCTCGGTGCT CGAGCTCGGG GACACCCTCA CGCGCGAGGT CATGGTGCCC CGCACGGACA TGATCACGAC GCAGGCGGAC ACGCCCCTGC ACAAGGTGCT GGCGCTGCTG CTGCGCTCCG GCTTCTCGCG CGTGCCCGTG GTGGGGGAGT CGGTCGACGA CGTGGTCGGG GTGCTCTACC TCAAGGACGT CGTGCGCCGC ATCCCCGCCC ACGGTCACGG CCACGGCAAC GGCGACGGCG ACCCGCTCGA CGCGCCCGCG GCGTCCCTCG CGCGTCCCGC GGTCTACGTG CCGGAGTCCA AGCCCGTCGA CGAGCTGCTC CTGGAGCTGC GCGACGGGTC CAGCCACATC GCGCTCGTCG TGGACGAGTA CGGCGGCATC GCCGGGCTCG TGACCATCGA GGACGCGCTC GAGGAGATCG TCGGCGAGCT CACCGACGAG CACGACGCCA GCGCGCCCGT CGTCGAGGAG CTCGAGGACG GCGGCTACCG CGTCCCGGCG CGCCTGGGTC GCGACGAGCT CGGCGACCTG TTCGGCCTCG AGGTCGAGGA CGAGGACGTC GACACCGCGG CCGGTCTGCT CGCCAAGGCG CTCGGCAAGG TGCCCCTCCC GGGTGCCGTC GGTGAGATCC ACGGTCTGCG GCTCGAGGCC GAACGTGTCG AGGGCCGCCG CAAGCGCCTG GCGACCGTGC TCGTGCACCG GGCCGAGGAG GCCACGGAGG ACGCCGCCCC CGCTACACCT GCCCGCGGCA CCCATGCCGC GGGAACCCCG TCGCGCGGCA CCCCCACCGT GCGGGACCAC GGCTCGGAGG CCGCCCGATG A
|
Protein sequence | MSGVPVGLLV AVAVVGILLA AALSAGEVAV LRVTRARVTE LEAERPGAAA RVRRLVDDPA RVTAAASFVR LLGEMTATVC LTLAISAGSL SWWATALLAI AACAVVAFLL VRVSPRSMGR RHPVGVLANL SRLLLAVTAL AGGVGRGGEA ATTTSEQDDA ELRDMVERVS ESDAIEENER EMFRSVLELG DTLTREVMVP RTDMITTQAD TPLHKVLALL LRSGFSRVPV VGESVDDVVG VLYLKDVVRR IPAHGHGHGN GDGDPLDAPA ASLARPAVYV PESKPVDELL LELRDGSSHI ALVVDEYGGI AGLVTIEDAL EEIVGELTDE HDASAPVVEE LEDGGYRVPA RLGRDELGDL FGLEVEDEDV DTAAGLLAKA LGKVPLPGAV GEIHGLRLEA ERVEGRRKRL ATVLVHRAEE ATEDAAPATP ARGTHAAGTP SRGTPTVRDH GSEAAR
|
| |