Gene Information |
Locus tag | Cfla_2039 |
Symbol | |
ID | 9145935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2275540 |
End bp | 2277285 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003637133 |
Protein GI | 296129883 |
COG category | |
COG ID | |
TIGRFAM ID | |
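
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon, which is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Start bp and End bp fields of this record.
    length = gene_length_bp(2275540, 2277285)
    print(length)                     # 1746, matching the Gene Length field
    print(protein_length_aa(length))  # 581, matching the Protein Length field
```

Under these assumptions the reported gene length (1746 bp) and protein length (581 aa) are consistent with the coordinates. The strand field does not affect the length calculation.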
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.566112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.522751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCACGC GGACGACGGT CGTGTGGGTG CCCGACTGGC CCGTGGTCGC GGCGATGACC GCCGACGAGG TCCCGGTCGA CGTGCCTGCC GCGGTGCACG ACGGGCGTCG GATCACCGCG GTGTCCGCGC TCGCGCGTGC CGACGGGGTG CGCCGCGGGA TGCGGCGGCG GCAGGCGCAG GGCGTGTGCC CCGAGCTCGT GCTGCTGCCG GTCGACGACG CGCGCGACGT GCGGCTGTTC GAGCCCGTCG CCGCGGCGAC CGAGACGGTC GTCGCCGGTG TCGAGGTCGC CCGGCCGGGC ATGCTCCTGC TGCCTGCCGG CGGAGCCGCG CGCTACCACG GGTCGGAGGA GGCGCTCGCC GAGCGCGTCG TCGACGCGGT CGCGCGCGCC ACCGGTCACG AGTGCGGCGT GGGCACGGCC GACGGGCTGC TCGCCGCCGT GCTGGCCGCG CGCACCGGCG CCGTCGTCGA GCCCGGGCTG TCGCCCGTCT TCCTCGCGCC GCACGGTGTC ACCGAGCTCG TGCACGCCAC CACCACGCCC GAGCAGGCCG CCGAGGTCAT GCGCCTGGTC GACCTGCTGC ACCGCCTGGG GCTGCGCACG CTCGGGGCGT TCGCGGCGCT GCCGGCCCCC GACGTGCACG CGCGGTTCGG GCGGCTGGGC TCGTGGGCGC GCACGCTCGC GCGCGGCCTC GACGAGCGGC CGCCCGCGCG TCGTCGTCCC GAGGCCGACC TCGAGGTGGA CGTCGAGCTC GACCCGCCCG TCGACCGCGT GGACACCGCG ACGTTCGCCG GACGGCGCCT GGCCGAGGAG CTGCACGCCG AGCTCGTCGC GCGCTCCGTG ACGTGCGGGC GCCTGCAGAT CACCGCGCGG ACCGACGACG GGACCGAGCT GGTGCGCACG TGGCGTACCG ACCTGGGCGG CTGGGGCGGG CTCGCCGCCG CGCGCATCAC CGACCGGATC CGCTGGCAGC TCGACGGGTG GCTCACCGCG GCGGCCGTCG CCACGGCTCG GGACCGGCGC CGTGAGGTGC GCGAGCGTGA ACGCGGGGCG CGGGGACGCG GCGAGGGCGG GGGACCGGCG CACGGCGAGC ACGGACCGGT GCGCGGGACG CACGGGGCGG TCCTCGATGT CCGGGTGCCG GACGACGAGG ACGACACGGC GCCCGTCGCG CTGGTGCGTC TGACCCTCAC GGCGCTCGAC GTCGCGCCCG CGGGGTCCGA GGCGACGCAG CTGTGGGGCG GACCGTCGGG TGGGGACCTG CGGGCGCACC GCGCGCTCGA GCGCGCGCAG AGCATCGTCG GCGGGCCGGG CGTGCTCACC GCGACCCTGC AGGGCGGGCG TGACGTGCGT GACCAGGTGC ACGTGCGGCC GTGGGGCGAG CAGAGCGACC CGCCGCGCCC GCTCGACCGT CCCTGGCCGG GCCGGCTGCC GGACCCGGCA CCCGCGACCG TGCTGGTCGA CCCCGTGCAC GTCGAGGTGC GGGACGTGCA CGGCTCGCCC GTGCGCGTCG ACCGGCGGGG GCGGCTGAGC GGACCACCCG GCAGCGTGCT CGCCGGGAGC GGCCCCGACC GGGTGCGCGT CGTCGCCGGG TGGGCGGGGC CGTGGCTGCT GACCGACCGC TGGTGGACGC ACCCCGGCGC CGGGCCGCAG GTGCGCGCCC ACCTGCAGGT CGCGTTCGAC GACGGCGGCG CGGTGCTGCT CACGCACACC GACGGGGCGT GGACGTACGA GGCGGACTAT GACTGA
Protein sequence | MSTRTTVVWV PDWPVVAAMT ADEVPVDVPA AVHDGRRITA VSALARADGV RRGMRRRQAQ GVCPELVLLP VDDARDVRLF EPVAAATETV VAGVEVARPG MLLLPAGGAA RYHGSEEALA ERVVDAVARA TGHECGVGTA DGLLAAVLAA RTGAVVEPGL SPVFLAPHGV TELVHATTTP EQAAEVMRLV DLLHRLGLRT LGAFAALPAP DVHARFGRLG SWARTLARGL DERPPARRRP EADLEVDVEL DPPVDRVDTA TFAGRRLAEE LHAELVARSV TCGRLQITAR TDDGTELVRT WRTDLGGWGG LAAARITDRI RWQLDGWLTA AAVATARDRR REVRERERGA RGRGEGGGPA HGEHGPVRGT HGAVLDVRVP DDEDDTAPVA LVRLTLTALD VAPAGSEATQ LWGGPSGGDL RAHRALERAQ SIVGGPGVLT ATLQGGRDVR DQVHVRPWGE QSDPPRPLDR PWPGRLPDPA PATVLVDPVH VEVRDVHGSP VRVDRRGRLS GPPGSVLAGS GPDRVRVVAG WAGPWLLTDR WWTHPGAGPQ VRAHLQVAFD DGGAVLLTHT DGAWTYEADY D
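
The sequences above are shown in blocks of 10 characters separated by spaces. As a minimal sketch, not part of the record, the blocks can be joined and the GC fraction of a nucleotide string computed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    blocks = "ATGAGCACGC GGACGACGGT"
    seq = clean_sequence(blocks)
    print(len(seq))
    print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content field, assuming that field was computed over the gene sequence alone.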