Gene Information |
Locus tag | Cfla_2336 |
Symbol | |
ID | 9146239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2610486 |
End bp | 2611553 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | ATPase associated with various cellular activities AAA_3 |
Protein accession | YP_003637425 |
Protein GI | 296130175 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00989852 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000618175 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCGACC AGACGCCACC CCCCCATCCC TGGGACGGGT CGCCGAGCCC GCTCCCGTCG ACCCCGCCCG CCACGCCCCC CGGGCCGGTC GGCACGCCCG GCGCCTCGCC GGCCGTGAGC GGTGAGCTGC GCACCGCGCT CGCGGCCGTG CGCACCGAGG TCGGCAAGGC CGTCGTCGGG CAGGACGCGG CCGTCACCGG CCTCATCATC GCGCTGCTGT GCCGCGGGCA CGTGCTGCTC GAGGGCGTGC CCGGCGTCGC CAAGACGCTG CTCGTCCGCT CGCTGTCCGG CGCGCTGTCG CTGGGCACCA AGCGGGTGCA GTTCACGCCC GACCTCATGC CCGGCGACGT CACCGGCTCG CTGGTGTACG ACGCGCGCAC CGCGGAGTTC TCGTTCCGCG AGGGTCCCGT CTTCACCAAC CTCCTGCTGG CCGACGAGAT CAACCGCACA CCCCCCAAGA CGCAGGCGTC GCTCCTGGAG GCCATGGAGG AGCGGCAGGT CACGGTGGAC GGCGAGCCGC GCCGGCTGCC CGACCCGTTC GTGGTCGTCG CCACCCAGAA CCCCGTCGAG TACGAGGGGA CCTACCCGCT GCCCGAGGCG CAGCTCGACC GCTTCCTGCT CAAGCTGCTG CTGCCGCTGC CCGAGCGTGA GGACGAGGTC CAGGTGCTCA GCCGGCACGC CGCAGGATTC GACCCGCGCG ACCTGGCGGC CGCCGGCCTG CGCGCGGTCG CGGGCGCCGA CGAGCTCGCG GACGCCCGCG CGCAGGTGGC GCACGTGCAG ATCAGCCCGG AGGTCCTCGG TTACGTGGTC GACGTGTGCC GCGCGACCCG GACGTCGCCG TCGTTGAGCC TCGGGGTGTC GCCCCGCGGT GCCACGGCGC TGCTGGCGAC GTCGCGCGCG TGGGCGTGGC TCTCGGGCCG CGACTACGTG ACGCCCGACG ACGTCCAGGC GCTCGCGCAC CCGACGCTCC GGCACCGCGT GCAGCTGCGC CCGGAGGCCG AGATCGAGGG CGTCACGGCC GAGACCGTGC TCGACACCGT GCTGGCGTCC GTCGCGGTCC CGCGGTGA
|
Protein sequence | MTDQTPPPHP WDGSPSPLPS TPPATPPGPV GTPGASPAVS GELRTALAAV RTEVGKAVVG QDAAVTGLII ALLCRGHVLL EGVPGVAKTL LVRSLSGALS LGTKRVQFTP DLMPGDVTGS LVYDARTAEF SFREGPVFTN LLLADEINRT PPKTQASLLE AMEERQVTVD GEPRRLPDPF VVVATQNPVE YEGTYPLPEA QLDRFLLKLL LPLPEREDEV QVLSRHAAGF DPRDLAAAGL RAVAGADELA DARAQVAHVQ ISPEVLGYVV DVCRATRTSP SLSLGVSPRG ATALLATSRA WAWLSGRDYV TPDDVQALAH PTLRHRVQLR PEAEIEGVTA ETVLDTVLAS VAVPR
|
| |
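The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TGA). The helper names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical.

```python
# Values copied from the record above.
START_BP = 2610486
END_BP = 2611553


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1068 355, matching Gene Length and Protein Length
```

Under these assumptions the coordinates reproduce the listed 1068 bp and 355 aa. Passing the full gene sequence string to `gc_fraction` would give a value to compare against the listed GC content.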