Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1349 |
Symbol | |
ID | 9145233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1494701 |
End bp | 1496401 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003636446 |
Protein GI | 296129196 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000440573 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCGCGC AGCCCGGGAC CGTCCCTGCG CAGACGTCAG AGCCTGCGCC GGGAGCCGTC GCCGAGCCCG CGACGGCCCC GCAGATCCTC GCGCGCCCGC TCGCGCAGAC GTCCCTGCTC GACGCCGTGC GGGACCTGCG GCGCGACGTC GACGCCACGT CGTTCCCGCT CGAGATCCCC GGGGTCGGCG ACGCGCGGGC GTCGCGCGCA CGACTCGTCG ACCAGCTCGA CGAGCACCTC GTGCCCCGCC TGACGGAGCT GTCCGCGCCG GCGGTGGTCG TCGTGGCCGG GTCCACGGGT GCCGGCAAGT CGACGCTCGT GAACTCCCTG GTCGGGCGTG AGGTGACCGC GGCCGGCGTG CTGCGACCGA CGACGCGCGA GCCCGTCCTC GTGCACCACC CGCTGGACAC GGACCTGCTG TCCCACCACC CCGTGCTCGA CGAGGTCGAC GCCGTCGCGG TCGACACCGT GCCGCGGGGC ATCGCGATCC TCGACGCACC CGACCTCGAC TCCGTCCTCG ACTCCAACCG CGACACCGCG CACCGTCTGC TCGAGGCGGC CGACCTGTGG CTCTTCGTCA CGACCGCGTC GCGCTACGGG GACGCGCTGC CGTGGCAGGT GCTCCGCTCG GCCGTCGAAC GCAGCACGTC CGTCGCGATG GTGCTCAACC GCGTGCCCGC CGCCTCGCTG CCCACCGTGC GCGGCGACCT GCTCGAGCGG CTGCGTGCCC ACGGCCTGGC GGGATCCCCG CTCTTCGTCA TCCCCGACGT GGGTCCGCAC TCCGGCCCGC TGGCCGGCCC CGTCGTGGCG CCCGTCCTGC GCTGGCTCAC CACGCTGGCC GGCCCGGACC GGGCCCGCAC GGTCGTCGCC CGCACGCTGC GCGGCTCGCT CGCCGCACTG CGCCCGTGGG TCGACGAGCT CGCGGAGGCC GTGCAGGACC AGGCCGACGC CGCGGCACGG ATCTCCCGCA CGCTGGACGA GGCAACCGCC GCACCGGGCG ACGCCGCCGC CCGGACGGTG CGCTCGGGAG CTGTCGCCGA CGGCGCCGTG CGCGCCCGCT GGGCCGAGCT CGTCGCCAAG GGAGCACCGT TCGCGCGCCT CGTCGGCCGG TCGGGACGCG TCCGCGGCTC CTCGCGCACC GCACGCGCCC GCGCGGCCGC GGTCGCGCCC CTCATGTCGG ACCTGACCGA GTCCACGGCG TCGGTGCTCA CGGCGGTCGG GCTGCGCGCG GGCGCCGCGC TGCGCGCGTC GCTCACCGGG CCGCAGGCAC CGCCCGGCGG GGACTCGGTC CTCGCGCGCT GGCCCGACGG CGAGGCGTCG CGCGGGGCGG CCGCCGAGCG CGCCGCCCGT GCGTGGTCGG GAGAGGGTGC GCGGCACGTC CGGGTGCTGC TCGCCGGGAG CGGCGCGGAC GCCCGTCGCC GGGCGCAGGT GAGCAGGGCC GTGGGGGAGG AAGGGCTGAC CGCCCTCGTC CTCGCCGCGG CCGCCGGCGT CGACGAGGCG GCCGCGGCCG CCCGCACGCT GCTGGGCGAC CCCGCCGACG AGGTCGTCAC CGCGTTGCGC GACGACCTCG CGCGGCGCGC GCGCACGCAG GTGGACCTCG AGCGCACCAT CGCCGAGCGC ACGCTCGACG ACCCCGACCT CGCGGCGGAC GCGTCGTCGC GCCTGCGTCT GCGACTCGCC GTCCTCAAGG GGCTGACGTG A
|
Protein sequence | MSAQPGTVPA QTSEPAPGAV AEPATAPQIL ARPLAQTSLL DAVRDLRRDV DATSFPLEIP GVGDARASRA RLVDQLDEHL VPRLTELSAP AVVVVAGSTG AGKSTLVNSL VGREVTAAGV LRPTTREPVL VHHPLDTDLL SHHPVLDEVD AVAVDTVPRG IAILDAPDLD SVLDSNRDTA HRLLEAADLW LFVTTASRYG DALPWQVLRS AVERSTSVAM VLNRVPAASL PTVRGDLLER LRAHGLAGSP LFVIPDVGPH SGPLAGPVVA PVLRWLTTLA GPDRARTVVA RTLRGSLAAL RPWVDELAEA VQDQADAAAR ISRTLDEATA APGDAAARTV RSGAVADGAV RARWAELVAK GAPFARLVGR SGRVRGSSRT ARARAAAVAP LMSDLTESTA SVLTAVGLRA GAALRASLTG PQAPPGGDSV LARWPDGEAS RGAAAERAAR AWSGEGARHV RVLLAGSGAD ARRRAQVSRA VGEEGLTALV LAAAAGVDEA AAAARTLLGD PADEVVTALR DDLARRARTQ VDLERTIAER TLDDPDLAAD ASSRLRLRLA VLKGLT
|
| |