Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0053 |
Symbol | |
ID | 9143918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 67515 |
End bp | 69161 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003635172 |
Protein GI | 296127922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00894848 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTGCGA CCCTCCAGGC CCGGTCCGTG GCCGCGGCGT TCGGTGACCG CGAGCTGTTC TCCGGCCTCG ACCTCGTCGT CGCGCCCGGC GACGTCGTCG GGCTCGTCGG CCCCAACGGC GCCGGCAAGA CGACGTTGCT GCGCATCCTT GCAGGTCAGC GGGCTCCGGA GGCGGGGGTG GTGGCGCTGT CACCGCCGAC GGCGCAGGTC GGCTACCTGG TGCAGGAGGT CGAGCGCCGG GCGGACGAGT CGGTGCGGAC GTTCCTCGAG CGACGGACGG GCGTGGCCGA CGCGCAGGCC GCGATGGACG CGGCGTCCGA CGCGCTCGCG GCCGACGCGC CGGGGGCGGG CGACGCGTTC ACGCACGCGC TCGAGCGGTG GATGGCCCTG GGTGGCGCGG ACCTCGACGC GCGGCTCGGC GGCGTCGCCG ACGACCTCGG GCTCGCCGTC GACCTCGACC TGCCCATGAC GGCGCTGTCC GGCGGGCAGG CCGCGCGCGT CGGGCTCGCG GCGCTGCTGC TCTCGCGCTT CGACCTGTAC CTGCTCGACG AGCCGACCAA CGACCTCGAC GCCGACGGGC TCGACCGCCT GGAGGAGTTC GTCGCGCAGG CGTCGGCGCC GGTCGTGGTC GTCAGCCACG ACCGGGAGTT CCTGGCGCGG ACCGTCACCA CGGTCGTCGA GATCGACCGC AGCCTGCAGC GTGTCGCCAC CTACGGCGGC TCCTACGACG CGTACCTCGA GGAGCGCTCG ACGGCCCGCC GGCAGGCGCG CGAGGCGTAC GAGGACTACG CGGGCCGCCG GGACGCGCTC GCGGCCCGCG CCCGCATGCA GCGCGCGTGG ATGGAGAAGG GCGTGCGCAA CGCGATGCGC AAGGCCACGG ACGGCGACAA GAACGTCAAG CAGGGCCGCC GCGAGTCGTC CGAGAAGCAG GCGTCGAAGG CGCGGCAGAC CGACCGCATG ATCGAGCGGC TCGTGGTCGT CGAGGAGCCG CGCAAGGAGT GGCAGCTGCG CATGAGCATC GCCACCGCAC CGCGCTCGGG GTCCGTGGTC GCGACCGCGC GGGCCGCCGT CGTCCGCCGC GGGGACTTCG TCCTCGGCCC GGTGGACCTG CAGCTCGACT GGCAGGACCG CATCGCCGTC ACCGGGCCCA ACGGCTCGGG CAAGTCGACG CTGCTCGCCC TGCTGCTGGG GCGCCTGGCG GCCGACGAGG GCACCGCGGC CCTCGGGTCG GGCGTGCTCG TGGGGGAGGT CGACCAGGCA CGGGCAGCGT TCGAGCGCGA CGAGCCGCTG GGCGACGCGT TCGCGCGCGA GGTCCCGGAG TGGACGACGG CCGACGTGCG CACGCTGCTG GCGAAGTTCG GCCTCGCCGG CCACCAGGTG GGCCGCCCGG CGGCGTCGCT GTCCCCGGGG GAGCGCACGC GGGCGGCCCT CGCGCTGCTG CAGGCCCGCG GCGTCAACCT GCTGGTCCTC GACGAGCCGA CCAACCACCT CGACCTGCCG GCGATCGAGC AGCTCGAGCA GGCGATGGAG TCCTTCGACG GCACGATCCT GCTGGTCACG CACGATCGCC GGATGCTCGA CACCGTGCGG CTGACGCGCC GCTGGCACGT CGAGGACGGG CAGGTCACGG AGGTCCGGCC GGACTGA
|
Protein sequence | MSATLQARSV AAAFGDRELF SGLDLVVAPG DVVGLVGPNG AGKTTLLRIL AGQRAPEAGV VALSPPTAQV GYLVQEVERR ADESVRTFLE RRTGVADAQA AMDAASDALA ADAPGAGDAF THALERWMAL GGADLDARLG GVADDLGLAV DLDLPMTALS GGQAARVGLA ALLLSRFDLY LLDEPTNDLD ADGLDRLEEF VAQASAPVVV VSHDREFLAR TVTTVVEIDR SLQRVATYGG SYDAYLEERS TARRQAREAY EDYAGRRDAL AARARMQRAW MEKGVRNAMR KATDGDKNVK QGRRESSEKQ ASKARQTDRM IERLVVVEEP RKEWQLRMSI ATAPRSGSVV ATARAAVVRR GDFVLGPVDL QLDWQDRIAV TGPNGSGKST LLALLLGRLA ADEGTAALGS GVLVGEVDQA RAAFERDEPL GDAFAREVPE WTTADVRTLL AKFGLAGHQV GRPAASLSPG ERTRAALALL QARGVNLLVL DEPTNHLDLP AIEQLEQAME SFDGTILLVT HDRRMLDTVR LTRRWHVEDG QVTEVRPD
|
| |