Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0743 |
Symbol | |
ID | 9144614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 808622 |
End bp | 809887 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | putative glycosyl transferase |
Protein accession | YP_003635853 |
Protein GI | 296128603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.136224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00617937 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCAGT TCCGCACGAC CACGGACGAC GCGCGCGCCG CGTCGCCGGT GGACGCGCCC GCCGGCTCCC CCGACGCCAC GGCACCCGTC GACGTCACGC GCCCGCCCCG GTCCGCCGCC GACCACACCG ACGACCACGC CGGCGCGGCA GCGGCCATCA CCCGTCCGGA CGCACCCGGC CCCGCGCAGC GGCTACGCGT GCTGCACACG CTCAACCCGC CCGACGGCAC CACGCGCTAC GTCGACCAGA TGCTCGGCGG CGCGGGCGGC GACGTCGACG CGCTGCCGTT CACCTGGACC ACCGCGCTGC GCGGCGGGTA CGACGTGCTG CACGTCCACT GGCCCGAGCT GCTCGTGCGG CACCCGCGTC CGCTGCGGCG GGCCGCCCGG CACGTCGCGC TGCACCTGCT GCTCGCGCTG CTGCGCGTGC GCCGCGTACC CGTCGTCCGG ACCTTGCACA ACACCGTGCC GCACGAGTCC GTCGGCCGCG CCGAGGCCCG CGCGCTCGCG GCCGTCGACG CCGCGACCCG CCTGTACGTG CTGCTCACCC CGGCCACGCG CGCACCCGGC GACGCCGCGA GCGTGCAGAT CCCGCTGGGC CACTACCGCG ACGCGTTCGC GCACCTGCCG CGCGTCGCCG CCACCCCCGG GCGGCTCGTC ACCATCGGCC TGCTACGCCC CTACAAGGGC GTCGAGGAGC TGCTCGCGGC GTTCACCGCG CTGCCGGGCG ACCACCTCAG CCTCACCGTC GCCGGCAAGC CGACGCCCGA GATCGCGCGC GTCGTCGAGG ACGCCGTCGC ACGCGACCCG CGCATCACCG CCGACCTGCG GTTCGTGCCC GACGCGACGT TCGTCGAGCA CGTCACCGCC GCCGAGCTCG TCGTGCTGCC CTACCGGCAG ATGACCAACT CCGGCGTGCT CGTCGCGGCC CTCTCGCTCG ACCGGCCGTG CCTCGTGCCC GCCTCGCCCG CCAACGCCGC GCTCGCCGCC GAGGTCGGCG AGGGCTGGGT CCTGCAGTAC GACGGCGAGT TCGACGCCGC CGTGCTCGCC GACGGCCTGC ACCGGGCGGC CACCACGCCC CGCAGCGCAC GCCCCGACCT GTCGGCGCGC GACTGGCGCG TGCTCGGGGC GGCGCACGAC GACGCGTACC GGACGGCGGT CGCGCTCGCG CGGGGCGGCG CCCGCGTGAG CGGGACGCCC GCGCCCGGCA CGCCCGCGCA CGGGACGCCC GCGCCCGGCA GGCTCACGGT CGGCGGGCAC GGATGA
|
Protein sequence | MEQFRTTTDD ARAASPVDAP AGSPDATAPV DVTRPPRSAA DHTDDHAGAA AAITRPDAPG PAQRLRVLHT LNPPDGTTRY VDQMLGGAGG DVDALPFTWT TALRGGYDVL HVHWPELLVR HPRPLRRAAR HVALHLLLAL LRVRRVPVVR TLHNTVPHES VGRAEARALA AVDAATRLYV LLTPATRAPG DAASVQIPLG HYRDAFAHLP RVAATPGRLV TIGLLRPYKG VEELLAAFTA LPGDHLSLTV AGKPTPEIAR VVEDAVARDP RITADLRFVP DATFVEHVTA AELVVLPYRQ MTNSGVLVAA LSLDRPCLVP ASPANAALAA EVGEGWVLQY DGEFDAAVLA DGLHRAATTP RSARPDLSAR DWRVLGAAHD DAYRTAVALA RGGARVSGTP APGTPAHGTP APGRLTVGGH G
|
| |