Gene Information |
Locus tag | Cfla_3036 |
Symbol | |
ID | 9146948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3373048 |
End bp | 3375801 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | MMPL domain protein |
Protein accession | YP_003638118 |
Protein GI | 296130868 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
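The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Illustrative consistency check of the coordinate fields.
# Function names are hypothetical; coordinates are assumed 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3373048, 3375801)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (2754 bp) and Protein Length (917 aa) fields of this record.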
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.101076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCTCTG CCCTCTACCG CCTCGGCCGG GCCGCGTTCG CCCGACGACG TGCCGTCATC GGCGCCTGGG TGGGGCTGCT CGTCCTCATC GGCGCGGCCG CCGGCCTGCT CGGCGGCACG CTGGACAACT CGGTGTCGAT CCCCGGCACC GAGTCCCAGG CCGCGCTGGA CCGGCTCACC GCGACCTTCC CCCAGGCCGC GGGCACGACG GCCCAGGTGC TCGTCGTCGG CGAGGACGGT GCACAGGTCG ACGACCCGGC CGTGGTCACC GCCGTCGAGG ACTCCGTCGA CGCGTTCCTC GAGGTCGAGA GCGTCACGTC CGCCGTCTCG CCGTTCGACG ACACGCTGCC CGGCGCGTCG GCCGTCAGCG ACGACGGCGA GGCCGCGCTG CTGACCCTCT CGCTCGAGGG CGAGGGCGTC GCCATCGGCG ACGAGGTCAA GGACCGCCTG CGGGACGTCG CCGACGAGCT CGACGCCGCG CTGCCCGACG GGTACGACGC GACGATCGGC GGACAGCTCT TCTCGCAGGA GTTCCCGGGC CTGAGCATCG CGGAGGTGCT CGGCGTCGTC GTCGCGTTCA TCGTGCTGCT CGTGACGCTC GGCGGGTTCG CCGCCGCGGG CATGCCGCTG CTCAACGCGC TGCTCGGCGT CGGCCTGTCC ACGCTGCTGG TGCTCGTCGC CGCGGCGTTC ACGTCGGTCA CCAGCACCAC GCCGCTGCTC TCGCTCATGC TCGGCCTGGC CGTCGGCATC GACTACGCGC TGTTCATCGT CTCGCGCTAC CGCGAGCTGC TCGCCACCGG CCTGCCCACC CAGGAGGCCG CCGCCCGCTC CAACGCGACC GCGGGGTCCG CCGTGATCTT CGCGGGCCTC ACCGTGATGA TCGCGCTCGT CGGCCTGGGG GTCGCCGGCA TCCCGTTCCT CACCGTCATG GGCGTCGCCG GTGCCGCAGC CGTCGGTATC GCCGTCCTGG TCTCCATCAC GCTCGTGCCC GCGATGCTCG GCGTCGCCGG CGAGCGCCTG CGTCCACGCC CGTCGCGGCG TGCCCGCAAG GACGCGGCCG CCGGGACCGC GCCCGCCGCG GCTCCCGCGC CCGCCGCCGA CGGCGACACC TGGGACCTGC CCGAGCACCA CAACCGGTTC TTCGCCGGCT GGGTGCGCCT GGCCACGGCC CGCCCGTGGG TCACCGTCGT CGTGACGATC GGCGCGCTCC TGGCCCTCGC GTTCCCGGCG CTCGACCTGC GCCTCGCGCT GCCCGACGCC GGCGTCGCCC CCACCGACTC CTCGCAGCGC GTCACCTACG ACCGCATCAC CGAGCACTTC GGCCCCGGCG CCAACGGCCC GCTCGTCGTC ACCGGCAGCA TCGTCACCAG CGACGACCCG CTGGGACTCA TGGAGGACGT CGCCGACGAG CTGCGGGCCC TGCCCGGCGT CGACTCCGTG CCCCTGGCGA CGCCCAACGA GTCCATCGAC ACCGGCATCG TGCAGGTCGT GCCCACCACG GGCCCGACCG ACCCCGCGAC CGCCGACCTC GTCAACGCGA TCCGCGACCT GCGGCCGACG ATCCTCGAGA AGCACGGGTT CGACCTGGCC GTCACGGGCT TCACGGCCGT CGGCATCGAC GTGTCCGCCA AGCTGGGCGC GGCGCTGCTG CCGTTCGCGG TGTTCGTCGT CGGCCTGTCG CTGATCCTGC TGACGATGGT GTTCCGCTCG ATCGCCGTGC CGCTCAAGGC GACGATCGGC TACCTGCTGT CGGTCGCCGC CGCGTTCGGC GTCGTCACCG CCGTCTTCGA GCACGGCATC 
GCGGCCGACC TGCTCCACGT CTCGCGCCTC GGCCCGATCA TCTCGTTCAT GCCGATCGTC CTCATGGGCG TGCTCTTCGG CCTCGCCATG GACTACGAGG TGTTCCTCGT GTCCCGCATG CGCGAGGACT ACGTGCACTC CGGCAAGGCG CGCGCGTCGA TCGCCACCGG GTTCGTCGGC TCCGCCAAGG TCGTCACCGC GGCGGCCGTC ATCATGGTCG CGGTGTTCTT CGCCTTCGTC CCCGAGGGGG ACATCAACAT CAAGCCCATC GCGCTCGGCC TGGCCGTCGG CGTCGCGGTC GACGCGTTCG TCGTCCGCAT GACCCTCGTG CCGGCCGTCA TGCAGATCCT CGGCGAACGC GCCTGGTGGA TGCCGAAGGG CCTGGACCGC GTGCTGCCGT CGTTCGACGT CGAGGGCGAG GCGCTCCACC GCGAGATCAG CATGCAGGCG TGGCCGCACG ACCCGGACGT CGTCGTCGCC GCACGCGGCC TGCGGCTCGC TGCGCTCGAC CGCACCGACG TCGTGGACCT CGCGGTGCGA CGCGGTGAGG TGCTCGTCGC GCACGCCGAC GAGCCCGCCC GCCCCGCAGC CCTGCTGCTC ACCGTCGCCG GGCGCCTGGC ACCCGAGGCG GGCGACCTCA AGGTCGACGG GCTGCTCCTG CCCGTGCGCG CCGCCGCCGT GCGCCGCCGC GTCGGCTACG TCGACCTGCG CACCGAGGGC GTCGACGCGC TCGACGCCGC GGTCGCCGAG CGGCCGCCCG TGCTCGCCGT CGACCGCACC GACCTCGTCA CCGACCCGCA CGAGCGCGCG CACGTCGCCG CGGCACTGTC CCGTGCGCTG GACGCGGGCG CCACGCTCCT GCTCGGCGTC GTCGGCAGCA CCCCCGCCGA CGACCTGCTC CCCGCAGGCA CCCCCGTCAC GACCCTCGCA CCGCAGGCCG GAGCCCTCGC GTGA
|
Protein sequence | MSSALYRLGR AAFARRRAVI GAWVGLLVLI GAAAGLLGGT LDNSVSIPGT ESQAALDRLT ATFPQAAGTT AQVLVVGEDG AQVDDPAVVT AVEDSVDAFL EVESVTSAVS PFDDTLPGAS AVSDDGEAAL LTLSLEGEGV AIGDEVKDRL RDVADELDAA LPDGYDATIG GQLFSQEFPG LSIAEVLGVV VAFIVLLVTL GGFAAAGMPL LNALLGVGLS TLLVLVAAAF TSVTSTTPLL SLMLGLAVGI DYALFIVSRY RELLATGLPT QEAAARSNAT AGSAVIFAGL TVMIALVGLG VAGIPFLTVM GVAGAAAVGI AVLVSITLVP AMLGVAGERL RPRPSRRARK DAAAGTAPAA APAPAADGDT WDLPEHHNRF FAGWVRLATA RPWVTVVVTI GALLALAFPA LDLRLALPDA GVAPTDSSQR VTYDRITEHF GPGANGPLVV TGSIVTSDDP LGLMEDVADE LRALPGVDSV PLATPNESID TGIVQVVPTT GPTDPATADL VNAIRDLRPT ILEKHGFDLA VTGFTAVGID VSAKLGAALL PFAVFVVGLS LILLTMVFRS IAVPLKATIG YLLSVAAAFG VVTAVFEHGI AADLLHVSRL GPIISFMPIV LMGVLFGLAM DYEVFLVSRM REDYVHSGKA RASIATGFVG SAKVVTAAAV IMVAVFFAFV PEGDINIKPI ALGLAVGVAV DAFVVRMTLV PAVMQILGER AWWMPKGLDR VLPSFDVEGE ALHREISMQA WPHDPDVVVA ARGLRLAALD RTDVVDLAVR RGEVLVAHAD EPARPAALLL TVAGRLAPEA GDLKVDGLLL PVRAAAVRRR VGYVDLRTEG VDALDAAVAE RPPVLAVDRT DLVTDPHERA HVAAALSRAL DAGATLLLGV VGSTPADDLL PAGTPVTTLA PQAGALA
|
| |
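The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below shows one way the GC content field could be recomputed from such a grouped string. The function name is hypothetical, and how the record rounds its reported 74% is an assumption. The example input is only the first two blocks of the gene sequence, not the full gene.

```python
# Illustrative GC-content calculation for a space-grouped nucleotide string.
# The function name is hypothetical.

def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases, ignoring any whitespace between blocks."""
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "GTGTCCTCTG CCCTCTACCG"
print(f"{gc_fraction(first_blocks):.0%}")
```

Passing the complete gene sequence instead of the two-block fragment would give the value to compare against the GC content field.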