Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0468 |
Symbol | |
ID | 9144334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 497325 |
End bp | 499286 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | protein of unknown function DUF1565 |
Protein accession | YP_003635582 |
Protein GI | 296128332 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.672167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGG TTCTCCACGT CTCCGTCCAC GGATCCGACG ACGCCGTCGG TACGCAGGAC GCCCCGCTGC GCACCATCGA CCGCGCAGCC CGGCTCGCGC GCCCCGGTGA CACGGTCACC GTGCACGCCG GCACGTACCG CGAGTGGGTC CGCCCGCGCC GCAGCGGCCG CGGCGAGAAC CGCCGCATCA CCTACCAGGC GGCGCCCGGC GAGCACGTGC GCATCACGGG CTCCGAGCAG GTGACCGGCT GGGAGTCCCT GGGTGGCGGC GTGTGGCGCG TCGAGGTGCC GAACGCCCTG TTCGGTGAGT TCAACCCCTT CGCCGTCGAG GTCGACGGCG ACTGGATCGT GCGCCCGGGG CGCGACGAGC CGAAGAAGCA CCTGGGTGCG GTGTACCTCG ACGGGCGTCG CCTGCACGAG GTCGCGACGG CCGACGAGGT CCCGGACGCC CCGCGCCGCG AGGAGATCGT CGACGACTGG ACCGGCACCG TCGTGCCCGT CCCGGACCCC GACCGGACGC CGCGCGTGTG GCACGCCGAG GTCGGCGCCG ACGTCACGAC GATCACCGCG AGCTTCGGCG ACGCCGACCC GAACGCGGCG CTCACCGAGA TCAACGTGCG CCCGACCGTG TTCTGGCCGC AGGACCACCA CGTCGACTTC ATCACCGTCC GCGGCTTCGA ACTGTGCCAG GCGGCCACGC AGTGGGCGCC CCCGACCGCG AACCAGCCCG GCCTCATCGG GCCCAACTGG GCGCGCGGCT GGGTCATCGA GCACAACGAC ATCCACGACG CCACGTGCTC GGCGGTCTCG CTGGGCAAGG AGGCGTCGAC GGGCGACAAC TACGCCACCG ACCGCGGCGA CAAGCCCGGG TACCAGTACC AGCTGGAGTC GGTGTTCTCG GCGCGGCAGA TCGGCTGGGA CCGCGAGCAC ATCGGCTCGC ACGTGGTGCG GGACAACCAC ATCCACCACT GCGGCCAGAA CGCGGTCGTC GGGCACCTGG GTTGCGTGTT CTCGCGCATC GAGCGCAACC ACATCCACGA CATCGCCAAC GACCGCGCGT TCTACGGCCA CGAGATCGCG GGCATCAAGC TGCACGCGCC CATCGACGTC GTCATCGCCG ACAACCGCAT CCACGACTGC TCGCTCGGCA TCTGGCTGGA CTGGCAGACG CAGGGCACGC GCATCACGCG CAACGTGCTG TGGGCCAACA GCCGCGACCT GTTCATCGAG GTCAGCCACG GCCCGTACGT CGTCGACCAC AACGTGCTGA CGTCGCCGGT GTCCGTGGAG AACCACTCGC AGGGCGGCGC GTACGTGCGC AACCTGCTGT GCGGGACGGT CAACCTCAAG CAGATGCTCG ACCGCGCGAC GCCCTACCAC CGCGCGCACT CCACCGACGT GGCGGGGTAC GCGATCATCC TCACCGGTGA CGACCGGTGG ATCGGCAACG TGTTCGCCGG CGGTGACCTC GACAAGGCGT ACCACCCGGA CTCGTGGGGC CGGATCGGGT CGCAGACGGG CACCGCGGCG TACGACGGGT TCCCGACGAG CCTCGAGCAG TACCTCACGG AGATGGGGGA CCGCTGGGAC GGCGACCACA ACCGCTTCGG CAGCCGCGTG CAGCCGTACT GCTCGCGCGG CAACGTCTTC GCCGGCGGGG CGCGCCCGGC GGACGTCGAG GTCGACCCCC TGGTGCTCGA CGGCACGCCG CGCGTCGAGG TCGTCACGCA GGGCGACGAG GTGTGGCTCG AGGTCGACGT GCCCGGCGCC GACGCGGCCG TACTGGACGC GCTCACGGGG GCGGACCTGC CCCCCGTGCG ACTGGTGGGC CTGGAGTTCG AGGACGTCGA CGGCAGCCCC ACGGCGTTCG ACACGGACGT CGCCGGCGAG CAGCTCGACG GCCCGCACCC CGCGGGCCCG CTCGCGGGCG GCCTGAAGAG CGCGCGCCTG CGTCTGCTGT AG
|
Protein sequence | MAQVLHVSVH GSDDAVGTQD APLRTIDRAA RLARPGDTVT VHAGTYREWV RPRRSGRGEN RRITYQAAPG EHVRITGSEQ VTGWESLGGG VWRVEVPNAL FGEFNPFAVE VDGDWIVRPG RDEPKKHLGA VYLDGRRLHE VATADEVPDA PRREEIVDDW TGTVVPVPDP DRTPRVWHAE VGADVTTITA SFGDADPNAA LTEINVRPTV FWPQDHHVDF ITVRGFELCQ AATQWAPPTA NQPGLIGPNW ARGWVIEHND IHDATCSAVS LGKEASTGDN YATDRGDKPG YQYQLESVFS ARQIGWDREH IGSHVVRDNH IHHCGQNAVV GHLGCVFSRI ERNHIHDIAN DRAFYGHEIA GIKLHAPIDV VIADNRIHDC SLGIWLDWQT QGTRITRNVL WANSRDLFIE VSHGPYVVDH NVLTSPVSVE NHSQGGAYVR NLLCGTVNLK QMLDRATPYH RAHSTDVAGY AIILTGDDRW IGNVFAGGDL DKAYHPDSWG RIGSQTGTAA YDGFPTSLEQ YLTEMGDRWD GDHNRFGSRV QPYCSRGNVF AGGARPADVE VDPLVLDGTP RVEVVTQGDE VWLEVDVPGA DAAVLDALTG ADLPPVRLVG LEFEDVDGSP TAFDTDVAGE QLDGPHPAGP LAGGLKSARL RLL
|
| |