Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0954 |
Symbol | |
ID | 9144829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 1051027 |
End bp | 1054299 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003636060 |
Protein GI | 296128810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0173723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0708717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATGGC GTGGCCGGCT GCTGCGCGGC CGAGGTCAGG ACCAGGCCCC GGTCGTGCTG GTCGTCACGC TCGTGGCGGT CGCGTCCGCG ACCCTCGTCG GCCTGCTCGC CGGGCTGCTG CACGTCGCCG AGCGCGACGC CGTGCCGCAG GCCATCAGCC GGCTCGACCC GCAGCGCACG CACCTCGAGG CGACGCTGTG GGTGCGCGGG GACGACGTCG AGCCGGCGCT CGACAGGGCC CGCGACGGCC TCGCCCGCAT CACGGGGGAC GTCCCGACCA CCGAGCGGAC GTGGCTGATC GGCGGGCTGC GCGCGCTGCC GACCGAGCTC GGCGTCCCAC CCGAGCTGAC GTACCTCGCT GCACTCCCCC ACGACGACGA GGACCTCGTG CGCCTCGCGA GCGGACGCTG GCCCGCCGCC TCGACCGACG CGGACGGCCG CGTCGAGGTG AACGTCCCGG TCGTCGCCGC GCAGGCGCTC GGGTGGGAGG TCGGCAGCAC CGTCCACGCC CGGCCGTGGG GCGAGGAGCA CGGCGACGCG TTCGTCGTCG TCGGGACCCA CGAGCCCGCG GGTCCGCGCA GCGCCTGGTC GCGCGACCGG CTGCGCGGCC AGGGACGCAG CGCAGGGTTC AACCTGCCCG GGTCGGCCGG TCTGATCCGC ACCACCGCGT GGGGACCGCT CGTCGTCGAC CCCGCGGTCC TCACCAGGCC CCAGATGGTC GACACCGCCT ACCTCGTGGT CGAGCCCGAC CTCGCCGCGT CGACGGCCGA CGCGGTGGCG GCGCTGCGCA CGCAGGTCGA CGACGGCGCG CGGATCCTCT CCGACGCACT GACCGGCCCC GTCAGCGGAC GGCTGCAGAC GGACGTCGAC ACGACGATCG ACGCGACGTG GCGCGAGCTG GTGGTGACGC GCGCCGCGGT GGTGACGATC GGCCTGCTGC TGGGCACGCT GGCCACGACC GTGCTGCTGC TGACGGCCCG CCTGCTCGCG GAACGGCGAG CCGGGGAGGC CGAGCTGCTC GCAGCCCGCG GCGCGTCTCC TGCGCAGCTG CGCTCGACGG TGCTGCTCGA GGCCCTCGTG CTGGCGACCC TCACGTGGCT CGTCTCCCCG TGGCTCGCGC GCGGCGCGCT GGCGGTCGTG ACCCGCTCGG GCCCGCCGGC CGAGGCCGGC TACACCGTGC CGGAGGGCGT GCCGGGCGGC GTGCTGCTCG CCTGCGGCGC CATCGCCGTC GCTCTCGCCG TCACGCTGTG CGTGCCCGCG TGGCACACCG CGGGGTCGAC GTCGCGCTCC GTGCACGGCG GACTGCTGCG CGTCGGCGGT GACCTCGCGC TCCTCGTGCT GGGCGCGCTG GCGCTGGGCC AGCTCGTCGC CTACGGCTCG CCGCTGACGC GCGGCGCCGA CGGCCCGCGG CTGGACCCCG TGCTGGTCGC CGGTCCCGCG CTGGTGTGCC TCGCGGCGGC GACGGTGGCG CTGCGCCTCG TCGCGCCCGT CGCGCGCGCC GGCGAACGGC TCGCGCGTGG CGCCCGCTCG CTGGTGCTGC CGCTCGCGGC CTGGCAGGTG GCACGGCGGT CCGCGGTGGC CACCGGCACC GTGCTCGTGG TGGTCGTCGC AGTCGCGGCC GGCACGTTCA GCGCCGCGTT CGCGGCGACC TGGCGCACGT CCCAGGTCGA GCAGGTCGAC CTCGCGCTGG GCACCGACCT GCGGGCCGAC GCCATCGAGG AGGACCCCCT CGCCGCGTCG GCCGCGCTCG CCGCCGCGAC CGCCGCGTAC CCGGACGCGC ACGGCCAGCC CGTCACGGAC CGCGTGGTCG GGATGGGCCC GCGGGGCAAC ACGGGCGGGG TCGGCGGCAG GCTGGTCGCG CTCGACGCGT CCCGTCCTCA GGACCTGCGG GGGCGCTCGA CCACCCCGTG GCGCGAGGTG GTCGCCGGGC TGCACGCCGA CGAGCCGTCG ACGACCGTCG GGACCGAGCT GCCCGCCGGC ACGCAGTGGC TCGTGCTCAC GGGCTCGGTC GACACCGACC CCTTCCTCAG CGGCACCGCG GTGCTGGGCC TCGGCGTCGA GGACGACCAG GGTGTCCTCA CGCAGCTCCC GCCGCGCACC GCGCCGCTCG GGAGGCCGTT CGAGGTCGTG CTCGAGGTGC CCGTCGCCGA CCGGCTGCGT GTCGTGGCAA CCGACCTGAC CGTGAGCGTC CACGAGCCCG AGAGCGTCCT GACGACCGAC TCCCGCCTGG TGCCCGTGCG GACCACCCTG ACGTCACTGC GCGCGGTCCC GCGCTCGGCC GGCATCGGCA GGGAGCTCGA CGTCCACGAG GCACCCGCGC AGCCCGTGCC GCTGCGCCTC GACGGCTGGA CCGGGGTCGT CACGCAGGGC GAGAGCACCG TGGGCGCCCC GGAGCTCGTA CCGGTGGGTG CCCCGGGCGC GTGGCACGTG ACCGGCACGA TGAAGGTCGA CACGTGGGGC ACCCCGCCGG TGCGCGTGCT CAGCGCCGCG TGGCCGCTCC CCCGGACCGT GCCCGCCGCC GTGAGCGAGT CGGTGCTCGA CGTCCTCGAG ACGCGGCAGG GCCTGACGAT CACGGTGGCC GGCGTGTCGG TCCCGCTCCA GGTCGAGCGG CTCGTCGAGC AGGTCCCGGG CGTGCCGCGC GGGATCGGGG TCGTGGTCGA CCGCACGACA CTCTCGCGGG CCGTGCTCAC CGCGGGCGGC CGCACGGACC TGCTCGACTC CTGGTGGGTC GCGGCCCCGG CGACCACCAC CGCTGCGCTC GCCGCCGACC TCGCGGGCGT CGACGCCGAC GTCACGACAC GGGCGGCCGA GCGGCACGAC GCGCTCACCG GGCCCGTACG GGTCGCGGTC CCGACCGCGG TGTCGCTGGT CGCGGTGTCC GCCGTGCTGC TCGTGCTCGT CGGCACGGGC GCCGTCGCCG CCGCGTCGCT GCGCTCGCGC CGCCTGGAGC TCGCACGGCT CCAGGCGCTC GGCGCCTCGC GTGCCGGGCT CGTCGGAGGG CTGCTCGCCG AGACCACGCT GCTCGTCACG GTGGGGGCCC TCGCGGGGCT CGCGGCCGGC TACGGGCTGG CCGCCGCGGT CGCCCCGCTG CTGACGATGT CGCCCGACGG CCGCACGCCC GCACCCGAGC CCTGGCTCGT GTGGGGCTGG GGAACGCAGT CGCTGCGCAC CCTCGGCGTC GTCGCCGCCG CGTGCGCCGT CACCGCCCTC GTCGCCGTGC TCGGCGTGCG CCGCACGTCG GGAGCCGCGC TGCGGATGGG GGACGACCGA TGA
|
Protein sequence | MRWRGRLLRG RGQDQAPVVL VVTLVAVASA TLVGLLAGLL HVAERDAVPQ AISRLDPQRT HLEATLWVRG DDVEPALDRA RDGLARITGD VPTTERTWLI GGLRALPTEL GVPPELTYLA ALPHDDEDLV RLASGRWPAA STDADGRVEV NVPVVAAQAL GWEVGSTVHA RPWGEEHGDA FVVVGTHEPA GPRSAWSRDR LRGQGRSAGF NLPGSAGLIR TTAWGPLVVD PAVLTRPQMV DTAYLVVEPD LAASTADAVA ALRTQVDDGA RILSDALTGP VSGRLQTDVD TTIDATWREL VVTRAAVVTI GLLLGTLATT VLLLTARLLA ERRAGEAELL AARGASPAQL RSTVLLEALV LATLTWLVSP WLARGALAVV TRSGPPAEAG YTVPEGVPGG VLLACGAIAV ALAVTLCVPA WHTAGSTSRS VHGGLLRVGG DLALLVLGAL ALGQLVAYGS PLTRGADGPR LDPVLVAGPA LVCLAAATVA LRLVAPVARA GERLARGARS LVLPLAAWQV ARRSAVATGT VLVVVVAVAA GTFSAAFAAT WRTSQVEQVD LALGTDLRAD AIEEDPLAAS AALAAATAAY PDAHGQPVTD RVVGMGPRGN TGGVGGRLVA LDASRPQDLR GRSTTPWREV VAGLHADEPS TTVGTELPAG TQWLVLTGSV DTDPFLSGTA VLGLGVEDDQ GVLTQLPPRT APLGRPFEVV LEVPVADRLR VVATDLTVSV HEPESVLTTD SRLVPVRTTL TSLRAVPRSA GIGRELDVHE APAQPVPLRL DGWTGVVTQG ESTVGAPELV PVGAPGAWHV TGTMKVDTWG TPPVRVLSAA WPLPRTVPAA VSESVLDVLE TRQGLTITVA GVSVPLQVER LVEQVPGVPR GIGVVVDRTT LSRAVLTAGG RTDLLDSWWV AAPATTTAAL AADLAGVDAD VTTRAAERHD ALTGPVRVAV PTAVSLVAVS AVLLVLVGTG AVAAASLRSR RLELARLQAL GASRAGLVGG LLAETTLLVT VGALAGLAAG YGLAAAVAPL LTMSPDGRTP APEPWLVWGW GTQSLRTLGV VAAACAVTAL VAVLGVRRTS GAALRMGDDR
|
| |