Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0124 |
Symbol | |
ID | 9143989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 149535 |
End bp | 152675 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | putative exonuclease |
Protein accession | YP_003635243 |
Protein GI | 296127993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0895241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTGC GCTCGCTCAC CGTCCAGGCG ATCGGCCCGT TCGCAGGCCG GCACACGGTC GACCTCGACG CGCTCGGGCA GTCCGGGCTG TTCCTGCTGG AGGGGCCGAC CGGGTCGGGC AAGTCGACGC TCATCGACGC CGTCGTGTTC GCGCTGTACG GGAAGGTCGC AGGCACCGAC GCGTCCGACG AGCGCCTGCG CTCGGCGTAC GCGGCGGACG ACGTGGAGAG CGTCGTCGAC CTGGTCTTCG AGGTGCCGTC GGGCGTCTAC CGCGTGCGGC GCACCCCCGC GTACCGCCGG GCCAAGCGCC GTGGCGAGGG CACGACGACG GCGCAGGCGA GCGTCAAGGC CTGGCGCCTG CCGGCCGACG TGGAGCTGGG CGAGGACCCC GAGGAGCTCG ACGGCGTCGG CGTGCTGCTC GGCACGCGCC TGGACGAGGT GGGCCACGAG CTGCAGCGGA TCGTCGGGCT GGACCGCACG CAGTTCGTCC AGACGGTCGT GCTGCCCCAG GGCGAGTTCG CGCGGTTCCT GCGGGCCACC GGCGAGGAGC GCCGGGTGCT GCTGCAGAAG ATCTTCGGCA CCCAGGTGTA CGAGCAGCTG CAGCAGCGGC TCGCCGCGCT GCGGGCCGAG GCGGCGCGGA CCGTCGAGGC GGCGCGTGCC GGACTCGGGG AGGCGGTCGC CCACCTGCTC GGCGCCTGCG CGCTCGGCCC GGAGGAGCAC GCGGCGGCAC GCGTCGCGCT CGACGAGGCC GTCGCGGGCG GCCCGGGCGT CGCGGTGCGC GTCGCGGGCG TCGTCGTGGC CACGACGCGC GCGCTGGACG CCGCCGCCGG CACGCTCGCG CAGGAGGCTG CCTCCGCCCG CACGTCATGG GACGCGGCAC GGGCCGCGCA CGAGGCGGCG CGCGCGACCG CGGCTCTCGC CGCCCGGCGG GACGCGCTGC GCGCGGAGCG TCGCGCGCTC GACGCCGCGG CCGGGCAGCA CGAGGACGAC GTGCAGCGCC TCGCACGGGC GCGCGCGGCT GCCGCGGTGC GTCCGCTGCT CACCGGGTGG CAGGACGCGA GGACGGCGCA CGAGGCCGGG ACCAAGTCGC TGGTCGCTGC GTGCGACACG GCCCCGGCGG ACCTGCTGCC GCCCGGCGGC GCGGCGGTCC TCGTCGTCGG CGGGCGCGAC GGCGGTCCGG GACCCGACGC CGGCGGCACC GAGCGTCTCG ACGGTGTGCT CGACGCGTGG CGGCCCCACC TGCGTGCCGA GCGTGACGCG GCCGCGGACG CCGCCGCCGC GTTGCGTCGC ACGGTCGAGG TCGAGGCCGG GCTCACCGAG CGCCGCCGCG CGGTGCGCGA GCTCGCGGCG GTGCTCGAGG AGCTGCGGTC GGAGGTCGAC GCCGCGACCA CGTGGCTCGC CGGACGCCCG GGGAGCCGAG CGGCCCTCGA GCAGGAGCGG GACGCCGCGC GCGCGCTCGC GGGCCGGTCC GACGCCGCCG AGCAGGCCCG GACGGCCGCG CGGGCTCTCG TCGCGGACGT CGCCGCCCTG ACGGCGGCGC GCGCCGACCT CGCAGCCGCG CAGGACGCCG TCGCCGGTGG TGCGGACGCC GCACGGGCCG CGGTCGCCGC CGAGGCGTCG CTGCGTGCCG CACGCGTCGC CGGCCTGGCC GGCGAGCTCG CGGCCGGTCT CGTCGCCGGC GACCCCTGCC CGGTGTGCGG TGCCTGTGAG CACCCTGCGC CGGCGTCGGT GGGCGCCGAC CACGTGACCG CCGAGCAGGT GCGGGCCGCC GAGGAGGCGC GCGCCGCTGC CGAGTCGCAG CTCGCCGCGC TCGGGGCGCG TCGGGCGGCG CTCGCCGAGC GCGTCGCCGG GCTGGCCGCG CGCGTCGGGG AGCACGACGC CGGTGCGGCC GCGGCGCTGC TGCGGGCGGC CGAGGACGAC GTCGCCGCTG CTGCCGCCGC CCACGCCCGG GTCGCCACCC TCGAGGCCGA CCTCACGGCG CACGACGCGG CCACCCGGGA CCGTGCGCAG GTGCGCGAGG AGGTCCTCGG CCAGGTCCGC GCCGCCGAGC TCACGCTCGA GACCGAGCGC GACGCTCTCG GGCGGGCCGA GGCCGAGGTC GCGGACGCGC GCGCCGACCA CCCGACCGTC GCCGCCCGGC ACGCCGCCCT CGACGCGCGG GCGCGCCAGG CGGTGGCGCT GCTCGACGCC CTCGACACCG AGCGTGCCGC GGCCGCCGAC GAGCAGCGTC GTCACGCCGA GCTCGACGCC GCGCTGGCCG AGCACGGGTT CGCGGACGTC GCCGACGCCC GCGGCGCCTG GTGCCCCGCG ACGGAGCTGG CCGACCTCGA GCGTCGCGTC GTCGCCCGCA CCGCCGACGA GGCACGCGTC GACGCCGGCC TGGCCGACCC CGCGCTCGTC GCGCTGCCCG AGGACGTCGC ACCCGACCTC GCCGGGACCG AGGGTGCCGA GCGTGCGGCG CGGCGCAGCG CCGACGACCT CGAGGGCCGC GCCCGCGTCG CCGCGGCGCG CGCCGAGGCC GCCGCGGACG CCGCAGCACG CGTGCGCGCC GCGGCCGAGG CGCTCGACGC CGCGGCAGCC GCAGCGGCGC CCGTGACCCG CATGGCGAAC CTCGCGTCCG GGACGGGCGC CGACAACCCG CACGCCCTGT CGCTGGCCAC CTACGTGCTC GGGCGTCGCT TCGAGGACGT CGTCGCCGCC GCCAACGAGC GTCTCGCCGT GATGTCCGAC GGGCGGTTCG AGCTCGTCCG GTCCGACGAG AAGGAGGACG TGCGTACCCG GGCGGTCGGA CTGGCGATGC GCGTCGTCGA CCACCGCACC GAGCGTGCGC GTGACCCGCG GACCCTGTCC GGCGGCGAGA CGTTCTACGT GTCGCTGTGC CTCGCGCTCG GCATGGCCGA CGTCGTCACG GCCGAGGCGG GAGGCGTCGA GCTCGGCACC CTGTTCGTGG ACGAGGGCTT CGGTGCGCTC GACCCGCACG TCCTCGACCA GGTGCTGGCC GAGCTCGGCC GGCTGCGGCA CGGCGGGCGC GTCGTCGGCA TCGTCTCCCA CGTGGAGACC CTCAAGCAGG CGGTCGCCGA CCGGATCGAG GTGCGGCCGA CGCCCGCCGG TCCGAGCACG CTCACCGTGC TCGCAGGCTG A
|
Protein sequence | MRLRSLTVQA IGPFAGRHTV DLDALGQSGL FLLEGPTGSG KSTLIDAVVF ALYGKVAGTD ASDERLRSAY AADDVESVVD LVFEVPSGVY RVRRTPAYRR AKRRGEGTTT AQASVKAWRL PADVELGEDP EELDGVGVLL GTRLDEVGHE LQRIVGLDRT QFVQTVVLPQ GEFARFLRAT GEERRVLLQK IFGTQVYEQL QQRLAALRAE AARTVEAARA GLGEAVAHLL GACALGPEEH AAARVALDEA VAGGPGVAVR VAGVVVATTR ALDAAAGTLA QEAASARTSW DAARAAHEAA RATAALAARR DALRAERRAL DAAAGQHEDD VQRLARARAA AAVRPLLTGW QDARTAHEAG TKSLVAACDT APADLLPPGG AAVLVVGGRD GGPGPDAGGT ERLDGVLDAW RPHLRAERDA AADAAAALRR TVEVEAGLTE RRRAVRELAA VLEELRSEVD AATTWLAGRP GSRAALEQER DAARALAGRS DAAEQARTAA RALVADVAAL TAARADLAAA QDAVAGGADA ARAAVAAEAS LRAARVAGLA GELAAGLVAG DPCPVCGACE HPAPASVGAD HVTAEQVRAA EEARAAAESQ LAALGARRAA LAERVAGLAA RVGEHDAGAA AALLRAAEDD VAAAAAAHAR VATLEADLTA HDAATRDRAQ VREEVLGQVR AAELTLETER DALGRAEAEV ADARADHPTV AARHAALDAR ARQAVALLDA LDTERAAAAD EQRRHAELDA ALAEHGFADV ADARGAWCPA TELADLERRV VARTADEARV DAGLADPALV ALPEDVAPDL AGTEGAERAA RRSADDLEGR ARVAAARAEA AADAAARVRA AAEALDAAAA AAAPVTRMAN LASGTGADNP HALSLATYVL GRRFEDVVAA ANERLAVMSD GRFELVRSDE KEDVRTRAVG LAMRVVDHRT ERARDPRTLS GGETFYVSLC LALGMADVVT AEAGGVELGT LFVDEGFGAL DPHVLDQVLA ELGRLRHGGR VVGIVSHVET LKQAVADRIE VRPTPAGPST LTVLAG
|
| |