Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2779 |
Symbol | |
ID | 9146687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3090973 |
End bp | 3092064 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Acetamidase/Formamidase |
Protein accession | YP_003637863 |
Protein GI | 296130613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.190953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000139746 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCTCG ACCCCGGCAC CGGCCCCGTC CGCGCCGCCA CCTACCTGCC GTCGACCCCG GACAACGTCC TGTGGGGCCG GCTGCCGTGC GCCGCCGACG CGCCCGTCCT CACGGTCGAC CCGGGCACCG AGGTCACGGT CGACACCCTG AGCCACGAGG GCCTCCTGGA GGACCAGGGG CGCGACCCGC GCGGGTTCTT CGGTGCGCAC GGCCTGATCG ACGTGCTCAC CGACGCGGTG GCGCTCGCGG CGTCGGACCA CCCGCACGAC CCGGTGGTCG ACGGCCCGCA CGTCGTCACC GGCCCGATCG CGGTGCGCGG CGCCCGGCCG GGCGACGTCC TGGCGATCAC CGTGCTCGAG GCCCTGCCCC GGGTGCCGTA CGGCGTGATC TCCAACCGGC ACGGTCGCGG CGCGCTGCCG GGCGAGATGC CCGAGGGTGG CGTCGACGTC GTCAGCGTGG TGGCCCGGCT CGACCCGACC GGCACCCGCG CGGTCATGCC CCGGGCGGCC GGCAGCGACC GCACCGTCGC GTTCCCCCTC GCGCCCTTCC CCGGCATCCT CGGCGTGGCC GTCGCGGGCG ACGAGCGACC GCACTCCGTG CCCCCGGGCG CCCACGGCGG CAACCTCGAC ATCAACCTGC TGCAGGCCGG TGCGACGCTG TACCTGCCCG TGCAGGTACC CGACGCGCTG GTCTACGTGG GCGACCCGCA CTTCGCGCAG GGCGACGGCG AGGTCGCGCT CACCGCGCTC GAGGCCTCGC TGCGGGTCAC GGTCCGCCTC GACCTGCTGC CCGGCGCGCA GGCGCTCGCC GAGCTGGGGC CGCTGACCGG TCCCCTCGTC CGCACGTCGG AGCACCTCGT CCCGACAGGT CTCGACGTCG ACCTCGACGA GGCGCTGCGG GCGTGCGTGC GCGCCGCTCT CGACCTGCTG CAGGCGCGCT TCGGCATGGA CCGCGCGCAC GCGCTGGCGT ACCTGTCGGC GGCCACGGAC TTCGACGTGT CGCAGGTCGT GGACCGCGTC ACGGGCGTGC ACGCCCGCAT CCGCCTGGCC GACTTCGACG AGCCGGGTGC CGGGCAGGTC ACCGGCGCAT GA
|
Protein sequence | MLLDPGTGPV RAATYLPSTP DNVLWGRLPC AADAPVLTVD PGTEVTVDTL SHEGLLEDQG RDPRGFFGAH GLIDVLTDAV ALAASDHPHD PVVDGPHVVT GPIAVRGARP GDVLAITVLE ALPRVPYGVI SNRHGRGALP GEMPEGGVDV VSVVARLDPT GTRAVMPRAA GSDRTVAFPL APFPGILGVA VAGDERPHSV PPGAHGGNLD INLLQAGATL YLPVQVPDAL VYVGDPHFAQ GDGEVALTAL EASLRVTVRL DLLPGAQALA ELGPLTGPLV RTSEHLVPTG LDVDLDEALR ACVRAALDLL QARFGMDRAH ALAYLSAATD FDVSQVVDRV TGVHARIRLA DFDEPGAGQV TGA
|
| |