Gene Information |
Locus tag | Bcep18194_B0068 |
Symbol | |
ID | 3751922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 77208 |
End bp | 78392 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637764914 |
Product | cytosine deaminase |
Protein accession | YP_370829 |
Protein GI | 78060921 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
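The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions agree with the values in this record. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(77208, 78392)
print(length)                  # 1185
print(protein_length(length))  # 394
```

Both results match the Gene Length (1185 bp) and Protein Length (394 aa) rows.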
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC TGCTGATCCG CAACGTCCGC CCAAGCGCCG ACGCTGCGCT CGACATCCTG ATCGAAGGCG ATCGCATCGC GCGTATCGGC CCGTCGCTCG ACGCGCCCGC CGGCTGCGCG ATCGAGGACG GCGCGGGCGC GCTCGCGCTG CCCGGCCTCG TCGAGGGCCA CACGCATCTC GACAAGACGC ACTGGGGAAT GCCGTGGTAT CGCAACCAGG TCGGGCCGCG CCTCGTCGAC CGGATCGAGA ACGAACGCCA CTATCGCGCG ACGAGCGGGC ATGACGCCGG TGTCGCGTCG CTCGCACTGT CGCGCGCGTT CCTCGCGGCC GGCACGACGC GCATCCGCAC GCACGTCGAC ATCGACACCG AAGCCGGGTT GCGGCATCTG CACGGCGTGC TCGCGACGCG CGAGACGCTG CGCGGGCAGG TCGAGATCCA GATCGTCGCG TTTCCGCAAT CGGGCGTGCT CAAGCGTCCG GGCACCGACG CACTGCTGTC CGAAGCCCTT GCCGCCGGTG CCGATCTGCT CGGCGGGCTC GACCCGTGCG CGATCGAAGG CGATCCGGTC GAAGCGGTGG ACGTGCTGTT CGCGATCGCC GAGCGCCACG GCCGCGGGCT CGACCTCCAT CTGCACGAGC GCGGATCGAT GGGCGCGTAC TCGCTCGACC TGATCCTGCA GCGCACGGCT ACGCACGGCA TGCAGGGCAA GGTGACGATC AGCCACGGGT TCTGCCTCGG CGATATCGCC GAACGCGAGC GCGATGCGTT GCTCGCGCGG ATGGCCGAAC TCGGCGTCGG GCTCGTCACG ACCGCGCCGG CAGCGGTGCC GGTGCCGCCG GTGGCCGCGT GCCGCGCGGC GGGCGTGACG GTCATCGGCG GCAACGACGG CGTGCGCGAC ACGTGGACGC CGTACGGGTC GCCCGACATG CTCGAACGCG CGATGCTGAT CGGCATGCGC AATGATTTCC GTCGCGATGA TGCACTCGAA GTCGCGCTCG ATTGCGTGAC GCACAGCGCG GCGCGCGGTT GCGGCTTCGA CGGTTACGGG CTCCAGCCGG GCAGCCGGGC GGATGTCGTG CTGGTCGACG CGCTGACGTT CGCCGAGGCC GTTGTCGCAC GGCCGGTGCG GCGGCTCGTC GTGTCGTCGG GGAAGATCGT TGCGCGCAAC GGCGCGCTGG TGTGA
|
Protein sequence | MTNLLIRNVR PSADAALDIL IEGDRIARIG PSLDAPAGCA IEDGAGALAL PGLVEGHTHL DKTHWGMPWY RNQVGPRLVD RIENERHYRA TSGHDAGVAS LALSRAFLAA GTTRIRTHVD IDTEAGLRHL HGVLATRETL RGQVEIQIVA FPQSGVLKRP GTDALLSEAL AAGADLLGGL DPCAIEGDPV EAVDVLFAIA ERHGRGLDLH LHERGSMGAY SLDLILQRTA THGMQGKVTI SHGFCLGDIA ERERDALLAR MAELGVGLVT TAPAAVPVPP VAACRAAGVT VIGGNDGVRD TWTPYGSPDM LERAMLIGMR NDFRRDDALE VALDCVTHSA ARGCGFDGYG LQPGSRADVV LVDALTFAEA VVARPVRRLV VSSGKIVARN GALV
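The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the Gene sequence row relates to the GC content, Translation table, and Protein sequence rows, the code below strips the block spacing, computes the GC fraction, and translates codons using the amino-acid assignments of NCBI translation table 11. Everything here is an illustrative assumption rather than the database's own pipeline: the helper names are hypothetical, only unambiguous A/C/G/T input is handled, and the special treatment of alternative start codons in table 11 is ignored (this gene starts with ATG, so that does not matter here).

```python
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGACGAACC TGCTGATCCG CAACGTCCGC"
print(translate(first_blocks))  # MTNLLIRNVR
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence string to `gc_fraction` and `translate` allows the GC content and Protein sequence rows to be checked the same way.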