Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4780 |
Symbol | |
ID | 3749988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 1778841 |
End bp | 1779848 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637763077 |
Product | hypothetical protein |
Protein accession | YP_369019 |
Protein GI | 78066250 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.107558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.342828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGC TTTCGTTGCC GACGCTCGAC GACCTGCGTA TCGAGCCGGG GCTGCCCACC GTCGTGTCGC CGCGCGGCAG CGACGGAATG TCGATCGACG ATGTCGCGCC GCTGGCGCGC GAGATCGCGG CCGACACGCT CGAACGGGCG GGCGGCGTGC TGTTCACAGG TTTTCACGTG CCGTCGATCG ACACGTTCCA GCAGTTCGCG GCGTCGTTCG GCGATCCGCT GATCGGCTAT GAATACGCAT CGACGCCGCG CAGCCAGGTC GAAGGCGCGG TCTACACGTC GACCGAATAC CCGCCGCACC GCGCGATACC GCTGCACAAC GAGCAGTCGT ACACGCGCGA ATGGCCGCTG CGGATCTGGT TCCACTGCGC GCTCGCGGCG CCGAAGGGCG GTGCGACGCC GATCGCGGAC AGCCGCGCCG TCTACCGCGC GCTCGATCCG GCGCTGATTG CGCGCTTCGA GAAGCGCGAA CTGCTGTACG TGCGCAATTT CGGGCAGGGG CTCGATCTGC CGTGGCAGCA GTCGTTCGGT ACCGACGAGC CGGCCGAAGT CGAACGGATG TGCGCGGTGC GCGGCATCGA ATGCGCGTGG CGCACCGACG ACGACGGCGA GCTGCTGCTG CGCACCCGCG AACGTTGCCA GGCCGTCGCG CGCCATCCGC GCACCGGCGA CCGCGTGTGG TTCAACCAGG CGAACCTGTT TCACCTGTCG GCGCTCGACG ACGACATGCA GGAAGCGCTC GTCGACGCGG TCGGGCTCGA GAACGTGCCG CGCAACGTGT ATTACGGCGA CGGCGAACCG CTCGAAGCCG ACGCGCTCGC GGAGATCCGC GGCGTGCTCG ACCAGCAGCG CATCGTGTTC CCGTGGCGCA CGGGCGACGT GCTGATGCTC GACAACATGC TGACCGCGCA TGCGCGCGAC CCGTTCGAGG GGCCGCGCAA GGTCGTCGTC GCGATGGCGC AGAGTTATAC GGTCCCGCGC GACCGAACGG AGGATTGA
|
Protein sequence | MTLLSLPTLD DLRIEPGLPT VVSPRGSDGM SIDDVAPLAR EIAADTLERA GGVLFTGFHV PSIDTFQQFA ASFGDPLIGY EYASTPRSQV EGAVYTSTEY PPHRAIPLHN EQSYTREWPL RIWFHCALAA PKGGATPIAD SRAVYRALDP ALIARFEKRE LLYVRNFGQG LDLPWQQSFG TDEPAEVERM CAVRGIECAW RTDDDGELLL RTRERCQAVA RHPRTGDRVW FNQANLFHLS ALDDDMQEAL VDAVGLENVP RNVYYGDGEP LEADALAEIR GVLDQQRIVF PWRTGDVLML DNMLTAHARD PFEGPRKVVV AMAQSYTVPR DRTED
|
| |