Gene Information |
Locus tag | Bcep18194_A5477 |
Symbol | |
ID | 3750698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 2561959 |
End bp | 2563347 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637763786 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_369715 |
Protein GI | 78066946 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
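
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 2561959
END_BP = 2563347


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1389 bp) and Protein Length (462 aa).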
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAGCC TCATGACCGA CACCCTGTTG TTCGCAGACC ATGCGTACCT GCCCGAAGGC TGGCGCCGTA ACGTGCTGCT GCGCTGGGAC GCGGCCGGCA CGCTGACCGG CGTGACGCCC GACACCGACG CACCCGCTGG CGTCGCGCGC GCGGCCGGGC CCGTGATGCC CGGTATGCCG AACCTGCATT CGCACGCGTT CCAGCGCGCG ATGGCGGGGC TCACCGAATA CCGCGCGAAT CCGTCCGACA GCTTCTGGAG CTGGCGCGAC CTGATGTACC GCTTCGCGCT GAAGATCACG CCCGACGCGC TCGCGGCAAT CGCGCGCTGG CTGTATGTCG AGATGCTCAA GTGCGGCTAC ACGTCGGTGT GCGAATTCCA CTACGTGCAC CACACGCAGG ACGGCTCGCG CTATCCGCAG ATCGCCGAGC TCGGCACGCG CGTGATCGAC GCCGCACGCG CGGCCGGCAT CGGCATCACG ATGCTGCCGG TGTCGTACCA GTTCGCCGGC TTCGGCGACA AGCCGCCGCG CGACGACCAG CGCCGTTTCA TCAATACGCC CGACGGCCTG CTCGAGCTGC TCGACGCGAT GCGTCGCGTG GCGCCGGAGC ACGGCGGGCT GCGCTACGGC GTTGCGCCGC ACTCGCTGCG GGCGGTATCC GAGAACGGGT TGCGCGTGCT GCTCGAAGGG TTGCCCGGCG ATGCGCCCGT GCACATCCAT ATCGCCGAGC AGACGGCCGA AGTCGACGAC TGCGTGCGTG CCTACGGTGC GCGCCCCGTG CAATGGCTGC TCGATCGCTT CGACGTCGAT GCGCGCTGGT GCCTGGTGCA CGCGACGCAC GTCGACGCGG CCGAAACGGC AGCGCTCGCC AAGCGTCGCG CGGTCGCCGG CCTGTGCCTG ACGACCGAAG CAAATCTCGG CGACGGCGTG TTCCCGGCCG TCGACTATCT CGCGCAGGGT GGTGTGATCG GTGTCGGCTC GGACAGCCAC GCGTCGGTCG ACTGGCGCTC GGAATTGCGC CTGCTCGAAT ACGGGCAGCG GCTCGTGCAT CGCGCGCGCA ACGTGCTGGC GAGCGACACG CAGGCGCACG TCGCCGATCG CCTGTTCGAC GCTTCGCTTG CGGGCGGTGC ACAGGCCAGC GGGCGGCATG TCGGTGCGCT GCGCGAAGGG TGCCGCGCCG ACTGGCTCGT GCTCGATCCC GATCATCCGG CGATCGCCGA ACACGACAGC ACGTCGTGGT TGTCGGGTAT CGTGTTCGCG GAGCACGGCG AGACGCCGGT GCTCGACGTC TACACGGGCG GCGAGCGCGT CGTGAGCGGC CGCCGTCATC GCGACGAAGC CGTCGCGTAT GCCGACTACC GCGCCGCGCT GGCGCAACTG CTGCGCTGA
|
Protein sequence | MDSLMTDTLL FADHAYLPEG WRRNVLLRWD AAGTLTGVTP DTDAPAGVAR AAGPVMPGMP NLHSHAFQRA MAGLTEYRAN PSDSFWSWRD LMYRFALKIT PDALAAIARW LYVEMLKCGY TSVCEFHYVH HTQDGSRYPQ IAELGTRVID AARAAGIGIT MLPVSYQFAG FGDKPPRDDQ RRFINTPDGL LELLDAMRRV APEHGGLRYG VAPHSLRAVS ENGLRVLLEG LPGDAPVHIH IAEQTAEVDD CVRAYGARPV QWLLDRFDVD ARWCLVHATH VDAAETAALA KRRAVAGLCL TTEANLGDGV FPAVDYLAQG GVIGVGSDSH ASVDWRSELR LLEYGQRLVH RARNVLASDT QAHVADRLFD ASLAGGAQAS GRHVGALREG CRADWLVLDP DHPAIAEHDS TSWLSGIVFA EHGETPVLDV YTGGERVVSG RRHRDEAVAY ADYRAALAQL LR
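
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned, checked for GC content, and translated. It assumes the gene sequence is already given in coding orientation (it begins with ATG although the gene is on the minus strand) and uses the amino acid assignments of translation table 11, which match the standard code. All helper names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; translation table 11 uses
# the same assignments as the standard code (it differs in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
gene_prefix = "ATGGATAGCC TCATGACCGA CACCCTGTTG"
print(translate(gene_prefix))
print(round(gc_fraction(gene_prefix), 3))
```

For the three blocks used here, `translate` returns `MDSLMTDTLL`, which matches the first ten residues of the protein sequence. Passing the full gene sequence field to `gc_fraction` would allow a comparison with the GC content listed in the record. This sketch does not handle alternative start codons, which table 11 translates as methionine at the first position.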