Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0215 |
Symbol | |
ID | 6973607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 232980 |
End bp | 233858 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643389746 |
Product | OHCU decarboxylase |
Protein accession | YP_002274627 |
Protein GI | 209542398 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG2351] Transthyretin-like protein [COG3195] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02962] hydroxyisourate hydrolase [TIGR03164] OHCU decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.64075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0296602 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG GCATCATGGA CCGGGTCAAC CGGCTGGACC CGGCGGATTT CGTCGCCCTG TTCGGCGCGC TGTACGAACA TTCGCCCTGG GTCGCGGCAC GGGCGGCGTC CCTGCGGCCT TTTCCGGACC CGGACGCGAT GCTGGCGGCC ATGAACCAGG TGCTGGATCG GGCCACCGAC GCCGAGAAGC TGGCGCTGGT GCGCGCGCAT CCCGAACTGG CGCGGCGGGC CGGGGTGGAC CCGACCCTGA CACAGGCCTC GGCCGCCGAG CAGGCATCCG CCGGGCTGGA CCGTCTGTCG CCCGAGGAAT ACGCCCGGTT CAACCGCCTG AACGACGCCT ACGCCGCGCG CTTCGCCATG CCGTTCGTGA TCTGCGTGCG GTTGTCGGAC AAGGATTTCA TCCTGTCCGA GATGGAACGC CGCGTCGTCC ACACGCCGGA GCAGGAGGTG CGGACCGCGA TCGTGGAAAT CGGCAAGATC GCCAGGCTGC GCCTGGCCGA CGCCCTGGCG CGGCTGGAAA AGGAAGCCGT GATCACCCTG TCCAGCCATG TGCTGGACCT GGTCTCCGGC CGGCCGGCCG CCGGGATGGA CATCACGTTG TGGTCGGGCG CCACACGCCT GTTCAGCGGG CGAACCAACG GGGATGGGCG TTGTCCGGAC CTGGCCGGGA TCGGTGCCCT GGCACCGGGC GCCTATCGTC TGGAATTCGG GGTTGCGGCG TATTTCCGGG GGCAGGGCGT GGCGCTGAGC GACCCGCCGT TCCTGGATAT CGTGCCGATC GCCTTCGGCA TCGCCCCGCC TGTGGACGGG AAGGGCGGGC ACTATCATGT GCCGCTGCTG GTCTCCCCGT ACGGGTTTTC GACCTATCGG GGAAGCTGA
|
Protein sequence | MTIGIMDRVN RLDPADFVAL FGALYEHSPW VAARAASLRP FPDPDAMLAA MNQVLDRATD AEKLALVRAH PELARRAGVD PTLTQASAAE QASAGLDRLS PEEYARFNRL NDAYAARFAM PFVICVRLSD KDFILSEMER RVVHTPEQEV RTAIVEIGKI ARLRLADALA RLEKEAVITL SSHVLDLVSG RPAAGMDITL WSGATRLFSG RTNGDGRCPD LAGIGALAPG AYRLEFGVAA YFRGQGVALS DPPFLDIVPI AFGIAPPVDG KGGHYHVPLL VSPYGFSTYR GS
|
| |