Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0625 |
Symbol | |
ID | 6974022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 700643 |
End bp | 702436 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643390156 |
Product | Heparinase II/III family protein |
Protein accession | YP_002275032 |
Protein GI | 209542803 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.27993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCTGC GACGCTGGAG CCAGGACGCG CGTCTTTCAC TGGCACGCCT GCCGTCGCTG GCCGGGTTGG GCCGCGTCCC GCCCCAGCCG GTCCATGCCG TTCGCGACCT GTGGCCGGGC GATGCGGCAT CGGGCGCGCG GCTGCTCCGC GGCAGCCTGT CTCATGCCGG CGTCACGCGG CCGATCGGCC CGGGCCGGTG GGAGGACCCG TCCTACCCCG AGCGCTTCCG GGCCTGCCTG CACGGATTCG CCTGGCTGCG CGACCTGCGC GCGGTCGGGA CCGATTCGGC GCGGCTGCAG GCGCGGGCGC TGGTCGATGA CTGGCTGTCC CATCCGCCCA GCGACCCGAT GGTGCGTGAC TGCGCCGTGA CGGGCACGCG GCTGGCGTCG TGGCTGGGCC ATTACGATTT CTTCGCCGCT TCCGCCGATG ACGGGTTTCG CCAGCGCCTG ATGCAGCGCC TGCTGGCGGA AGGACGGACG ATTGCCGCCC TGATGCCGCC CGAATCGCAT GACTGGCGGG CGCTGGCCGC GTTCAAGGGC CTGCTGGCGG CGGCGATCGC CATGCCCGAC CATAGCGGCT TCCTGGTGCG GTTCCGGCGC TATATCGACG CCGAACTGGA ACGGCAGATA CTGCCGGACG GCTGCCATAT CCAGCGCAGC CCGGAAATCC AGTTCCTGGT GCTGCGCGAA CTGGCGGAAA TGCACGCGAT GCTGCACGCC GCGCAGATCG CCCCGCCCAT GGCCCTGACC CTGGCGCTGG ATCGGATGAG TCCCGTTCTG CGCGCGATGC GGCATGGCGA CGGCGGGCTG GCGCTGTTCA ACGGCAGCCA CACGGGCAAT GTCGCGATGA TCGAGACGGT GCTGTCGCAG GCGACGCGCA CGCGCGTGGT GGCCACCGCG ATGCCGGACG GCGGCTTCAT CCGCCTGCAG GTCGGCCGGT CGCTGCTGCT GGTGGACGCG GCCCCCCCGC CGCCGCCCGG CTTCGATGAA GATGCCCATG CCGGCACGCT GTCGTTCGAA TTTTCGGTGG CGCGGCGGCG GGTGATCGTC AATTGCGGCG CGGGCGAGGG GCCGGAATGG CGACGGGCCC TGCGCGAAAG CGCGGCCCAT TCCCTGCTGG TTCTGGAGGA TACCTCGTCC TCGGACTTCG CGCCGCAGGG CGGAATCCTG CGCCGGCCGG TCCATGTGAC GGCCGAACAG GTGGCCCAGG ACGGCGCGCA TTGGCTGGAC CTGTCCCATG ACGGCTATCA CGCGCCGTTC GGCGCATCCT GGCGGCGGCG CCTCTATCTG GGAAACGGGG GCGAGGACCT GCGGGGCGAG GAAATCGTCG AGGGCGAGCG CCAGCAATCC TTCGTGCTGC GCTTCCACCT CCATCCGTCG GTCGGTGCGG AATGGGATGC CGATGCCCAG ATCGTCATCC TGGACGTCGG CGGCCAGATC TGGAAATTCC GTGCCGACGG CGGCAAGGTC GCGGTCGAGG AAAGCGTCTA TTGCGGCGGA ACGACCCCCG AGCGCAGCCG GCAGCTCGTC GTCCGGGTGC GTCCCGGCGA CCATGCGGAC GAAGATCAGG CAGAGGACAA TCAGGCGGAT ACCGATCAGG CGGATAAGGA GCCGGCGGAT GAAGGCCGGA CAGATGAGGA TCGGGCCGCC CGGGACCATG CGGATGAAGA CGGGCCGGCC CGGAACGCCG ACACCCCTGA CGCGGCCCCC CCGGCGAAGC CCGTGCCCCA GGCCGAGAGC GGCGAGCGTA CACGCCAGGT CGTGCGCTGG GCGTTGATGC AGATGGAAGG GTAG
|
Protein sequence | MVLRRWSQDA RLSLARLPSL AGLGRVPPQP VHAVRDLWPG DAASGARLLR GSLSHAGVTR PIGPGRWEDP SYPERFRACL HGFAWLRDLR AVGTDSARLQ ARALVDDWLS HPPSDPMVRD CAVTGTRLAS WLGHYDFFAA SADDGFRQRL MQRLLAEGRT IAALMPPESH DWRALAAFKG LLAAAIAMPD HSGFLVRFRR YIDAELERQI LPDGCHIQRS PEIQFLVLRE LAEMHAMLHA AQIAPPMALT LALDRMSPVL RAMRHGDGGL ALFNGSHTGN VAMIETVLSQ ATRTRVVATA MPDGGFIRLQ VGRSLLLVDA APPPPPGFDE DAHAGTLSFE FSVARRRVIV NCGAGEGPEW RRALRESAAH SLLVLEDTSS SDFAPQGGIL RRPVHVTAEQ VAQDGAHWLD LSHDGYHAPF GASWRRRLYL GNGGEDLRGE EIVEGERQQS FVLRFHLHPS VGAEWDADAQ IVILDVGGQI WKFRADGGKV AVEESVYCGG TTPERSRQLV VRVRPGDHAD EDQAEDNQAD TDQADKEPAD EGRTDEDRAA RDHADEDGPA RNADTPDAAP PAKPVPQAES GERTRQVVRW ALMQMEG
|
| |