Gene Information |
Locus tag | Haur_2648 |
Symbol | |
ID | 5734528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3398787 |
End bp | 3399779 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279790 |
Product | SMP-30/gluconolactonase/LRE domain-containing protein |
Protein accession | YP_001545414 |
Protein GI | 159899167 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
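The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(3398787, 3399779)
print(length, protein_length(length))  # 993 330
```

Both results match the Gene Length and Protein Length fields listed in the record.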
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0707367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGGA TCTATCTATT AATGCTGATT ATCGGTGTGA GTTTAACGAG TTGTGGGAGC ACCGTCGCCC CGACTGCAAC GATCGCACCA AGCACAACTA GCGTTAGCCA AGCTGAAGCA ACTGCAACGG TTGCCCCCAC GCTTGAGCCA ACAACCGCCC CAATTGCCGC AATTACTGCC CCCATTACCT TGAGCGAAGG TTTTGCCCAA CCAGAAGGCA TGTTACACAA TCAGCTTAGC GATCTGTATC TGGTTTCAAA TATTAATGGT ACACCATTTG GCCGCGATGA TAACGGCTAT ATTGCGCAAA TTAACCCCGA TGGCACGCTT AATCAAGCCA AATGGATCGA TGGCGCGAGT GCTGAGGTCG AATTGAATGC CCCCAAAGGC ATGGCGATTG CCAATAATAC CCTGTATGTG GCCGATCTGG ATAGCGTGCG CTTGTTCGAT GCAGTTTCGG GCGAGGTCAA AGGCACAATT GCAATTAGCG GTAGCAGCTT CTTAAATGAT CTTGTGGCGC GTGATGATGG AACGGTGTTT GTCAGCGACA TGGGAATTAA CGAGGCGTTT GGCAGCACTG GTACGGCGGC GATCTATCAA ATTGACCCTG CTGGTAATGT TTCGCTGGCG GTCAAACTTG CCGATGGTAA TCCTAATGGC TTGGCTATGA CTGCTGATGG TACGCTGTTG GTTTGCCGCT ACGATACTGC GGTTGAGATT TTTGAACTGA CTGCTGATGG TATATTGCAG CCTTATCGCA AGGCTAGCGC TAGCCAACAC GATGGCTTGG TAGTGTTAGC TGATCAAAGT TTGTTGGTTT CATCGTGGCA AACCGCCAAT ATTCATCAAT TAATGGCTGA TGGCAGCGAA CGCATTATTT ACCATGGCCC AACTGGCGCT CCCGCCGATA TTAATCTTGA CCATCAACGT AAAGTTCTAT TGATGCCGCT GATTATGACC AACCAAGTGG TATTATGGCC GTTAGCTGAC TAA
|
Protein sequence | MARIYLLMLI IGVSLTSCGS TVAPTATIAP STTSVSQAEA TATVAPTLEP TTAPIAAITA PITLSEGFAQ PEGMLHNQLS DLYLVSNING TPFGRDDNGY IAQINPDGTL NQAKWIDGAS AEVELNAPKG MAIANNTLYV ADLDSVRLFD AVSGEVKGTI AISGSSFLND LVARDDGTVF VSDMGINEAF GSTGTAAIYQ IDPAGNVSLA VKLADGNPNG LAMTADGTLL VCRYDTAVEI FELTADGILQ PYRKASASQH DGLVVLADQS LLVSSWQTAN IHQLMADGSE RIIYHGPTGA PADINLDHQR KVLLMPLIMT NQVVLWPLAD
|
| |
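The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and run a basic sanity check on a coding sequence. The helper names are hypothetical, and the start codon set is an assumption covering only the common initiators (ATG, GTG, TTG), not the full list permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}   # stop codons in translation table 11
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common initiators only


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a start and a stop codon."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record, not the full gene.
example = "ATGGCTCGGA TCTAA"
print(gc_percent(example), looks_like_cds(example))
```

Passing the full Gene sequence field to `gc_percent` should give a value comparable to the GC content field, though the database may round or compute it differently.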