Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2710 |
Symbol | |
ID | 5734591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3464000 |
End bp | 3464845 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279853 |
Product | inosine guanosine and xanthosine phosphorylase family protein |
Protein accession | YP_001545476 |
Protein GI | 159899229 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0005] Purine nucleoside phosphorylase |
TIGRFAM ID | [TIGR01697] inosine guanosine and xanthosine phosphorylase family [TIGR01700] purine nucleoside phosphorylase I, inosine and guanosine-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00787639 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA ACCGAACTCA AGCAATCACT GGCGCAGTCG CTGCCATTCG TCAATATACC GAGCGCGTAC CCCAAGTTGG CATTATCCTT GGCTCTGGCT TGAGTCAACT TGTTGATCAT ATTCAGGATG CCGTGGTAGT TCCCTACACT GCGATTCCAG GCTTCGCACC TTCAGCGGTG CCAGGCCATC GCGGTGAACT TGTTTTAGGT GAACTTGGTG GGGTTTCTGT GCTGGCAATG CGTGGCCGCT TTCACTTCTA CGAAGGCTAT GCCATGGACG AAGTAACCTT GCCTGTGCAT GTGATGCGAG CGCTGGGAGC CGAGATATTA ATTGTTACGA ATGCGGCTGG TGGCTTAAAT GCCAATTGGC AGGTCGGCGA CTTAATGCGC ATCAGCGACC ATATTTTTAT GCCAGGCATG GCGGGCTTTC ACCCATTGCG CGGCCACAAC GACGATACGC TTGGCCCACG CTTCCCAGCC ATGCTCAACG CCTATGATTC TGAATTAGGT GCAATGGCCA AGGCTGCCGC TGAACGGGCT GGCGCAACTC TGCGCGAAGG GGTTTATGCC ATGCTAGCCG GGCCATCGTT TGAAACTGGC GCTGAGATGA ACTATCTGCG CGGGGTTGGG GTTGACGCTG TGGGCATGTC TACCGCCCCT GAAACGATTG TGGCTCGCTA CCGTGGTATG CGCGTCTTGG GCATTTCGTT GATCACCAAT ATTGCCCACC CCGACGCGCC ACCAGCCAAC CATGAAGAGG TGCTTGAGGC AGGCGAAACT GCCAAGCCCA TGTTCAGCGC ATTAATCACC GATGTGCTAA GCCAAATCGC TGCCGATGGC AAGTAG
|
Protein sequence | MTTNRTQAIT GAVAAIRQYT ERVPQVGIIL GSGLSQLVDH IQDAVVVPYT AIPGFAPSAV PGHRGELVLG ELGGVSVLAM RGRFHFYEGY AMDEVTLPVH VMRALGAEIL IVTNAAGGLN ANWQVGDLMR ISDHIFMPGM AGFHPLRGHN DDTLGPRFPA MLNAYDSELG AMAKAAAERA GATLREGVYA MLAGPSFETG AEMNYLRGVG VDAVGMSTAP ETIVARYRGM RVLGISLITN IAHPDAPPAN HEEVLEAGET AKPMFSALIT DVLSQIAADG K
|
| |