Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4182 |
Symbol | |
ID | 5736044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5334020 |
End bp | 5335324 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281337 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_001546942 |
Protein GI | 159900695 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.953072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCGG TTGATCTGAT TATTAAAAAA CGCAATGGAG CACAGCTCTC AACCGAAGAA ATTCAATGGC TGATTCAGGG CTATACCAAC GGCAGTGTGC CCGATTATCA GATGGCGGCG TGGGCCATGG CGGTTGTGCT CAAAGGCATG GATGATCGCG AAACCACCGA TTTAACCCTT GCCATGGCCG CTTCGGGCGA TCAGCTCGAT TTGCGTGATT TCGCGCCCGA TGCCGTTGAT AAGCACTCGA CTGGCGGGGT TGGCGATAAA ACTAGCCTTG TGCTGGGGCC AATGTTGGCA GCCGTTGGTT TGCAAGTTGC CAAAATGTCG GGGCGGGGCT TGGGCTTTTC GGGCGGCACG CTCGATAAAC TTGAGGCCAT CCCCAACATG CGCATCGACC TGAGCGAAGA TGAGTTTCGC CATGCCATGC GTGAGATTGG CATGGTGATT ATGGGCCAAA CTGCTGATCT CGCACCTGCC GACAAAAAGC TGTATGCGTT GCGCGATGTG ACTGGCACTG TCGAATGTAT TCCGCTGATT GCAGCCAGCA TTATGAGCAA AAAGCTGGCG GCTGGAGCCA AAAGCATCGT ACTCGATGTC AAGGTTGGGG CGGGGGCGTT TATGAAAACC CTCGATCAAG CCCGTGATTT GGCCCGAACG ATGGTGCGGA TCGGCCAATT GGCTGGCCGC AATGTCGCTG CGATCCTTTC GTCGATGGAG CAACCGTTGG GCTTGACGAT CGGTAATGCG CTGGAAGTGC GCGAAGCGAT TGAAACGCTC CAAGGTCGCG GCCCCGGCGA TTTGGTTGAA GTTTGTTTGA CCCTTGGCTC ACATCTGTTG GTTTTGGCTG GCAAGGCCCA AAATCTTGAT GATGCCCGCC AACAGTTGCA AGCAAGCTTG GATAACGGCC AAGCTTGGGC TAAATTCCGT GAGTTTGTTG CCCAGCAAGG CGGCGATCTC ACGGTGATTG ACCAACCAGA AACCCTGCCA ATCGCCCCAA TTCAAATCAG TTTGCTGGCC GAGAGCAGCG GCTTTGTCCA ACGCATCGAT GCCGAAACCT GTGGGATTGT GGCGACCGAG CTTGGAGCTG GTCGCGCCCG CAAAGAAGAT GCGATCGATC CGGCGGTGGG CTTGGTGCTT GAGCGTAAAG TTGGCGAGCC AGTTCAAGCG GGCGAGGCCT TGTTGACAGT GCATGCTGCT GATCAGCAAC GGGCCGAGGT TGCTTTGGCC GCACTCAAAT CGGCAATTAC GATCAGTGCT ACGTCGGTTG AAGCCTTGCC CTTGGTTTTC GAAAGCGTTG CCTAG
|
Protein sequence | MRAVDLIIKK RNGAQLSTEE IQWLIQGYTN GSVPDYQMAA WAMAVVLKGM DDRETTDLTL AMAASGDQLD LRDFAPDAVD KHSTGGVGDK TSLVLGPMLA AVGLQVAKMS GRGLGFSGGT LDKLEAIPNM RIDLSEDEFR HAMREIGMVI MGQTADLAPA DKKLYALRDV TGTVECIPLI AASIMSKKLA AGAKSIVLDV KVGAGAFMKT LDQARDLART MVRIGQLAGR NVAAILSSME QPLGLTIGNA LEVREAIETL QGRGPGDLVE VCLTLGSHLL VLAGKAQNLD DARQQLQASL DNGQAWAKFR EFVAQQGGDL TVIDQPETLP IAPIQISLLA ESSGFVQRID AETCGIVATE LGAGRARKED AIDPAVGLVL ERKVGEPVQA GEALLTVHAA DQQRAEVALA ALKSAITISA TSVEALPLVF ESVA
|
| |