Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4431 |
Symbol | |
ID | 5736282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5670318 |
End bp | 5672081 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281594 |
Product | hypothetical protein |
Protein accession | YP_001547191 |
Protein GI | 159900944 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCC ACGATCAGCG TGCAGAGCGC AAAAGTATTT GGCACGATTT ATTAGAACGT CGGATCAATC GCCGGACCCT TGTTGCCAGT GGTGCTGCCG CTGCTGCGGT CGCGGCCTTG CCCTTAGATC TGCAAACTGC TGAAGCGGCG CACTATCATG CTCCATTGTC AGCCCCAGCC TTGGCTCAAC GCCAAGCTCA AGGTTCATTG CCCTTCAAAC CAATCAGCCC CAGCACCGCC GACGATTTGA TTTTGCCCGA AGGCTTCCGC TACGATTTGT TGGCCCACCG CGGCCATGAT ATGGGCGATG GCAGTTTGTT TGGCGAAAAT GCCGATTTCC TGGCGTTCTT CCCAATCGAT ATGCTCCAAA AAGGCCTCGA CCAAAATCGC CCACAATTTG GCTTTACCCG CAGCGATTTA TCCAGCACTG ATGGTTTGTT GCTGGTCAAC CACGAATATA TCAACCCTAT GTTTATCTCG GGCTACACTG GCTCAGGCGC AAAATCTGGC GATCAAATTA ACGCCGAAAA GCATATGGTT GGCATGAGCG TGATTCGGGT TAAGCGCAAT AGCGATGGCC GTTGGTATTT CGACCAAACT GATACCGCTC ACAACCGCCG CATCGATGCA ACTACTCCAA TCACCTTAAC TGGCCCAGCC GCGCAACTTG ATGGTGGCCC GATGGCAATT GGCTCACTTG GCAATTGTTC CGGTGGTGTA ACACCTTGGG GCACAGCACT GAGCTGCGAA GAAAACTTCC AAGATTATCC AAATCCAGCA CCAACTGGCT ATGGCTGGGA ACCAGAAATC TACGGCAAGC GCCACTACGG TTGGGTCGTC GAAGTTGATC CATTCGATAA AAACAGCATG CCACGCAAAC ATACCGCCAT GGGTCGCTTC CGCCACGAAA ATGTAGCGGT ACGAGTTGGC AGCGACGGCA CGGTTGTAGC CTATATGGGC GATGACAAAG CCGATTCATG CGTCTATAAG TTTGTGGCTG ACCGCAAATT GACCAACTTG GCAGATCGCC CAGGCAATAT GCAAATTCTC GAAAGCGGCC AACTCTATGC CGCCGACTTT GCCAATGGCA AGTGGATTTT GCTCGATTAC AATAGCCAAA GCGCCTTGCA AAGTGCCAAA GATAGCAAAG GCAATTTGCT ATTTAGCTCG CAAGCCGATG TTTTGGCCGA TACCCAAGCC GCTGCCATGG CGCTCAAAGC CACGCCCGTT GATCGCCCAG AAGATATTGA AATTCACCCA CTCGATGGCA GTGTCTATGT TGCCTTGACC AATAATACTG GCCACGGCAA CTTCCACGGC CAAATCGTGC GCATGGCCGA AACCGACAAT AATCCAGCTG CAACCAGCTT CGAATGGAGC ATCTTCGCGG TTGGTGGCTC GCAAAGCGGC TTCTCATCGC CCGACAATTT GGTGTTCGAT GGCGAAGGCA ACTTGTGGAT GGTAACCGAC ATCTCATCAT CACGCACCAA CAAAGGGATC TACAAATTCC AAGGCAACAA CGGTCTCTTC TTCTTCCGCA CCAGCGGCCC TGATGCTGGG ATCGCCTTCC AATTTGCCTC CGGGCCAGTG GAAAGCGAAA TGACTGGGCC ATGCTGGTCG CCTGATGGCC GAACCCTGTT CTTGGCGATT CAACACCCAG GTGAAGAATC CAAGAGCTTG ACCGAACTGA GCAGCCACTG GCCAATTGGT GGTAACGAAG TGCCGCGCTC AGGGGTTGTC GCAATTACCG GGTTCAAGCG CTAG
|
Protein sequence | MTSHDQRAER KSIWHDLLER RINRRTLVAS GAAAAAVAAL PLDLQTAEAA HYHAPLSAPA LAQRQAQGSL PFKPISPSTA DDLILPEGFR YDLLAHRGHD MGDGSLFGEN ADFLAFFPID MLQKGLDQNR PQFGFTRSDL SSTDGLLLVN HEYINPMFIS GYTGSGAKSG DQINAEKHMV GMSVIRVKRN SDGRWYFDQT DTAHNRRIDA TTPITLTGPA AQLDGGPMAI GSLGNCSGGV TPWGTALSCE ENFQDYPNPA PTGYGWEPEI YGKRHYGWVV EVDPFDKNSM PRKHTAMGRF RHENVAVRVG SDGTVVAYMG DDKADSCVYK FVADRKLTNL ADRPGNMQIL ESGQLYAADF ANGKWILLDY NSQSALQSAK DSKGNLLFSS QADVLADTQA AAMALKATPV DRPEDIEIHP LDGSVYVALT NNTGHGNFHG QIVRMAETDN NPAATSFEWS IFAVGGSQSG FSSPDNLVFD GEGNLWMVTD ISSSRTNKGI YKFQGNNGLF FFRTSGPDAG IAFQFASGPV ESEMTGPCWS PDGRTLFLAI QHPGEESKSL TELSSHWPIG GNEVPRSGVV AITGFKR
|
| |