Gene Information |
Locus tag | Haur_1752 |
Symbol | |
ID | 5733639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2038634 |
End bp | 2040031 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278894 |
Product | hypothetical protein |
Protein accession | YP_001544523 |
Protein GI | 159898276 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
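The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers introduced here, not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2038634
end_bp = 2040031
gene_length_bp = 1398
protein_length_aa = 465


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the reported Gene Length (1398 bp) and Protein Length (465 aa) agree with the coordinates: 1398 bp is 466 codons, one of which is the stop codon.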
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000233386 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTA TGATCGTTTT TGCTATTGTT GTCTTGTTGA TGGCCGCGTG TAGCTCGCAA TGGCCTAGCG CCCCAATTGC CAAAAGTACT CCTCATAATC CAGCCAATTT TCCTCCCAGC CCGCAAAACT CACCTACCAT TGGTCAATGC CCGGTTTTTC CACTTGATAA CATTTGGAAT ACCCCAATCG ACACCTTGCC TGTCCACCCT CGTTCGAGCC AATATATTGC CAGCATTGGC GGCAGCGAAA CCTTGCACCC TGATTTTGGC GCGGCCCAAT GGAATGGTGG TGATATTGGG ATTCCCTATG TGGTTGTGCC CGCTAACCAA CCAACTGTGA CCGTTAATTT TGTCGATTAT CCGCATGAGA GTGACCCCAC CACTGGCTCA GGCCAATACC CAGTGCCACC GAATGCGCCA CGTGAGCATG GTAGCGACCA TCATGTGTTG GTGGTGCGTG AAGGTGAATG TAAGCTGTAT GAGCTGTACA ATGCCACCAA AATTAACGAT ACAACTTGGA ATGCCAGTAA CGGAGCAATC TTTGATTTAC GCTCGAATAG CTTACGACCT GATACCTGGA CTTCCGCTGA TGCAGCAGGT TTGCCAATCT TGCCTGGCTT AGTGCGCTAC GAAGAGGTGC AAGCAGGCGA GATCAACCAT GCAATTCGCT TTACGATTCA GCGTTCACAA CGGGCCTATG TCTGGCCAGC GCGGCATTTT GCGTCATCAA TTACCGACCA AAATGTGCCA CCAATGGGTA TGCGCTTTCG GCTCAAGGCA TCGTTTGATA TTTCGGGGTT TTCCAGCGAG ATGCAAGTTA TTTTGCGGGC AATGCAGCGC TATGGCATCA TTGTGGCCGA TAATGGTTCA GATTGGTATA TTTCTGGCGC ACCGAATCCC AATTGGGATG ACGATAATTT GGTGAGTAGT TTCGACCAAA TTCGCGGTGA TCATTTTGAA GCTATGGATA GCTCGAGCTT GCAACTCAAC CCTGATTCGG CAGCGGTTAT CGCTAGTGCT GCCCCGCAAC CAAGCAAATT GGCCGAATTT GGGGGCGTTG ATCAAGGCCA ACAATTGCGC TATGCGATTA CAGTGGTTGG CACAGGCAGC CCACAAACCA TGAATGATCA ACTGCCTGAT GGCCTAACAA TTGTTCCGGC GAGCGCCACG ATTAACCCAA GCAATTTGGC GGCCCCAGCC ATTAGCAACA ATAGTGTTCA GTGGAGCGGC ACAATTCCCA ATTCGCAGAG TGCGGTGATT AGCTTTCGTG CAACGGTCAG CACCAACGAG CGCCGCGTTA TTATCAATAC TGCCCAGATT AATGCTGCCA CAGTGCAGGC CAGCATTATT GCCAATGGCT ATCGTGTTTG GTCGCCCATG GTGCGCAAAC TGAAGTAG
|
Protein sequence | MRRMIVFAIV VLLMAACSSQ WPSAPIAKST PHNPANFPPS PQNSPTIGQC PVFPLDNIWN TPIDTLPVHP RSSQYIASIG GSETLHPDFG AAQWNGGDIG IPYVVVPANQ PTVTVNFVDY PHESDPTTGS GQYPVPPNAP REHGSDHHVL VVREGECKLY ELYNATKIND TTWNASNGAI FDLRSNSLRP DTWTSADAAG LPILPGLVRY EEVQAGEINH AIRFTIQRSQ RAYVWPARHF ASSITDQNVP PMGMRFRLKA SFDISGFSSE MQVILRAMQR YGIIVADNGS DWYISGAPNP NWDDDNLVSS FDQIRGDHFE AMDSSSLQLN PDSAAVIASA APQPSKLAEF GGVDQGQQLR YAITVVGTGS PQTMNDQLPD GLTIVPASAT INPSNLAAPA ISNNSVQWSG TIPNSQSAVI SFRATVSTNE RRVIINTAQI NAATVQASII ANGYRVWSPM VRKLK
|
| |
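The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is illustrative only: it builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code for sense and stop codons, and it translates only the first 30 and last 18 nucleotides copied from the Gene sequence field rather than the full record. The `translate` helper is a hypothetical name, not a database or library function.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the Gene sequence field (both in frame).
gene_start = "ATGCGTCGTA TGATCGTTTT TGCTATTGTT"
gene_end = "GTGCGCAAAC TGAAGTAG"

print(translate(gene_start))  # MRRMIVFAIV
print(translate(gene_end))    # VRKLK
```

The two results match the first ten and last five residues of the Protein sequence field, and the final codon TAG is a stop codon, consistent with the 465 aa length reported for a 1398 bp CDS.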