Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4160 |
Symbol | |
ID | 5736021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5306860 |
End bp | 5308410 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281314 |
Product | hypothetical protein |
Protein accession | YP_001546920 |
Protein GI | 159900673 |
COG category | [S] Function unknown |
COG ID | [COG1700] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00251494 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAGAGT TAATTCCACT GACAATTAAC GGTATCGATA GCCAAGCAAC TGTGCATGTC ACTGCCAGCG CTTGGAATGA GTGGGCCGTG GTCAACATGG CTTGCCCAGC CCAAACCAAT GTGCGCAATG CGCAGCTTTC GATTGGTGCT GATAATTTGG GTGTGCCCCA AGTCAGCCCA TTTGATCCAA CTTGGCGTTG GTCGTGGTTG CCACGTGGCC AGGCGGGCAC AGTTTATGGC CGGTTGCAGA TTGAATGGGC TGATGGCGAA ATTCAGCAGC AGCAGTTTCA ATTTGAGCTT CAGCCACATT TGCTTGATCG TGAATTGTGG CGAGCATTGC TGAACGATCT CAGCTCGTTG GCTCGTTCGC TCGCCTTGCG AATTGCCAGC CCCAGCTTTG CCCAAGCGGT GTTAGTGCCG TTGCTCCCCG ATGATCCTAG CCCATTTTTA GAAGCGCTGA GCTTGATTAA CCAAAGTAGC CAGCAAGCGA GCCAGATTGT ACGCAGCTTG CAGCGTCAGG CCAAATCGAC ACTTGAACGC CAACCGCGAA CAACTGACTT GGGCACGGCT CAGCAATTTA AGCTCGATCA ATTGGCTCAG CCCAGCGAGC GTTATCACGT GCTCGAACCG TATGGCCTCG TGCCAGAGCA GGTGCAGGCT GAGCATGCCC AAGCTTCATT CGATCTGCCA GAGCATCGCT GGTTGATTGG CTTGATTCAG CAAATTGAGC GGCGTTTACG CAATTTACGT CGCCTTGCTC GTGAACAACG CCAGCTTGAT TTAACCAGTA TTCAAGCAAC AATTGAGCAA CGACTGACGG CTTTGCGCCA ATTACGCCAA GCAGCGCCCT TGGCGGGTTT AAAAGCTCGC CAGCAGCCAG TCCAAAGCCA ACTGATCAAC CGTGATGCGC GTTACCGACC GATTCGCCAG TTGGCGCGAA GTTTGCATGA ACAGCCGTTG CTCACCTTGG AAGTTGGCAG TTTGGCCTTG CCCTTGGCCG ATGTGCCAAC CCTCTATGAG CAATGGTGTG CGCTAGCTGT CGCCCAGGTT TTAGCCGAAT TAGGCCAGGT TGAAGCCCAA CATCTGCTGA TCGATAACCC TCAGCGTGAA CGTTGGGTGC TTGAATTAAA TTCCGCCACA CCGTTGTTGA GCGTGCGGAT TGGCTCACAG CTTTGGCATT TGCGCTACCA AGCGCGATTT AGCGCCCAGC CCGATTCTGA TGGTTTTTAT AGCCTTGATC GATATTTGCG CATCCCCGAT TTGGTGTTAC AAACTGCCAC GGCCAATGCC AAGCAGGTGT TGGTGCTTGA TGCCAAATAT CGCCGAGCGC CTGACCAGCG GGTTCCGCAA AGTGCGCTTG ATGATGTCTA TGCCTATCGT GGCAGCTTGG GCTACAATGG TCAGCCATGT GTGCTAGCCG CCGCAATTCT GTACCCACAA GCAAACACAC TTGAGGAATT TGGCTCGATT GCGGCAATTG GCCTCATTCC CAATCAGCTT AATCAGCTAA AAACCTGGTT GGAGCGTTGG CTCAACCAGC TTGATCAATA A
|
Protein sequence | MAELIPLTIN GIDSQATVHV TASAWNEWAV VNMACPAQTN VRNAQLSIGA DNLGVPQVSP FDPTWRWSWL PRGQAGTVYG RLQIEWADGE IQQQQFQFEL QPHLLDRELW RALLNDLSSL ARSLALRIAS PSFAQAVLVP LLPDDPSPFL EALSLINQSS QQASQIVRSL QRQAKSTLER QPRTTDLGTA QQFKLDQLAQ PSERYHVLEP YGLVPEQVQA EHAQASFDLP EHRWLIGLIQ QIERRLRNLR RLAREQRQLD LTSIQATIEQ RLTALRQLRQ AAPLAGLKAR QQPVQSQLIN RDARYRPIRQ LARSLHEQPL LTLEVGSLAL PLADVPTLYE QWCALAVAQV LAELGQVEAQ HLLIDNPQRE RWVLELNSAT PLLSVRIGSQ LWHLRYQARF SAQPDSDGFY SLDRYLRIPD LVLQTATANA KQVLVLDAKY RRAPDQRVPQ SALDDVYAYR GSLGYNGQPC VLAAAILYPQ ANTLEEFGSI AAIGLIPNQL NQLKTWLERW LNQLDQ
|
| |