Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2726 |
Symbol | |
ID | 5734607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3480806 |
End bp | 3482587 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279869 |
Product | polymorphic outer membrane protein |
Protein accession | YP_001545492 |
Protein GI | 159899245 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0479266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTGA AAACGATAAT CGGTCTGGGC AGCATGCTGC TTGGTTTGGC GACCGCTGGA ATTGGTCATG CCCAACAAAC CATTGCAACG CAGGCGGCTG CTTCGATTTT TACCGTAACC ACCTTTAATG ATGAAGTTAA TCAGAATGGC CAATGCTCGT TGCGCGAGGC AATTGTGGCG GCCAATACCA ACAGTGTCAG CGCCGATTGT GGCAATGGAG CGCTAGTCAC GACGATCAAT TTAGCCAGCG GAACCTATAC GCTGAATATC GATAACCCTG CGCCAGTTGG CACGGAGTTT CGGCTTGATG ATAATAGCGG CTTATATGGC GATTTGGATA TTACCGATAC GCTGAGTTTG GTCGGCGCGG CTCCGAATCT GACGATCATT AGTGGCCTGA GTGTCGCCAA TACCAATCGA ATTCTGCATA TTCATAGTGG TACGGTTAAT CTGAGCAACC TGAGCTTAAT CGAAGCACAT AGCCCATGGA ATGGTGCTGC AATTTATAAT CAAGGCCAGC TAACCATCAA TAATGTTAAT CTGAATAATA ATCAGACCGT TTTGAATCTG AATTACAACG ATAATCTCGC TGCCTTACAT GGCGGAGCAA TCTACAATGC TGGCAGTTTA TCGATTAGTA ATAGTGCATT GATTGGTAAT CGTACTGCCG CAACCTTGCA TGATTTTAAT GGCGGAGCAA TTTATAATAC CGCTCAGTTA ACAATTAATG CGGTTGACTT TCGCCAAAAT CAAGCAGTCA ATGGTGGGGC AATTTATAGC ACTTCGACCG ATCAACAATT AAGCATTAGT ACCAGCGAAT TTACTGATAA CGAAGCAATC ATTGTGAACG ATAACGAAGA TCCGCAAGGT GGTGCGTTGT GGTTGAATAG CACCAAGACG ATCAGCAATA CGATCTTTCG GAGCAACGAG GCCAATGCTG GTGGTGCAAT CTATGCGCTC AAGGCTTTGA CGATCAATAC GAGCATCTTT GATCGCAATG CTGCTGGAAG TGGCAATGGG ATTGCTCATG GCACGGCAAT TCATAGCAAA GGCCAGTCGA ATATTAATCG CAGTATCTTT AAATTCAATG CTGGTGATTC AGCAGCGATT GGTAATGCTG GCACGATGCA ACTTGATCGC GCTACAATCA TCAATAATTC CAATCGAGCA ATTGAATTTA TGGGCACTGC GCCAATTGGC ACTGGCGGCG GAGTTCACAA CACCAATCAA TTAACCATTA TCAATAGCAC GATTAGCAAT AATACGGCTA ATTTGGCGGG TGCAGGGATT TATAATTCGC GCTCATTGCA CTTGATCAAC ACTAGTGTGA TTAGCAATGG CATTAAATTT GGCGATGAAG ATGGAGTTGG CGGTGGAATT TATACGCTAC CCGCCAGTAC GAGCTACCTT ACCAATACCT TGATCAGCCT CAATACAGCG CCTGCGGGAG CCGATTGCTG GGGGGCGACG CAAGGTTTTT ATGCGCTGAT TAGCGAGCTT GATGATTGCC AACTGGCTGG CAATACGAAT CTTGGTGATC TCGCGGCTCA AACCAGCCCG CTAACCCAAG TTGATGACTA CACTTGGGTT GCACCATTAA CCCAAACCAG CCCAGCGGTT GATGCGGGCA ACAACCAAGC CTGCCCCGCG AGCGATCAAC GCGGTCAATC ACGGCCAATC GATGGCAATG GCGATTTGTT GGCTCGCTGC GATCTTGGTG CTTACGAATT TGTTTCGACC CAAACCTTCT ACCATCAATT TGCCCCTTGG GCCACCCGCT AA
|
Protein sequence | MHLKTIIGLG SMLLGLATAG IGHAQQTIAT QAAASIFTVT TFNDEVNQNG QCSLREAIVA ANTNSVSADC GNGALVTTIN LASGTYTLNI DNPAPVGTEF RLDDNSGLYG DLDITDTLSL VGAAPNLTII SGLSVANTNR ILHIHSGTVN LSNLSLIEAH SPWNGAAIYN QGQLTINNVN LNNNQTVLNL NYNDNLAALH GGAIYNAGSL SISNSALIGN RTAATLHDFN GGAIYNTAQL TINAVDFRQN QAVNGGAIYS TSTDQQLSIS TSEFTDNEAI IVNDNEDPQG GALWLNSTKT ISNTIFRSNE ANAGGAIYAL KALTINTSIF DRNAAGSGNG IAHGTAIHSK GQSNINRSIF KFNAGDSAAI GNAGTMQLDR ATIINNSNRA IEFMGTAPIG TGGGVHNTNQ LTIINSTISN NTANLAGAGI YNSRSLHLIN TSVISNGIKF GDEDGVGGGI YTLPASTSYL TNTLISLNTA PAGADCWGAT QGFYALISEL DDCQLAGNTN LGDLAAQTSP LTQVDDYTWV APLTQTSPAV DAGNNQACPA SDQRGQSRPI DGNGDLLARC DLGAYEFVST QTFYHQFAPW ATR
|
| |