Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1514 |
Symbol | |
ID | 4710715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1640204 |
End bp | 1641796 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855981 |
Product | eight transmembrane protein EpsH |
Protein accession | YP_001003083 |
Protein GI | 121998296 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase) [TIGR02914] EpsI family protein [TIGR03109] exosortase 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0189449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGCAGC TGATGGCCAG TAACCCGGAT GCGATTGTCG GCGGTGGGCT CGGGGATCGA GCGCCGTGGT ACCTGCACGG GGCGGCATTG GCACTGCTCG GCGTGGTGCT GGCGATCGCC TTCCTGCCAA CCTATCAGGC TATTGTGGGT ATTTGGTCAC GCTCAGAGAC CTTCGCCCAC GGTTTCCTGA TCGTGCCCAT CGTCCTCTTC CTGGTTTATC GGCTGCGTCA CCCCCTGGCG GATCAACAGC CCAGAGTGCA ACCACTGGCC CTGGTGCCGG TCGCCGGACT GGTCCTGCTC TGGGTGCTCG GCGCGTTGGT GGACGTCGAC TCCGTGCGCC ACTTCGCGGC GGTGCTCCTG ATCCCGGCTG TCGTCTGGCT GAGCCTGGGC AACGCCGTCG CCTGGACGCT GCTCTTCCCG CTGGCCTACC TGATCTCTGC CGTCCCATTC GGTGAGTTCC TGGTCCCGCC GCTGATGGAC TGGACTGCCG ACTTCACGGT ATGGGCAGTG CAGCAAACCG GCGTGCCGGT CTATCGCGAA GGACTGAACT TCGAGTTGCC GACCGGCCGC TGGTCCGTGG TCGAGGCGTG CAGTGGCGTG CGTTACCTCA TCGCTACCGT CGCCCTGGGC ACGCTGTACG CCTACCTGGT TTACCGAAGC TGGATGCGCC GGCTGGTTTT CGTGGCCTTC TCGTTCCTGG TGCCGATCCT CGCCAACGGT CTGCGCGCCT ATGCGATCGT GATGATCGGA CATCTCAGTG GCATGGAGTT GGCCGCGGGG GTCGATCATC TGATCTATGG CTGGGTGTTC TTCGGTGCGG TGATCGCGCT GATGTTCTGG ATCGGGACCT ACTGGCGCGA GGATCGGCCG ATCTCCGAGG GGGCAGCGCC CGGTCCAGGC GGCGGTGGCA TGGCGGAGCG GCTTAGCGAC AGTACAGGGC TTGGATCGCG TTCGGTTGCT GCGGTTGCCG GGGTGGCGTT GACGGGGGGA GTACTGGTCG CTTCCGGGCC GCTCTACGCC GGATGGATGA ATCAGCGGGA TCTCGGCCCT GTTGCCGGGT TGGAGGAGGC GGAGCTGCCC CTCAATGACT GGGAGGCGAT CGAGGCCGAT CCCTGGGAGC CGGGGTATCG CAACGCGCGC GCGGCCTTCC ACCGGCACTA TGTCGATGGG CAAGGGGTTC CGGTGGGGGT CTACGTGGGC TACTACCGGG AGCAATTCCG GCACGGGAAT ATGATCACTT GGGATAATAC CATGGCCGGC CGGGATCGGG ACGCCTGGCG GCAACGCTCG GCCGGGCGGG CGGAGATCGA TGATTGGACC CGCCCGGCAC GATTCGAGCT CACCGGGCCG AATCGACAGA TCCTGGCCTG GCGTTGGTAC TGGGTGACGG ACCGGCTGAC CACCAGCCCC CACGAAGTGA AGGCGCGGGA GTCGCTGTCC CGCTTGCTCG GGGGGCGCGA CGATGCAGCG CTGGTGGTGC TCTATGCGCT TTATCCCGAT GATCCAGAGG AGGTTGAGCC GGCCCTGCGT CGTTTTGCGG AGGCTGCGCT GCCCGAGTTG TTGGGGACCC TGGAGGAGGT TCGGGGACGT TGA
|
Protein sequence | MRQLMASNPD AIVGGGLGDR APWYLHGAAL ALLGVVLAIA FLPTYQAIVG IWSRSETFAH GFLIVPIVLF LVYRLRHPLA DQQPRVQPLA LVPVAGLVLL WVLGALVDVD SVRHFAAVLL IPAVVWLSLG NAVAWTLLFP LAYLISAVPF GEFLVPPLMD WTADFTVWAV QQTGVPVYRE GLNFELPTGR WSVVEACSGV RYLIATVALG TLYAYLVYRS WMRRLVFVAF SFLVPILANG LRAYAIVMIG HLSGMELAAG VDHLIYGWVF FGAVIALMFW IGTYWREDRP ISEGAAPGPG GGGMAERLSD STGLGSRSVA AVAGVALTGG VLVASGPLYA GWMNQRDLGP VAGLEEAELP LNDWEAIEAD PWEPGYRNAR AAFHRHYVDG QGVPVGVYVG YYREQFRHGN MITWDNTMAG RDRDAWRQRS AGRAEIDDWT RPARFELTGP NRQILAWRWY WVTDRLTTSP HEVKARESLS RLLGGRDDAA LVVLYALYPD DPEEVEPALR RFAEAALPEL LGTLEEVRGR
|
| |