Gene Hhal_1514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1514 
Symbol 
ID4710715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1640204 
End bp1641796 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID639855981 
Producteight transmembrane protein EpsH 
Protein accessionYP_001003083 
Protein GI121998296 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0189449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGCAGC TGATGGCCAG TAACCCGGAT GCGATTGTCG GCGGTGGGCT CGGGGATCGA 
GCGCCGTGGT ACCTGCACGG GGCGGCATTG GCACTGCTCG GCGTGGTGCT GGCGATCGCC
TTCCTGCCAA CCTATCAGGC TATTGTGGGT ATTTGGTCAC GCTCAGAGAC CTTCGCCCAC
GGTTTCCTGA TCGTGCCCAT CGTCCTCTTC CTGGTTTATC GGCTGCGTCA CCCCCTGGCG
GATCAACAGC CCAGAGTGCA ACCACTGGCC CTGGTGCCGG TCGCCGGACT GGTCCTGCTC
TGGGTGCTCG GCGCGTTGGT GGACGTCGAC TCCGTGCGCC ACTTCGCGGC GGTGCTCCTG
ATCCCGGCTG TCGTCTGGCT GAGCCTGGGC AACGCCGTCG CCTGGACGCT GCTCTTCCCG
CTGGCCTACC TGATCTCTGC CGTCCCATTC GGTGAGTTCC TGGTCCCGCC GCTGATGGAC
TGGACTGCCG ACTTCACGGT ATGGGCAGTG CAGCAAACCG GCGTGCCGGT CTATCGCGAA
GGACTGAACT TCGAGTTGCC GACCGGCCGC TGGTCCGTGG TCGAGGCGTG CAGTGGCGTG
CGTTACCTCA TCGCTACCGT CGCCCTGGGC ACGCTGTACG CCTACCTGGT TTACCGAAGC
TGGATGCGCC GGCTGGTTTT CGTGGCCTTC TCGTTCCTGG TGCCGATCCT CGCCAACGGT
CTGCGCGCCT ATGCGATCGT GATGATCGGA CATCTCAGTG GCATGGAGTT GGCCGCGGGG
GTCGATCATC TGATCTATGG CTGGGTGTTC TTCGGTGCGG TGATCGCGCT GATGTTCTGG
ATCGGGACCT ACTGGCGCGA GGATCGGCCG ATCTCCGAGG GGGCAGCGCC CGGTCCAGGC
GGCGGTGGCA TGGCGGAGCG GCTTAGCGAC AGTACAGGGC TTGGATCGCG TTCGGTTGCT
GCGGTTGCCG GGGTGGCGTT GACGGGGGGA GTACTGGTCG CTTCCGGGCC GCTCTACGCC
GGATGGATGA ATCAGCGGGA TCTCGGCCCT GTTGCCGGGT TGGAGGAGGC GGAGCTGCCC
CTCAATGACT GGGAGGCGAT CGAGGCCGAT CCCTGGGAGC CGGGGTATCG CAACGCGCGC
GCGGCCTTCC ACCGGCACTA TGTCGATGGG CAAGGGGTTC CGGTGGGGGT CTACGTGGGC
TACTACCGGG AGCAATTCCG GCACGGGAAT ATGATCACTT GGGATAATAC CATGGCCGGC
CGGGATCGGG ACGCCTGGCG GCAACGCTCG GCCGGGCGGG CGGAGATCGA TGATTGGACC
CGCCCGGCAC GATTCGAGCT CACCGGGCCG AATCGACAGA TCCTGGCCTG GCGTTGGTAC
TGGGTGACGG ACCGGCTGAC CACCAGCCCC CACGAAGTGA AGGCGCGGGA GTCGCTGTCC
CGCTTGCTCG GGGGGCGCGA CGATGCAGCG CTGGTGGTGC TCTATGCGCT TTATCCCGAT
GATCCAGAGG AGGTTGAGCC GGCCCTGCGT CGTTTTGCGG AGGCTGCGCT GCCCGAGTTG
TTGGGGACCC TGGAGGAGGT TCGGGGACGT TGA
 
Protein sequence
MRQLMASNPD AIVGGGLGDR APWYLHGAAL ALLGVVLAIA FLPTYQAIVG IWSRSETFAH 
GFLIVPIVLF LVYRLRHPLA DQQPRVQPLA LVPVAGLVLL WVLGALVDVD SVRHFAAVLL
IPAVVWLSLG NAVAWTLLFP LAYLISAVPF GEFLVPPLMD WTADFTVWAV QQTGVPVYRE
GLNFELPTGR WSVVEACSGV RYLIATVALG TLYAYLVYRS WMRRLVFVAF SFLVPILANG
LRAYAIVMIG HLSGMELAAG VDHLIYGWVF FGAVIALMFW IGTYWREDRP ISEGAAPGPG
GGGMAERLSD STGLGSRSVA AVAGVALTGG VLVASGPLYA GWMNQRDLGP VAGLEEAELP
LNDWEAIEAD PWEPGYRNAR AAFHRHYVDG QGVPVGVYVG YYREQFRHGN MITWDNTMAG
RDRDAWRQRS AGRAEIDDWT RPARFELTGP NRQILAWRWY WVTDRLTTSP HEVKARESLS
RLLGGRDDAA LVVLYALYPD DPEEVEPALR RFAEAALPEL LGTLEEVRGR