Gene Hhal_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1458 
Symbol 
ID4710998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1572622 
End bp1574964 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content67% 
IMG OID639855925 
Productsurface antigen (D15) 
Protein accessionYP_001003027 
Protein GI121998240 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCCGAC GCTGGGGGCG CAATACGGTC GTCGCGCTGC TGCTCGGCGC GGCTACGGTC 
GCTACGGCCA CGGCGGATTC CTTCGTCGCC GAGCAGATCC GGGTCGAAGG ACTGGAGCGG
ATCGACGAGG GCACGGTCTT CAGCTACCTG CCCATCGAGC CCGGTGACCG CGTGGGGAGC
GGCGAGGTGG CGGGGGCGAT CCGTGAACTC TACCGCTCCG GCTTCTTCCG TGACGTCGAG
CTGGCTCGGG ACGGCGATGA GCTAGTGGTG CGGGTGCAGG AGCGGCCGTC GATCGCCCGA
CTCCAGTTCG AGGGCAATGA GCAGATCCGC ACCGAAGACC TGCGCGAAGC CCTGCGCAGC
GCCGGTATGG CCGAGGGCCA GGTGTTCAAC CGTGCGCAGC TGGCCGAGAT CGAGCGGGAG
CTGCGTCAGC AGTACTTTGC CCAGGGCAAG TACGACGTCT CCGTCGAGAG CACCGTCTCC
CCGCTCGAGC GCAACCGGGT GGCGATCCGC ATCGATATCG AAGAGGGCCC CGTCGCCGAG
ATCCGCGAGA TCCACTTGAC CGGTAACGAG GCGTACTCCG ACCGCGAACT GCGCGGCGTG
TTCGAGCAGC AGGCCCGGCG CTGGTGGGCG CCGTGGTCGC AGCGGCACCG CTATGCTCGC
GATCTGCTCG CCAACGACCT CGAGAGCCTG CGCGGGCACT ATCTGGACCG CGGGTTCCTG
AATTTCGCGG TCACCTCCAC CCAGGTCTCC ATCAGCCCCG ATCGGCGCGA CGTCCACGTC
GCCGTGAACG TCGCCGAGGG CGAGCAGTAC GAGATCGGCA GTATTGACGT CCAGGGGCAT
CTGGTCGTGC CTCCGGAGGA GCTGCGCGAG CTGATCGCCC TGGAGCCGGG CGAGCTCTAC
TCGCGCAGTC GCGTGAACGC CGGCTCCTCG GCCATTCGGT CCCGGCTGGC CGAGGAGGGC
TACGCCTTCA GCAATGTCAA CGTCCGCCCC CGGGTGGACG AGGAGGAGCA GGTTGTCGAC
CTGACTTATG TGGTGGATCC CGGTGCGCGG GTCTATGTGC GGCGGATCAA CATCACCGGC
AACCAGCGCA CCCAAGACGA GGTCATCCGG CGGGAGATGC GCCAGCTCGA GGGGGCCCCG
CTGTCCAGTG AGGCGCTGGA GCGCTCGCAG ACGCGGCTCA ATCGGCTCGG GTTCTTCGAC
TTCGTCAACG TCGAAACCCC CCGGGTGCCG GGGGAGGACG ACCGGGTGGA TGTGGACGTC
AGTGTCCAGG AACGGATGAC GGGACAGCTG CAGGCCGGCG TCGGTTACGG CGATGTCCAG
GGGTTGCTGG TCAACTTCTC CGTCTCACAG GACAACCTGC TCGGCACCGG CGACCGGCTG
AGCATGACGG TGAACAACTC CAGTGTCTCC ACCATCTACA ACGTCTCGTA TACGGACCGC
TACCACACCC AAGAGGGCGT CTCCCAGACC CTGTCCGCCG GCTACCGCGA GACCCGCTCC
CGCCGGGCCG ACCTGGCAGA CTACGACCTG ACCTCGGGGC ACGGCTCGGT GGAGTACAGC
GTGCCGTTCA CGGAGGTGGA TCGACTGGCG GCGCGGCTGC GCCTGGAGCG CATCGGTATC
GATACGCGCA GCGATACACC GGACTGGATC AGGGAGTATC TCTCGGACCA GGGCGACGAT
CGTTTCACCA TGGTCAAGCC GCGCCTGTCC TGGAACCGGG ATACCCGAGA CCGCGGGGTG
TTTCCCACCG CCGGCGGCCG GCAGCGGCTG TCCCTGGAGG GGACGATCCC CGGCGATGAT
CTGGAGTTCT ACAAGCTGAC CTACGAGAAC CGCCGCTACT GGCCGCTGCG CTGGCTCGGC
GACCGCACCA CCTTCTCCGT GGAGGGCCAG GTCTCCTACG GCGACGGGTA CGGCGATTCC
AGCTCGCTGC CGTTCTTCGA GAACTACTAC GCCGGCGGCG TGCGCACGGT GCGCGGCTAC
CGTGGCAACT ACCTCGGACC GCGCGAGGAG GGCAGGGACC CGATCGGCGG CAACGCCCGC
GTACTCACCA AGGCGCAGGT CATTTTCCCG CCCACGCCGG AGTCGCAGTC GGTGCGCATG
GCGGCCTTCG TCGATGCCGG GCAGGTCTTC AACACCCGCC TGGATGAGTA CGAGTTCGAC
AGAGAGATCA TTCCGGATGG TCTGGATCGG GTGGACCTGA GCGAGCTGCG GGCCGCGGCC
GGTGTCTCGC TGATCTGGAT GTCGCCGGTG GGGCCGCTGA CCTTCAGTCT GGCCGAACCG
CTGAACGACA CGGGGGACGA CGAGACCGAG ACCTTCCAGT TCTCGCTCGG GACGGAATTC
TAG
 
Protein sequence
MIRRWGRNTV VALLLGAATV ATATADSFVA EQIRVEGLER IDEGTVFSYL PIEPGDRVGS 
GEVAGAIREL YRSGFFRDVE LARDGDELVV RVQERPSIAR LQFEGNEQIR TEDLREALRS
AGMAEGQVFN RAQLAEIERE LRQQYFAQGK YDVSVESTVS PLERNRVAIR IDIEEGPVAE
IREIHLTGNE AYSDRELRGV FEQQARRWWA PWSQRHRYAR DLLANDLESL RGHYLDRGFL
NFAVTSTQVS ISPDRRDVHV AVNVAEGEQY EIGSIDVQGH LVVPPEELRE LIALEPGELY
SRSRVNAGSS AIRSRLAEEG YAFSNVNVRP RVDEEEQVVD LTYVVDPGAR VYVRRINITG
NQRTQDEVIR REMRQLEGAP LSSEALERSQ TRLNRLGFFD FVNVETPRVP GEDDRVDVDV
SVQERMTGQL QAGVGYGDVQ GLLVNFSVSQ DNLLGTGDRL SMTVNNSSVS TIYNVSYTDR
YHTQEGVSQT LSAGYRETRS RRADLADYDL TSGHGSVEYS VPFTEVDRLA ARLRLERIGI
DTRSDTPDWI REYLSDQGDD RFTMVKPRLS WNRDTRDRGV FPTAGGRQRL SLEGTIPGDD
LEFYKLTYEN RRYWPLRWLG DRTTFSVEGQ VSYGDGYGDS SSLPFFENYY AGGVRTVRGY
RGNYLGPREE GRDPIGGNAR VLTKAQVIFP PTPESQSVRM AAFVDAGQVF NTRLDEYEFD
REIIPDGLDR VDLSELRAAA GVSLIWMSPV GPLTFSLAEP LNDTGDDETE TFQFSLGTEF