Gene Hhal_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1947 
Symbol 
ID4710761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2143899 
End bp2145080 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID639856420 
Producthypothetical protein 
Protein accessionYP_001003513 
Protein GI121998726 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.309501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGGA CATCGACCGT GGCGGCTGTG GCGGGACTGT TTGCATGGGG CGGTGCGGCG 
TCGGCGGCAC AGGCCGGCGA GGTCAGCCTT TATGGGCAGG TCCACGCCGG CCTGCACCAG
TTCGCCTACG AGGACTCTTC CGACGTCACC AAATTCACGG ACCAGGGGCG GACGCGCTGG
GGCGTTTCCG GGCAGCAGCC CCTGGACGAC GAGTGGACGG CCATCGGCCA ACTGGAGTGG
AGCGCCAGCC CGCACCTGGG GGACGACGAT TTTAGTCGAC GGATCAGCTA CGTGGGAGTG
GACAGCCCCT ACGGTGAGCT GGTCGTCGGC ACGGTCCACG CCGCCTACAA GACCCTCGGC
GGCGTGCGCT GGGATCCGCT GGTGGCCACC GAGCTGCAGC AGCGGCGCAC CGGCGGGATG
TCCGGCGGCT CCTTCGGCCA CAACGATTTC GTCAACCGCG CGGTGCAGTA CGTCAGCCCG
GAGATGGCCG GGCTGCAGCT GCACGCCCAG ATCGGGGTGG AGGACGACAA CCAGGACCGG
ACGCTGTCGA GCCCCGATCC GGGTGATGCG GATCAGGACC TGCAGCAGGG CGATGTCATC
CTTGGTGCCA GCTACCTCGG CCTCCCTGAC TGGCACTTCA TCGCCGCGGT GATGCACCTC
GACGAGCGGT TCACCGACGT GGATGACGTG GACGACGGCG ATACCAACTG GAAGGTGGGG
GCCCGCTGGG CGCCGGATGC GTTCTCTCTG GCCTACCAGT ACGAGTCGGT GGAGATCATC
CGCGGGCCCG GGGGCGCCGG CCGGATCGAC AACCTAGTGG GGGATCCCTC AAACCGGGTG
GACGGCGAGA GTACAACGGA TGACGACCCA GCCTTCTACG ACGGCCGCTT CACCGACGCG
GTGGACCACC ACGCGCTGAT CGGGACCTAC CAGCAGGGGC GCAACCAGTG GGTGCTCGCC
CTCGGTCACG CCGACGCCGA TGGCGACGAC GAGGACGTCA GCTCGATCAC CGGCGCCGTG
GTCCATCAGC CCCACGAGGA TTTCCGGGTG TACGCCGGCG TGCAGTACCA GTCCTTCGAC
GATGCGATCG GTAGCGCCGG CGAGGATCCC GACGAGGCCG CCGACGATCA CCTGACCACC
TACGCCATCG GCGCCCGGTA CGACTTCGGG GCGACGTTCT GA
 
Protein sequence
MRRTSTVAAV AGLFAWGGAA SAAQAGEVSL YGQVHAGLHQ FAYEDSSDVT KFTDQGRTRW 
GVSGQQPLDD EWTAIGQLEW SASPHLGDDD FSRRISYVGV DSPYGELVVG TVHAAYKTLG
GVRWDPLVAT ELQQRRTGGM SGGSFGHNDF VNRAVQYVSP EMAGLQLHAQ IGVEDDNQDR
TLSSPDPGDA DQDLQQGDVI LGASYLGLPD WHFIAAVMHL DERFTDVDDV DDGDTNWKVG
ARWAPDAFSL AYQYESVEII RGPGGAGRID NLVGDPSNRV DGESTTDDDP AFYDGRFTDA
VDHHALIGTY QQGRNQWVLA LGHADADGDD EDVSSITGAV VHQPHEDFRV YAGVQYQSFD
DAIGSAGEDP DEAADDHLTT YAIGARYDFG ATF