Gene Hhal_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1969 
Symbol 
ID4710336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2169472 
End bp2171193 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content68% 
IMG OID639856442 
Productsurface antigen (D15) 
Protein accessionYP_001003535 
Protein GI121998748 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTGG CTGCGCTGGT CCTGCTGATC CTCCTGGCCC TGCTTGGCCG TCCGGCCTAC 
GCCGGGGTGG AGGTGGTCAT CGAGGGGGTT AGCGGCGAGC TCGCCGATCA GGTGCGCGGC
CACGTGGGGG AGCCGGCGTC TGCCGATCCC GCCGCGATCA CCGCGTTCCG TCGCCGCGCG
GTGGAGCGGG CCGAACGCGG TTTGCAGGCC GTGGGCCACT ACGATGCACA GATCGAGGTG
CGTCGGGAGC GTCTCGACGA ACAGGTGCGA TTGACCATCG TCGTCGACCC TGGCGAACCG
GTGCGTCTGA GCCGGATCCA TGTGCTGATC ACCGGACCGG GAGGGACCGA TCCGGCCTTC
GCTGGCATCG AGCAGCGCTT GGGGATCGGT GAGGGTGATG TTCTCCACCA CGGTCGCTAC
GAGGCGGCGC GTCGGGCTAT CCAGAACCTG GCGCTGGACC AGGGGTACTT CGATGGTCGC
TACGTCACCC GGCGCGTGGA GGTCGACCCG GAGGCCCGCG AGGCCGAGGT GATCCTGCAC
TACCACACCG GTGTGCGCTA CCGCTTCGGC GCGGTACGGT TCTCGGAATC GCCGCTGGCC
GAGGCGTTCC TGCAGCGGCT GGTTCCCTTC GAACGCGACG AGCCGTATAC CGCCGAACAG
GTGGCGGCCT TCAACCGCGC GTTGCTCGAC AGTGGCTACT TTTCCGATGT GCGGGTGCGT
CCGCGGCGGG ATCGCACCGA GGATGATCAG GTGCCGGTGG ACGTGGACCT CTCCGCGCGG
GCCCGGCACG AGATCACCAC TGGCGTCGGC TTCACCACCG ACCTCGGCGC CCGCGTGCGC
CTGGGCTGGC GTCGGCCGTG GGTGAACCAG TGGGGGCACT CGCTGGCGGT GGAGAGCGAG
ATCGCCGAGC GCCGGCAGAA CCTCATCAGC ACCTATACGG TGCCGCTGCG CGATCCCCTG
CGCACCCAGC TGGAGTACCA GCTCGGTATC CAGGCGCAGG ACGTGGCCGA TATCGACACC
GAACAGGTCA CCGCATCGGT TCAGCACCGG CATCGCCTCG AGAGCGGTTG GCAGCAGGTC
CTGTCCCTGC GCGCCTACCG CGAGCGTTAC CGCATCGACG ACGATCAGCG CACCACCCAG
CTCTACATCC CCGGGGTTAG CTGGAGCCGG GTGCGCAGCC GCGGTGGGCT CGATCCGCGC
TGGGGCGACC GGCAAATGCT GAGCCTGGAG GTGGCCGACC CGGATCTGGC TTCGGACATC
GAGCTGCGCC GCGTGCGCAC CGCCACCCGC TGGGTGCGCA CGCTGGGGGA GCGCCACCGG
TTCCTTATCC GTGGCGAGGT CGGTGCGCTG GCCACGGACT CGTTCGTCGA TGTGCCGCCT
TCGCTGCGCT TCTACGCCGG TGGCGATCAG AGTGTACGCG GCTATAAGTA CCAGACTCTG
GGGCCCGAGG AAGATGGGAC CACCATCGGT GGCCGTTATC TGGCGGTGGG CAGTGCCGAA
TACGGTTATC AGCTCACCCC CAACTGGCGC CCGGCGATCT TCGTGGACAG CGGTAATGCC
TACGCCGACT GGGATGACTT GAGTGCTGAG GCGAAGACGG GTGCTGGCTT CGGCATCCGC
TGGTCGTCCC CGGTGGGGCC GGTCCGCCTC GACCTCGCCT CCACGGTGGG GGAAGCGGAC
GACTCCTGGC GTCTCCACTT CTCGATGGGG TCGGATCTGT GA
 
Protein sequence
MRLAALVLLI LLALLGRPAY AGVEVVIEGV SGELADQVRG HVGEPASADP AAITAFRRRA 
VERAERGLQA VGHYDAQIEV RRERLDEQVR LTIVVDPGEP VRLSRIHVLI TGPGGTDPAF
AGIEQRLGIG EGDVLHHGRY EAARRAIQNL ALDQGYFDGR YVTRRVEVDP EAREAEVILH
YHTGVRYRFG AVRFSESPLA EAFLQRLVPF ERDEPYTAEQ VAAFNRALLD SGYFSDVRVR
PRRDRTEDDQ VPVDVDLSAR ARHEITTGVG FTTDLGARVR LGWRRPWVNQ WGHSLAVESE
IAERRQNLIS TYTVPLRDPL RTQLEYQLGI QAQDVADIDT EQVTASVQHR HRLESGWQQV
LSLRAYRERY RIDDDQRTTQ LYIPGVSWSR VRSRGGLDPR WGDRQMLSLE VADPDLASDI
ELRRVRTATR WVRTLGERHR FLIRGEVGAL ATDSFVDVPP SLRFYAGGDQ SVRGYKYQTL
GPEEDGTTIG GRYLAVGSAE YGYQLTPNWR PAIFVDSGNA YADWDDLSAE AKTGAGFGIR
WSSPVGPVRL DLASTVGEAD DSWRLHFSMG SDL