Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1969 |
Symbol | |
ID | 4710336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2169472 |
End bp | 2171193 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856442 |
Product | surface antigen (D15) |
Protein accession | YP_001003535 |
Protein GI | 121998748 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0729] Outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTGG CTGCGCTGGT CCTGCTGATC CTCCTGGCCC TGCTTGGCCG TCCGGCCTAC GCCGGGGTGG AGGTGGTCAT CGAGGGGGTT AGCGGCGAGC TCGCCGATCA GGTGCGCGGC CACGTGGGGG AGCCGGCGTC TGCCGATCCC GCCGCGATCA CCGCGTTCCG TCGCCGCGCG GTGGAGCGGG CCGAACGCGG TTTGCAGGCC GTGGGCCACT ACGATGCACA GATCGAGGTG CGTCGGGAGC GTCTCGACGA ACAGGTGCGA TTGACCATCG TCGTCGACCC TGGCGAACCG GTGCGTCTGA GCCGGATCCA TGTGCTGATC ACCGGACCGG GAGGGACCGA TCCGGCCTTC GCTGGCATCG AGCAGCGCTT GGGGATCGGT GAGGGTGATG TTCTCCACCA CGGTCGCTAC GAGGCGGCGC GTCGGGCTAT CCAGAACCTG GCGCTGGACC AGGGGTACTT CGATGGTCGC TACGTCACCC GGCGCGTGGA GGTCGACCCG GAGGCCCGCG AGGCCGAGGT GATCCTGCAC TACCACACCG GTGTGCGCTA CCGCTTCGGC GCGGTACGGT TCTCGGAATC GCCGCTGGCC GAGGCGTTCC TGCAGCGGCT GGTTCCCTTC GAACGCGACG AGCCGTATAC CGCCGAACAG GTGGCGGCCT TCAACCGCGC GTTGCTCGAC AGTGGCTACT TTTCCGATGT GCGGGTGCGT CCGCGGCGGG ATCGCACCGA GGATGATCAG GTGCCGGTGG ACGTGGACCT CTCCGCGCGG GCCCGGCACG AGATCACCAC TGGCGTCGGC TTCACCACCG ACCTCGGCGC CCGCGTGCGC CTGGGCTGGC GTCGGCCGTG GGTGAACCAG TGGGGGCACT CGCTGGCGGT GGAGAGCGAG ATCGCCGAGC GCCGGCAGAA CCTCATCAGC ACCTATACGG TGCCGCTGCG CGATCCCCTG CGCACCCAGC TGGAGTACCA GCTCGGTATC CAGGCGCAGG ACGTGGCCGA TATCGACACC GAACAGGTCA CCGCATCGGT TCAGCACCGG CATCGCCTCG AGAGCGGTTG GCAGCAGGTC CTGTCCCTGC GCGCCTACCG CGAGCGTTAC CGCATCGACG ACGATCAGCG CACCACCCAG CTCTACATCC CCGGGGTTAG CTGGAGCCGG GTGCGCAGCC GCGGTGGGCT CGATCCGCGC TGGGGCGACC GGCAAATGCT GAGCCTGGAG GTGGCCGACC CGGATCTGGC TTCGGACATC GAGCTGCGCC GCGTGCGCAC CGCCACCCGC TGGGTGCGCA CGCTGGGGGA GCGCCACCGG TTCCTTATCC GTGGCGAGGT CGGTGCGCTG GCCACGGACT CGTTCGTCGA TGTGCCGCCT TCGCTGCGCT TCTACGCCGG TGGCGATCAG AGTGTACGCG GCTATAAGTA CCAGACTCTG GGGCCCGAGG AAGATGGGAC CACCATCGGT GGCCGTTATC TGGCGGTGGG CAGTGCCGAA TACGGTTATC AGCTCACCCC CAACTGGCGC CCGGCGATCT TCGTGGACAG CGGTAATGCC TACGCCGACT GGGATGACTT GAGTGCTGAG GCGAAGACGG GTGCTGGCTT CGGCATCCGC TGGTCGTCCC CGGTGGGGCC GGTCCGCCTC GACCTCGCCT CCACGGTGGG GGAAGCGGAC GACTCCTGGC GTCTCCACTT CTCGATGGGG TCGGATCTGT GA
|
Protein sequence | MRLAALVLLI LLALLGRPAY AGVEVVIEGV SGELADQVRG HVGEPASADP AAITAFRRRA VERAERGLQA VGHYDAQIEV RRERLDEQVR LTIVVDPGEP VRLSRIHVLI TGPGGTDPAF AGIEQRLGIG EGDVLHHGRY EAARRAIQNL ALDQGYFDGR YVTRRVEVDP EAREAEVILH YHTGVRYRFG AVRFSESPLA EAFLQRLVPF ERDEPYTAEQ VAAFNRALLD SGYFSDVRVR PRRDRTEDDQ VPVDVDLSAR ARHEITTGVG FTTDLGARVR LGWRRPWVNQ WGHSLAVESE IAERRQNLIS TYTVPLRDPL RTQLEYQLGI QAQDVADIDT EQVTASVQHR HRLESGWQQV LSLRAYRERY RIDDDQRTTQ LYIPGVSWSR VRSRGGLDPR WGDRQMLSLE VADPDLASDI ELRRVRTATR WVRTLGERHR FLIRGEVGAL ATDSFVDVPP SLRFYAGGDQ SVRGYKYQTL GPEEDGTTIG GRYLAVGSAE YGYQLTPNWR PAIFVDSGNA YADWDDLSAE AKTGAGFGIR WSSPVGPVRL DLASTVGEAD DSWRLHFSMG SDL
|
| |