Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0599 |
Symbol | |
ID | 4709682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 676122 |
End bp | 677696 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855057 |
Product | hypothetical protein |
Protein accession | YP_001002187 |
Protein GI | 121997400 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.674493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCAGG CGATTCGTGA CGGCATCAAG GGCTGGATTG CGTGGGTCAT CATCGGGTTC ATCGCCCTGC CGTTCATCTT CATGGGTGGC TACGAGTACT TCGGTGGTGG CCAGGACGAT GCAGTGGTCG CCCGCGTTGA TGGCGAAGAA ATCCCGCGCT CCCAGATCGA TCAGGCCGTC GAGCGCCAAC GCGCGCAGTT GCGCGAGATG TTTGGCGGCG ACCTGCCGGA TGGCGCCTTC GACGGCGCCG CGCTACGCCG CGAGGCCCTG GAGCAGCTGA TTGACGAGCA GCTGTTGCAC GCCTACGTGG GCAAGCAGGG GCTGCGGGTC ACGGATCAGG AGGTGGCGCA GACCATCCGC GGCCAGGAGA TCTTCCACGA GGGCGGGCAG TTTTCCCGGG CCCGATATCA GACCCTGCTC GAGCGCAACC GCCTCACGCC CGAGGACTAC GAGGGGCTCG TCCGGCGGGA TCTGAAGGCT GACCAGTTTC AGCAGGCGGT CTTCGCCAGC AGCATCAGCA CCCCCTCGCA GCTCGAGCGG CTGGTACGTC TGCAGGATGA GTCGCGCAGC TTTAGCTATG TGGAGATCGA CGCCGACCGG TACACCGACG AGGTCAGCGT CGACGACGCC GAGGTCGAGG CCCATTACGA GGCCCACACC GATGACTACA TGGCCCCGGA GGCGGTGCGT CTCGAGTATG TCGAGCTCGG GCCGCTGGCC CTGCGGGATC AGGTCGACGT CGACGACGAG ACGCTGCAGG AGCGCTACGA CGAGCGCTAC GGCGACGACG ATGACCCGCC AACGTTCGAT GACGTCCGCG AGGAGCTGTT GGCCGACTCT ATCCGTGAGC AGTACCGCAC GGAGCTGATC GAGGCCGGCG ACGAGCTCGG CAACATCGCC TTCGAGCAAC CCGACAGCCT GGAGCCGCTT GTCGACACCT TCGGGCTGGA GGTCCGCACC AGTGACTGGA TCGACCGCGA CGGCGGCGAA GGGATTGGCG ATCTCTCCGA GGTGGTGGAG GAGGCCTTCA GCGAGGACGT GCTCGAGCAC GGTTACAACA GCGACCTGAT CCGCGTCGAC GAGGATCGTT ACCTGGTGGT CCGGCTGCTT GAGCACCGCG AGGCGGAGCC GAAGCCGCTG GAAGAGGTGG CGGATACCAT CCGCGAGCAG CTTCGGCAGG AGCGCGCTGC CGACCTCGCC CGGGAACGGG CCGAGGAGCT GGTCGCTCGG CTGCGCGACG GCGACTCTCT GGATGAGCTG GCCGAGGAGC TGGAGGTCGA GCGCTTCACT GTGGAAGACG CCTACCGGGA CGATCGCAGC CACCCGGAGG CCGTGGTGCG TGAGGCCTTT GCCCTGGAGG TGGACGGCTA CGCCCGGGTG GAGCTGGATG ACGGCTCCGC GGCGCTGCTC CGCCTGGATG GGATCAGCCG CGGCGATCCG GAAGGGCTTT CGGCCCAGGA GCGGCAGCAG TTGCAGCAGC AGCTCCAGCG CATGGCGGGC GACTCGGAGG TGCGGGCGCT GATCCGCGCG TTGCGCGCCG AGGCGGAGAT CGAGATCGCC CGCGAGCGCC TCTAG
|
Protein sequence | MLQAIRDGIK GWIAWVIIGF IALPFIFMGG YEYFGGGQDD AVVARVDGEE IPRSQIDQAV ERQRAQLREM FGGDLPDGAF DGAALRREAL EQLIDEQLLH AYVGKQGLRV TDQEVAQTIR GQEIFHEGGQ FSRARYQTLL ERNRLTPEDY EGLVRRDLKA DQFQQAVFAS SISTPSQLER LVRLQDESRS FSYVEIDADR YTDEVSVDDA EVEAHYEAHT DDYMAPEAVR LEYVELGPLA LRDQVDVDDE TLQERYDERY GDDDDPPTFD DVREELLADS IREQYRTELI EAGDELGNIA FEQPDSLEPL VDTFGLEVRT SDWIDRDGGE GIGDLSEVVE EAFSEDVLEH GYNSDLIRVD EDRYLVVRLL EHREAEPKPL EEVADTIREQ LRQERAADLA RERAEELVAR LRDGDSLDEL AEELEVERFT VEDAYRDDRS HPEAVVREAF ALEVDGYARV ELDDGSAALL RLDGISRGDP EGLSAQERQQ LQQQLQRMAG DSEVRALIRA LRAEAEIEIA RERL
|
| |