Gene Information |
Locus tag | Hhal_1891 |
Symbol | |
ID | 4710689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2076839 |
End bp | 2079040 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639856364 |
Product | hypothetical protein |
Protein accession | YP_001003457 |
Protein GI | 121998670 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
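The coordinate fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon, and that the gene length counts the terminal stop codon while the protein length does not. Both assumptions match the listed values but are not stated in the record. The function names are hypothetical.

```python
# Sketch only: assumes 1-based, inclusive coordinates and a CDS whose
# length includes the stop codon. Function names are illustrative.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2076839, 2079040)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed Gene Length (2202 bp) and Protein Length (733 aa).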
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGTATT TTCCGAAGTT TAGCGCGCTC GCCCTTGTGG CTTCAGGCGC CCTCGTACTC TCTGCCTGTG ACGACGCCAG TGATGGCGAC AGCGGCGGTG GCAGCAACGT TCAACTCAGC GGAACCGTGG TCGACGGCTA TGTGGCCGGC GCCCGGGTCT GGGTCGATCT CCAGGACAAT GGCCAGATCA ATTCCTGGGA TCCGGTCGCG CGGACGGATC GTTACGGCTT CTTCTCCTAT CGCCCGGAAT TAGATATTAA CGGCGATACG ATTCCCGCTC GCGACTATTG CGACCCCTAC GGCGACCACT ACAACGAGCG CTACTGTCTT CGTGTCTCCG ACGCGCACGA AGGCGGAACG CTGAGGATGG TGGGGGGGTA CGACGTCCTC ACCGGGGAGC CCTTCGAGGG CAGCATGAGC TACCGGCTCG ATTCGATCCA TGAGATCCGG AACCCGGCGG ATCTTGTCGT GAATCCGCTG ACGTCCGTTG CCAACCGAAA AGGCGATCCG GTCGGTGATT TCGGCGTCGA TTTTTGGGGA GACGCCGATG GCAGTAATGG CTGGGCTTGG GAGAGTGCGA AGGACGCCGA CAAGAAAGCG CTCTTTGAAG CCCTGGCTCT GCATAAGACG GTTGACGTGT TGGCTGCCGG TTTGGATGGT TGGCTCGATG AACAGGGAGT CGATCGCGAG AGAGTCGAGG ATGCTCTTGG GCTCGATCTC TCAATGGAGC TCTACCGGCT GATTCGTGAA AAACTGGGTA AACCCGATCT TGACGGGAGT GATCCGGGTG GCGTTGATTT TTCTCTAGAT AAAAGTGCTA TAGAAGATAT CCTTGGTGGT TTCAAGGAGT CTATATCATC AATCGATGAT AATTTTGGCA CTGATGGGTT CTCCAGTGAC GACGTGGCGG AAATCAACGA AGCCCTAAGA GGCCTGAAGC CACCCCCCTC CGGAAGCGCC ATTGGGGGCG CGTCAGAAAA CGACGTAAAG AGCAAAGCCC GTGCGGGCGA GGTGGCCGCT GCCGTTGCTC GCGAAGATGC GCGCTCCAGC GATAGTCCTA GTGAAGTAAG TGCGACTGGA AGCGGTAGTT CGAAGAACGG TGATTACCAA AAAATCCTTG ACGGGCTTTC CAGGGCGGGG GTCGGCGAAC AAGTCGATAT CCGCCAGCTC AGTGACTCCA TCCGGGGGAA AAGGGATAAC GGAGGACTTG ACGACAGTGA GGACGGCTGG GCTGAACTGA TCACTCAAAG CACAGTAAAG GCCGTCCTCC CGGAAATCAG CAAGACCAAG CTCGCACTTG AGGTAACGGG CGGGGACGCC GGGGAAGATA CCGGCAGGAT CCGCTTCTTC TTTGAGCCTG CATCTGATTT TCCCTCGGAT GATTATGCGG ACCCCGGGTT TTACAAGGGT GATGAGCACA AAGGTTACTC CGACGAAGGT GCGCTTACCG TCTGTCTCGA CGGAACATTT GAGAGCTTCG ACCTCGATCT TGATTCGGAG CAGGAAGAAG AGATCAAGGG TGGGGGCGGC GAGGCCGGCG AGGACAGCGT GGCACTGCGC ATGACCGGCA CATGGGAGCG CGCATCAGAC CGCGCCCTGC TCCTGAATGT CAACTTCGCC GGCGTGGAAG AGACGATGCA TCTGCGAGTG CGTAGCAGCT CGAACGATTA CACGCTCTCG CAGTTCTATA ACGAGACATT TTGGAAAAAA ACCGACGACG ACAGAGAGTG GTCAGGTTCT CCGAAGTTTA ATGATAGCGA AGAGCTCAAG CGCAGCGAAT TCTGGTCAAA GCTGTCGGAT 
CGCTACTGGG ATGAGGACTC CTGGCACGAG GGTGAGTGGC GAGGAGAGAA TGGTAATGGA AGTCTGGACT CAGGTGACTG GCGCGACGTA TTCCTTGGCG AAGAGTTCGA GCTGGATGAT GAAGACCGAT TCAACTTTGC CCACGATAAG GAAGAAAACG GTTCGGGATG GGAGCGTAGC AATGCCGCTG ATTTCGACGG CCTGACCCCC CGCGGCTGGG TGTTCGACCT TAGTTACGAG GGGGATACCG AATCCTGGGA TGTCGAGCAG ACGCGGGACG AGGCGCTTGC AGCGCTGGAA GAAGAATCGA ATGAGGACGC GGATGACTTC GAAATCCTCG TTTTCGAGGC GTTCGATGGA GAGGCCCCGG ATTCCCACGG GGCTTGCGGT GAGTCGCGGT AA
|
Protein sequence | MWYFPKFSAL ALVASGALVL SACDDASDGD SGGGSNVQLS GTVVDGYVAG ARVWVDLQDN GQINSWDPVA RTDRYGFFSY RPELDINGDT IPARDYCDPY GDHYNERYCL RVSDAHEGGT LRMVGGYDVL TGEPFEGSMS YRLDSIHEIR NPADLVVNPL TSVANRKGDP VGDFGVDFWG DADGSNGWAW ESAKDADKKA LFEALALHKT VDVLAAGLDG WLDEQGVDRE RVEDALGLDL SMELYRLIRE KLGKPDLDGS DPGGVDFSLD KSAIEDILGG FKESISSIDD NFGTDGFSSD DVAEINEALR GLKPPPSGSA IGGASENDVK SKARAGEVAA AVAREDARSS DSPSEVSATG SGSSKNGDYQ KILDGLSRAG VGEQVDIRQL SDSIRGKRDN GGLDDSEDGW AELITQSTVK AVLPEISKTK LALEVTGGDA GEDTGRIRFF FEPASDFPSD DYADPGFYKG DEHKGYSDEG ALTVCLDGTF ESFDLDLDSE QEEEIKGGGG EAGEDSVALR MTGTWERASD RALLLNVNFA GVEETMHLRV RSSSNDYTLS QFYNETFWKK TDDDREWSGS PKFNDSEELK RSEFWSKLSD RYWDEDSWHE GEWRGENGNG SLDSGDWRDV FLGEEFELDD EDRFNFAHDK EENGSGWERS NAADFDGLTP RGWVFDLSYE GDTESWDVEQ TRDEALAALE EESNEDADDF EILVFEAFDG EAPDSHGACG ESR
|
| |
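The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the strand is "-"), strips the 10-character spacing blocks, and uses the codon assignments of translation table 11, which are the same as the standard code. Alternative start codons are not special-cased. All names are hypothetical.

```python
# Sketch only: codon assignments of the standard code, which translation
# table 11 shares (table 11 differs in permitted start codons, not handled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGTGGTATT TTCCGAAGTT TAGCGCGCTC"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields.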