Gene Information |
Locus tag | Hhal_1521 |
Symbol | |
ID | 4709509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1648674 |
End bp | 1650218 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639855988 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_001003090 |
Protein GI | 121998303 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
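The coordinate and length fields above are consistent with one another, which a quick calculation can confirm. This is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1648674, 1650218)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Both results match the Gene Length and Protein Length fields listed in the record.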
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0377577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCAGA TGGAGCAGGT CGTCCAGGAA GTGCTGCACC AACTGCGAGC GACCTGGCGT CGACGGTGGT GGATCCTTCC CATCGCTTGG CTGGTCTGTA TACCCGGGTG GGCGTACATC CACGCCCTCC CCGACACCTA CGAGGCGTCG AGCGAGGTCT ACGTCGATAC GGATTCGGTG CTCGGTCCAC TGCTCGGCGG CATGACGGTG CGACCGGACG CTGAGCAGCG GATGAACATG ATTACCAGCA CGCTGCTGAG CCGGGATAAC CTGCGTGAGA TCGCGCGGCA GGCGGACCTC GACATTCTCC TCGGCTACGA CGACATCGAC CGCGCCGTCG ATAGCCTGCA GGACATCGAT CTGCGCGGGG GTGACGAAAG TGGGGCTGGG GACAATATCT ATACGATCAG TTTTTCCGAT GAGGACCCGG AGGTGGCCTA TCGCGTGGTG CGTCAGACCG GTGACCTCTT CATGGAGCGG GGCCTCGGTG ACCCGCGGAC GGATCTGACG GCCTCGCGCG ATTTTATCGA GAATCAGCTC GACCGTTACG GCACTCGGCT GCGGGCGAAG GAAGAGCAGC TTGAGGATTT TCGCCGTGAG CACTCGGAGG TTCTGTTGGC CGGCGGCGAT TTTTATAGCC GCTTGCGGGC CGAGCGTGAG CGGTTAGCTG AGGCGGAGTT GGAGTTTGAG CAGGCTCAAA GCCATTATGA GAGCCTCATC GCACAGCTTG AGGGCGATGG CGATCACCCC GGATTAGTGC AGCCTCCGGA GTTCGAGAAT CCGGAGCTAG ACGCTCGCAT CAGTAACCTG GAATCCGAGC TTGATGAGCT GCGCCGCCAG TATACCGATC AGCATCCGGA CGTGGCGCAT ACGCAGCGCG TCCTTGAGGA TCTGCGCGAG GAGCGGGAGG AGCAGGCGGT GGAGTTCAGC GAGAGCCTGA TGGCCTCCCC GGTAGATCGC GTTGGTCTAG GTCAGCCCGA TCATCCGCTG CAGCTGGAGT TGGCCGAATC GGAAAGCCGT GTGGCCTCGC TGGAGACTCG GGTTGAGGAG CAGCGCCTGC GGGTCCAGGA ACTTGAGGCG GTCTCCGACG ACGTGCCGGA GATCGAGTCG GAATACAGTC GGCTCACACG GGACTACGAA GTCCTGCAGA ACAGCTACTC TGAGTTACGG GATCGGCTTG AGCAGGCTGT GCTGACCGGC GAAGTGGAGT CAGGCGCGGA CTCGGTGGAC TTTCGCGTCC TGCAGCCACC GGAGCAGCCA AGTGAGGCGG CAGCGCCGAA CCGCCCCCTG CTGGGCAGCG CGGTTCTGGT CCTGGGGCTG GGGGCCGGCA CGGGATTCGC CTTCCTGCTG GCCCAGATTC GCGGCACGGT GGCCGCGCCC GGCCAGCTCG GCGAGCTCAC CGGGCGGCCC GTCATGGGGC AGATCTCGCG GGTGCGAACG CCGGCCCACC GGCGCCGCAG GCGCATGGAG ATGCTCGTCT TCTTCAGCGC AACCGGAGCC TTGTTGGTTG CCTATGGGGT GGTGGTTGGG GTCTTTTTTG CCTAG
|
Protein sequence | MGQMEQVVQE VLHQLRATWR RRWWILPIAW LVCIPGWAYI HALPDTYEAS SEVYVDTDSV LGPLLGGMTV RPDAEQRMNM ITSTLLSRDN LREIARQADL DILLGYDDID RAVDSLQDID LRGGDESGAG DNIYTISFSD EDPEVAYRVV RQTGDLFMER GLGDPRTDLT ASRDFIENQL DRYGTRLRAK EEQLEDFRRE HSEVLLAGGD FYSRLRAERE RLAEAELEFE QAQSHYESLI AQLEGDGDHP GLVQPPEFEN PELDARISNL ESELDELRRQ YTDQHPDVAH TQRVLEDLRE EREEQAVEFS ESLMASPVDR VGLGQPDHPL QLELAESESR VASLETRVEE QRLRVQELEA VSDDVPEIES EYSRLTRDYE VLQNSYSELR DRLEQAVLTG EVESGADSVD FRVLQPPEQP SEAAAPNRPL LGSAVLVLGL GAGTGFAFLL AQIRGTVAAP GQLGELTGRP VMGQISRVRT PAHRRRRRME MLVFFSATGA LLVAYGVVVG VFFA
|
| |
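The sequence fields are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how they could be cleaned and cross-checked, assuming the gene sequence is written 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand). It uses the standard amino-acid assignments, which translation table 11 shares with table 1; the two tables differ only in the permitted start codons. The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; alternative start
# codons (which would be read as M at position 1) are NOT handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field, copied with their spacing.
fragment = "ATGGGGCAGA TGGAGCAGGT CGTCCAGGAA"
print(translate(fragment))    # MGQMEQVVQE
print(gc_fraction(fragment))  # 0.6
```

The translated fragment matches the first ten residues of the Protein sequence field. The GC fraction of this short fragment need not equal the 64% reported for the whole gene; running `gc_fraction` on the full Gene sequence field would be the way to check that value.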