Gene Information |
Locus tag | Hhal_1507 |
Symbol | |
ID | 4709120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1629989 |
End bp | 1631521 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639855974 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_001003076 |
Protein GI | 121998289 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
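The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that `Start bp` and `End bp` are 1-based inclusive positions and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1629989, 1631521)
    print(length, protein_length_aa(length))  # prints "1533 510"
```

Under these assumptions the result matches the `Gene Length` (1533 bp) and `Protein Length` (510 aa) rows of this record.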
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.577803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTTTAG AGCAACTCTA TCAGCACATT ATCCAGCAGC TGCGCGCCTC CTGGCGGCGC CGGTGGTGGC TGATCCCGGT CGCGTGGGCG GTCTGCCTGG TCGGCTGGGC TTACATCAAC ACCTTACCCG ATGTCTATCA GTCTTCAAGC CGCGTCTACG TAAACTCTCA GACCGTGCTC GAACCCCTGT TACGCGGTAT GACCGTCCGT CCGGACACAG AGCAGCGCGT CCGCATGATG ACTGTAACCC TTCTCAGCAA CGACAACCTC AGGGAGATCG CTCGCCAAGC GGATCTCGAT GTCCTGCTCA ACCAGGATAA TGAACAAGCG TTGATCGGCA CCTTGCGCGG CGGCATTCAA TTGGACGGTG GCCGGCGCGA TAACATCTAC ACCATCGCCT TTTCCCACCG GGATCCCGAG GTCGCCTATC GGGTCGTCCG CGAGACCTCC AATCTGTTCA TGGAGCGCGG CCTTGGCGAC TCGCGGGTTG ATCTCGCCTC CTCGCAGACG TTCATCGAGC GACAGCTCCA ACGTTACGCC AGTCAGCTGC AGAACAAGGA GGCTGAGCTT GAATCCTTCA AGCGTGAGAA CCATTCGCTG CTCAGCGCTG GCGGCAACTA TTACACCCGG CTGGAGCGCG CTCGCGACGC CCTTGAGCAG GCACAGCTCG AGCGGGACGA ACACGCCCAA CGTTTAGAAA CTCTGCAGGC CAGGCTCGAA AAAGACAGAC AATCCCCCAT TGCCGAGGAC GCGCATTTGA GTAACCCCCG GCTGGACCAA CGGATCAGCC GGCTGGAATC CCAGCTCGAT GAGATGCGGC GCCACTTCAC GGATGCCCAC CCGGACGTCG CCCAAACTCG ACGCATCCTC AAGGAGCTTG AGGAGCGGCG CCGTGAAGAA ACCCGCATGG CTCTCGCAGA TCCCGCTCGG TCCGTCGAGG GCGTTGTCGG CAGCCCGCTC CGGTTGGCAT TGGTTGACGC AGAAAGCCAT GCTGCTTCGC TGGAAACCCG CGTTCAAGAG CATAAGCGGC GCGTGGAGAA CATCGCCGCG CTTGTCGATC AGGTTCCTGC CATCGAATCC CGATTCAATG CCTTAAAGCG CGACCACGAG GTGCTGCAGC AAAGCTATCG CCAACTGCTG ACCACCCGCG AGCGGGCCGC CATGACCGGG TCCGTCGAGA CCGAGACGGC TGCCGTCGAT TTCCGGGTCC TGGAGCCGCC GACACGGCCG AGTTCGCCGT CCGCACCCGA TCGCCCCCTG CTTGCGAGCG GTGTCCTCCT GCTAGGCCTC GGCGCCGGCT CCGGACTGGC CTACCTGCTC GCTCAGTTGC GCGGCACCGT CACCTCCACC GCACGCCTGG CGGAGATCAC CCGTCGCCCC GTACTGGGCT CGGTCACCCG GGTCCCGACC CCGAACCGGC GGCGGCGTCA ACGGCTCGAG CTCATGATCT TTGCAGGGAT CCTCGGAACC CTCTTTGTGG CCTATCTGAT GGTGTTGGCG TACTACGGCG GCGGGGGGTT GTGGCCGTTC TAG
|
Protein sequence | MGLEQLYQHI IQQLRASWRR RWWLIPVAWA VCLVGWAYIN TLPDVYQSSS RVYVNSQTVL EPLLRGMTVR PDTEQRVRMM TVTLLSNDNL REIARQADLD VLLNQDNEQA LIGTLRGGIQ LDGGRRDNIY TIAFSHRDPE VAYRVVRETS NLFMERGLGD SRVDLASSQT FIERQLQRYA SQLQNKEAEL ESFKRENHSL LSAGGNYYTR LERARDALEQ AQLERDEHAQ RLETLQARLE KDRQSPIAED AHLSNPRLDQ RISRLESQLD EMRRHFTDAH PDVAQTRRIL KELEERRREE TRMALADPAR SVEGVVGSPL RLALVDAESH AASLETRVQE HKRRVENIAA LVDQVPAIES RFNALKRDHE VLQQSYRQLL TTRERAAMTG SVETETAAVD FRVLEPPTRP SSPSAPDRPL LASGVLLLGL GAGSGLAYLL AQLRGTVTST ARLAEITRRP VLGSVTRVPT PNRRRRQRLE LMIFAGILGT LFVAYLMVLA YYGGGGLWPF
|
| |
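The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the `GC content` field and the CDS boundaries could be checked. The helper names are hypothetical, and the start-codon set is only a subset of the initiators allowed by translation table 11. Note that the gene sequence in this record begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiator codon.

```python
# Assumed subset of table 11 initiator codons; the full table lists more.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(grouped: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    seq = clean_sequence(grouped)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    toy = "GTGGGTTTAG AGTAG"  # toy example, not the record's sequence
    print(looks_like_complete_cds(toy))
    print(round(gc_percent("GGCC AATT"), 1))
```

To apply this to the record, paste the full `Gene sequence` value as the argument; the result of `gc_percent` can then be compared with the reported 63%.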