Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2255 |
Symbol | |
ID | 4709143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2477026 |
End bp | 2478144 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856731 |
Product | type II and III secretion system protein |
Protein accession | YP_001003821 |
Protein GI | 121999034 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4796] Type II secretory pathway, component HofQ |
TIGRFAM ID | [TIGR02515] type IV pilus secretin (or competence protein) PilQ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.936423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGTC GACTGCGATC CGGAGCGGCC GCGTTGCTAT TGATCGCTCT GCCTGTCTTC GCGGCCCCAC CTGATCCGTT TCAGGGTCGG GTGGACGAGG AGGGCGCAAC CCGGCCGGAG CCCGCCCTGC GCACCCTGCC CCTGGTCCAC GCCGAGGCTG CGCAGCTCGC GGAACTCCTG CGCGACGAGA CCGGCGTGCT CTCAGAGAGC GGCCACGCCA GCGTCGATGA ACGCACCAAC ACGCTGATCC TCCAGGACAC GCCGGCCCGC CTGGAAGAGG CCGAACAGCT CGTCCGCGAA CTCGATCGGG CCAACCGTCA GGTCATGATT GAGGCGCGCA TCGTGTTGGC CTCGGGGGAG TACTCCCGGG AGTTGGGCAG CCGACTGGGG CTATCCAGCG AACGGGGCGA CGGCCACTTC GCCTTCGACA GCCTGCCCGG GAGCAACGAC GCGGCTGTGG GCGACGGGCT TCTTGCCGAG CTGCCGCCGG TGGGCGAAGG CGCGCGGCTG AGCGTCTCAG TTGGCGAGGT GGGCGAGCGG CTGCTACAGC TCGAACTCTC GGCGATGGAG GCCGAGGGAC ACGGGCGCGT GGTCTCCAGC CCGCGGGTCC TGACCACCGA GCGCCAGGCC GCCCGCATCG AGCAGGGCGT CCAGATCCCC TACCAGGAGA CCGCCGAGTC CGGTGCCACG GCGGTCGCCT TCCAGGACGC TGCCCTGTCC CTGACCGCCA CCCCGCAGGT GACCGAGGAC GACGCAGTCT CCCTGGCCCT GCGCGTCACC AAGGACGCCG TCGGCCAGAT CTACGAGGGG GTTCCCAGCA TCGACACCCA GGCCGTGACC ACGCACCTTC GAGTCCAGGC CGGTGAGACC ATCGTCCTCG GCGGAGTCCG CGAACACGAA CAGCGCAAGC AGCGCCAGAG GGTCCCGTGG CTCGGAGAAC TGCCTCTCAT TGGCTGGCTG TTCCGCCAGC GCATGTCCGA GCAGAGCCAC CATGAGCTGC TGGTCTTCGT CACCCCGCGC CTGGTCGAGG ACGGCCCGTC CTCATCGACG GCCGTTGACG ACGCCGCCCC CCGCCCCCAC ATTGACTCAC CCAGCCGAAC AACCGAGGAC GATTGGTGA
|
Protein sequence | MSGRLRSGAA ALLLIALPVF AAPPDPFQGR VDEEGATRPE PALRTLPLVH AEAAQLAELL RDETGVLSES GHASVDERTN TLILQDTPAR LEEAEQLVRE LDRANRQVMI EARIVLASGE YSRELGSRLG LSSERGDGHF AFDSLPGSND AAVGDGLLAE LPPVGEGARL SVSVGEVGER LLQLELSAME AEGHGRVVSS PRVLTTERQA ARIEQGVQIP YQETAESGAT AVAFQDAALS LTATPQVTED DAVSLALRVT KDAVGQIYEG VPSIDTQAVT THLRVQAGET IVLGGVREHE QRKQRQRVPW LGELPLIGWL FRQRMSEQSH HELLVFVTPR LVEDGPSSST AVDDAAPRPH IDSPSRTTED DW
|
| |