Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2022 |
Symbol | |
ID | 4710383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2225508 |
End bp | 2226398 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639856495 |
Product | prepilin peptidase |
Protein accession | YP_001003588 |
Protein GI | 121998801 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0169642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGCCC ACCACCCAGC GCAGGCGCGG AGGACCATGG AGACCTGGAA TCACGTACCG GAGTGGATCA TCTGGAGCGG TGCCGGCCTG TTTGGCCTGC TGGTCGGCAG CTTCCTGAAC GTGGTCGTCC ACCGCCTGCC GGCCATGCTG GAGCGGCGCT GGAGCCACGA GGCCCGGGAC ATCCTCGGCA CCCCAAAACG GGGCACGCCG GAGCCGGCCT ACCACTTGGG CTGGCCTCCA TCGCACTGCC CACAGTGCCA CCGGCAGCTG CGTCCGCGGG AGAATATCCC GCTGCTGAGC TACCTCCTCC AACGCGGACG CTGCAGCGGC TGCGCCGCCC GCATCCCGGC CCGCTACCCG ATCATTGAGG CGCTCACCGG GATCGCCACG GTGGCGGTGG TGGCCAGCCA CGGGCTCTCC CCGCTGATGC TCGGCCCGCT GCTGCTGACC TGGGCCCTGA TCGCCGCGGC GGCCATCGAC TACGAGCACT ACCTCCTGCC CGACGCCCTG ACCCTGCCGG CCCTCTGGCT GGGGTTGATC TGGAGCGTCG TCGATCCCGG CCCCCCCACC CCCACCGATG CGATCATCGG CGCCGTGGCC GGCTACCTGG CGCTGTGGGC CATCTTCCAC GGCCACCGGC TGGTCACCGG GCGCGAGGGC ATGGGCTACG GCGACTTCAA ACTGACCGCC GCCCTGGGGG CCTGGCTGGG CTGGCAGGCC CTGCCCGCCC TGGTGCTCTT CGCTGCCCTG ACCGGGCTCC TAGTGGCAAT AGTGCTGGCC GTGCGCAGCC GCCCCCTGGG GCAGCCCCTG CCCTTCGGCC CCGCGCTGGC CCTGGCCGGC TGGGTGCTTC TGGTCCTGTC CCCCTCCGGC GTGGCCTGGC AACTGGTATG A
|
Protein sequence | MGAHHPAQAR RTMETWNHVP EWIIWSGAGL FGLLVGSFLN VVVHRLPAML ERRWSHEARD ILGTPKRGTP EPAYHLGWPP SHCPQCHRQL RPRENIPLLS YLLQRGRCSG CAARIPARYP IIEALTGIAT VAVVASHGLS PLMLGPLLLT WALIAAAAID YEHYLLPDAL TLPALWLGLI WSVVDPGPPT PTDAIIGAVA GYLALWAIFH GHRLVTGREG MGYGDFKLTA ALGAWLGWQA LPALVLFAAL TGLLVAIVLA VRSRPLGQPL PFGPALALAG WVLLVLSPSG VAWQLV
|
| |