Gene Hhal_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2022 
Symbol 
ID4710383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2225508 
End bp2226398 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content72% 
IMG OID639856495 
Productprepilin peptidase 
Protein accessionYP_001003588 
Protein GI121998801 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0169642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCCC ACCACCCAGC GCAGGCGCGG AGGACCATGG AGACCTGGAA TCACGTACCG 
GAGTGGATCA TCTGGAGCGG TGCCGGCCTG TTTGGCCTGC TGGTCGGCAG CTTCCTGAAC
GTGGTCGTCC ACCGCCTGCC GGCCATGCTG GAGCGGCGCT GGAGCCACGA GGCCCGGGAC
ATCCTCGGCA CCCCAAAACG GGGCACGCCG GAGCCGGCCT ACCACTTGGG CTGGCCTCCA
TCGCACTGCC CACAGTGCCA CCGGCAGCTG CGTCCGCGGG AGAATATCCC GCTGCTGAGC
TACCTCCTCC AACGCGGACG CTGCAGCGGC TGCGCCGCCC GCATCCCGGC CCGCTACCCG
ATCATTGAGG CGCTCACCGG GATCGCCACG GTGGCGGTGG TGGCCAGCCA CGGGCTCTCC
CCGCTGATGC TCGGCCCGCT GCTGCTGACC TGGGCCCTGA TCGCCGCGGC GGCCATCGAC
TACGAGCACT ACCTCCTGCC CGACGCCCTG ACCCTGCCGG CCCTCTGGCT GGGGTTGATC
TGGAGCGTCG TCGATCCCGG CCCCCCCACC CCCACCGATG CGATCATCGG CGCCGTGGCC
GGCTACCTGG CGCTGTGGGC CATCTTCCAC GGCCACCGGC TGGTCACCGG GCGCGAGGGC
ATGGGCTACG GCGACTTCAA ACTGACCGCC GCCCTGGGGG CCTGGCTGGG CTGGCAGGCC
CTGCCCGCCC TGGTGCTCTT CGCTGCCCTG ACCGGGCTCC TAGTGGCAAT AGTGCTGGCC
GTGCGCAGCC GCCCCCTGGG GCAGCCCCTG CCCTTCGGCC CCGCGCTGGC CCTGGCCGGC
TGGGTGCTTC TGGTCCTGTC CCCCTCCGGC GTGGCCTGGC AACTGGTATG A
 
Protein sequence
MGAHHPAQAR RTMETWNHVP EWIIWSGAGL FGLLVGSFLN VVVHRLPAML ERRWSHEARD 
ILGTPKRGTP EPAYHLGWPP SHCPQCHRQL RPRENIPLLS YLLQRGRCSG CAARIPARYP
IIEALTGIAT VAVVASHGLS PLMLGPLLLT WALIAAAAID YEHYLLPDAL TLPALWLGLI
WSVVDPGPPT PTDAIIGAVA GYLALWAIFH GHRLVTGREG MGYGDFKLTA ALGAWLGWQA
LPALVLFAAL TGLLVAIVLA VRSRPLGQPL PFGPALALAG WVLLVLSPSG VAWQLV