Gene Hhal_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2021 
Symbol 
ID4710382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2224301 
End bp2225491 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content77% 
IMG OID639856494 
Producttype II secretion system protein 
Protein accessionYP_001003587 
Protein GI121998800 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.159835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCCGCC TGAGCTGGCG GGGCCTCGAC GCTGGCGGAC GGCGCCTCTC CGGCACCTGC 
CACGCCGACT CGCCAAGCGC CGTACATCAC GCCCTGGCCG AACAGGGCGT GGCGGTGACC
GCCGTCCGGC GGGAGCTATG GCGACCGCGC CGGCGGCAGC CGGGCAGCGC CCGACGCGCC
GCCATCCTGC GCCGGCTGGC CTCGGTCCTG GAGGCTGGGG CGCCGCTGAG CGAGGCGCTG
CGGGTCACCG CGGCCCAGGC CCCGGACGCC GCCCTGCGCA ACGGTCTGCG CGGGGTGCGC
TACGCGGTCG AGCGCGGCAC CGACCTGGCC ACCGCCTTCG GCACTGAGTT CCCCGGTCTG
CGCCCGGCCC ACCGCGCCCT CCTGGCGGCC GGCACCTGGA CCGGCGACCT GCCCGCGGCC
CTGGGCAGTG TCGCCGCCGA GATCGAGCGC GAGGCGGCCA TCGTCGCTCA GCTGCGCCGC
GCCCTGACCT ACCCGGCGGT GGTCGCCGGC GCCGCCCTGA CCCTGATCGC CCTGCTGCTG
ACCGCCGTGG TCCCGCGTTT CGCCGGGCTG TTCGAGCAGA GCGGTGAGCC GCTGCCCGCC
CCGACGCGGG CCGTCCTGGC TGCTTCGGAG GGGTTCGCCG TGGTGGCGCC GGCGACCCTG
CTCCTCGGTC TGGTCACCGG CATCGGGCTG ACGGCGGCCC TGCGCCGTCG CCCCGCCTGG
CGCCGGCACG CCGCCGCCGG GCTGGCCCGG ATGCCGTGGC TTGGCACTCT GCTCCTGGAG
GCCGCCCTCA GTCGCTGGTC GGCCACCCTG GCACGCCTGC ACGGGGCCGG GGTGCCCCTG
CTCGACGCCC TGCCCCGCGC CGCGGAGGCG GCCCGCGGGG CCGACCTGGA GCCGCGACTG
GCCCACCTCG GCCAGCGCAT TGGCGCCGGC GAATCCCTGG CCGAGGCCCT GCGAAAGAGC
CTCCCCGAGT CCCGGGAGAT CAGCCAGCTG GTCGCCATCG GCGAGCGCAG CGGGCGGCTC
GAGGAGCTGC TCCACGAGGC CGCTACGCTG CATCAGCAAC GCCTCGAGGC CCGCTTGCAG
CGCGCCGGCG CGCTGCTCGA GCCGGCCCTG ATCCTGCTCC TCGGGGCGAT CACTGCCGGG
GTGGTCGCGG CCCTCTACCT GCCCGTCTTC CGCATGGGCG CGACACTCTA A
 
Protein sequence
MARLSWRGLD AGGRRLSGTC HADSPSAVHH ALAEQGVAVT AVRRELWRPR RRQPGSARRA 
AILRRLASVL EAGAPLSEAL RVTAAQAPDA ALRNGLRGVR YAVERGTDLA TAFGTEFPGL
RPAHRALLAA GTWTGDLPAA LGSVAAEIER EAAIVAQLRR ALTYPAVVAG AALTLIALLL
TAVVPRFAGL FEQSGEPLPA PTRAVLAASE GFAVVAPATL LLGLVTGIGL TAALRRRPAW
RRHAAAGLAR MPWLGTLLLE AALSRWSATL ARLHGAGVPL LDALPRAAEA ARGADLEPRL
AHLGQRIGAG ESLAEALRKS LPESREISQL VAIGERSGRL EELLHEAATL HQQRLEARLQ
RAGALLEPAL ILLLGAITAG VVAALYLPVF RMGATL