Gene Hhal_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1839 
Symbol 
ID4711380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2011853 
End bp2012983 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID639856310 
Producthypothetical protein 
Protein accessionYP_001003405 
Protein GI121998618 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0767] ABC-type transport system involved in resistance to organic solvents, permease component 
TIGRFAM ID[TIGR00056] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGATC CGGGCCACCT GGAGTGCCGG GCGGGCGCGG CGGCCCCCGA GGTGCGTATC 
CGCGGAGACT GGACCCTGGC CCACTACCGC ACGCTCCTGC GGCAGGTGGA GCAGACCCGC
CTGGGCGGCG AGCCGCGCGT CGACCTCAGC GCCCTGGGCC GGCTCGACAC CGCCGGGGCG
ACGCTACTCG CCCGCCTGCT CGGAGCAGCC CGCGTCGAGG CCCTGGCGCG CTCCACGCCG
GAGCTCTCCG CCGAGCGGCG CAAGCTGCTC CAGGCGGTCG CCGCCGCCAG CGAGCGCGCC
CGCACCGCAC CGGAACCGCG CGGCGATCCC TACTTCCTCG GTCAGCTCAC CATCGGCCTG
GGGGCACAGA TGACCCACGC AGGCCATCAG CTGGCCCGGG CCATCGGTTT CACCGGCCTG
GTCATCGCGG CATTCGCCGC CGGCCTGTTG CGCCCCTGGC GCTGGCGCCT GGCCGCCGTC
TCCCGGCAGC TCCAGCACAC CGCCCTGGAG GCGCTGCCCA TCGTCGCCCT GCTGACTTTC
GCGGTGGGGG CCGTCATCGC CGTCCTCGGG GTCACCGTAC TGGGCCGCTT CGGCGCCGGC
ATCTTCACGG TGGACCTGGT GGCCTACGCC TTCCTGCGCG AATTCGGCGT GGTCCTCACG
GCGATCCTCC TCGCCGGACG CTCGGCCAGC GCCTTCACGG CGCAGATCGG CTCGATGAAG
GCCAACGAGG AGCTCGACGC CATGCGCGCC CAGGGATTCA GCCCCATCGA GATGCTGGTC
ATCCCGCGGG TCGTGGCACT ACTGATCGCC GTGCCACTGC TCTCCTTCGT GGCCGTGGTC
TGCGGCCTGG CCGGCGGTGG ACTGGTCACC CTGCTCAACG TCGACGTCCC GGCGGGGCGG
ATCATAGCCC TCTACAGCGA CATCTCCGTC AGCCACTACC TGGCTGGACT GGCCAAGGCA
CCGATCTTCG CCTTCGTCAT CGCCATCATT GGCTGCCTGG AGGGGATCAA GTGCAGCGCC
AGCGCCCAGT CGGTGGGCAC GCACACGACC TCCGCGGTGG TCCAGTCGAT CTTCTGGGTC
ATCATCCTCA ACGCTGTGGC CGCCCTGATC TACGTGGAGC TGGGATGGTG A
 
Protein sequence
MADPGHLECR AGAAAPEVRI RGDWTLAHYR TLLRQVEQTR LGGEPRVDLS ALGRLDTAGA 
TLLARLLGAA RVEALARSTP ELSAERRKLL QAVAAASERA RTAPEPRGDP YFLGQLTIGL
GAQMTHAGHQ LARAIGFTGL VIAAFAAGLL RPWRWRLAAV SRQLQHTALE ALPIVALLTF
AVGAVIAVLG VTVLGRFGAG IFTVDLVAYA FLREFGVVLT AILLAGRSAS AFTAQIGSMK
ANEELDAMRA QGFSPIEMLV IPRVVALLIA VPLLSFVAVV CGLAGGGLVT LLNVDVPAGR
IIALYSDISV SHYLAGLAKA PIFAFVIAII GCLEGIKCSA SAQSVGTHTT SAVVQSIFWV
IILNAVAALI YVELGW