Gene Hhal_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1852 
Symbol 
ID4711269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2024066 
End bp2025421 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content67% 
IMG OID639856324 
Productuncharacterized membrane-anchored protein 
Protein accessionYP_001003418 
Protein GI121998631 
COG category[S] Function unknown 
COG ID[COG4949] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAGAA AGCCAGCCAG CATGACCCTG AACGTAACGG AAAGGCATCA GGCCAGTGAG 
CCCGTCCTGG GCGCGTATAA CGATCACCCC CTGCGCCTGG AGGTCAACGA CGAGATCCAC
GCCCGTCCCT ACGAGCGGCT GCGGGCGCCG GCGCGGGTCA GCCACCTGGG GATGTTCTCC
GGGCGCCACT CCGCCGATGC CGATCGGGCG GCAGTGGCCG ATCTCTGCGA GCGCTTCGGG
GTGGAGCCGC CGGAGCCGGA TGCCCAGCAC ATGTCCCGCG ACTTCGGGCC CTTCCGACTG
AAGTGGGAGC GGCGCACCGA GGTCTCGACC TACACCTTCA TCGTCGAAGA CGACTTCGAG
CACCCCTTCG AAGGGCCGCC GATCGAGCGG GTGCCCGAGG ATTGGCTGGC CAGTCTGCCC
GGGCCGCGGC TGGTGGGTAT CCACCTCGCC CTGGAACCGT CCTGGTCGCA GCCACGGACC
AGCAGCGGGA TCGTGGCGCT GTTCAACGAC AACACCATCG TCGGCAGCCA GATCGCCGGC
GGCGCAGCGC GGGTCTGGAC CGACTTCCGA ATCCACGAAG ACGGTTTCTC GCGCATCCTT
CTGCGCGATG TGAACATGCG CGAGCGCCAG GCGGGCCGTG CCATCCAGCG CCTGCTGGAG
ATCGAGACCT ATCGCATGAT GGCGCTGCTC GCCTTTCCCG TGGCGCGACG CCTCAACCCG
CAGCTCAATG AGCTCGAGGA GCAGCTCGCG AGCATTACCG AGCGCATGAC CGGTTCAACC
CGGGAGCGTG ACGAGCAGAA CCTGCTCAGC CAGCTGATGG AGCTGGCCGC GCAGGTTGAG
CGTATCGGCA ACAACACCAA CTACCGGTTT CACGCCGCGC GCGCCTACCA CGAGCTGGTC
GAGCGGCGCA TTCAGGAGCT GCGCGAGCAG CGTATCCAGG GGGTGCAGAC CATCCAGGAG
TTTATGGAGC GCCGCCTGGT TCCGGCCATG CGTACCTGCG ATACGGTGGC CGAGCGGCGA
GGGTCGCTCT CCGAGCGCAT CTCGCGCACC GGCGACCTAC TGCGCACCCG GATCGACGTC
GCCCTGGAGG CGCAGAACCG GGATCTGCTC GACTCCATGG ATCGCCGCGC TAAGCTGCAG
TTTCGCTTGC AGGAGACCGT CGAGGGGCTC TCCGTGGCGG CCATCAGTTA CTACCTGATG
GGCCTGATCA GCTATGTGTT GCAGGCGGCG CGGGACGCCG GCGCGCCCGT CCGCGTGGAG
ATCACCCAGG GCCTGGCGGT GCCGGTGGTG GTCTTCACCA TCTGGTTGGT GGTGCGCTTG
GTGCGCTACC GCATCGCCAA GCGCCAGGAG GGGTAG
 
Protein sequence
MQRKPASMTL NVTERHQASE PVLGAYNDHP LRLEVNDEIH ARPYERLRAP ARVSHLGMFS 
GRHSADADRA AVADLCERFG VEPPEPDAQH MSRDFGPFRL KWERRTEVST YTFIVEDDFE
HPFEGPPIER VPEDWLASLP GPRLVGIHLA LEPSWSQPRT SSGIVALFND NTIVGSQIAG
GAARVWTDFR IHEDGFSRIL LRDVNMRERQ AGRAIQRLLE IETYRMMALL AFPVARRLNP
QLNELEEQLA SITERMTGST RERDEQNLLS QLMELAAQVE RIGNNTNYRF HAARAYHELV
ERRIQELREQ RIQGVQTIQE FMERRLVPAM RTCDTVAERR GSLSERISRT GDLLRTRIDV
ALEAQNRDLL DSMDRRAKLQ FRLQETVEGL SVAAISYYLM GLISYVLQAA RDAGAPVRVE
ITQGLAVPVV VFTIWLVVRL VRYRIAKRQE G