Gene Hhal_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1011 
Symbol 
ID4709587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1083395 
End bp1084453 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content72% 
IMG OID639855482 
Productrare lipoprotein A 
Protein accessionYP_001002589 
Protein GI121997802 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGTCGC TGGCCACGGC ACTGCTCGGA CTCGCCGGCG GGGCGGCGCT GCTGGCCGGC 
TGCGCGGCAC CGGAACGCGA CGAGCTGGAG CCGCCACGAC CGGACGAGGA TCCTGCCGAG
CGGGTGGCCA CCGACGGGGC CTCGGACCGG GACCCCGAGG AGCTGGCCCG ACTCCCCGAC
GCCGTCCCGC AGGACGTGCC GCCGAGCCGC TACGGCAACC CCGAGCAGTA CGAGGTCTTC
GGCGAGACCT ACGCCACCAT GAGCCGGGAG GAGGCCGAGG GCTTCACCCA GCGCGGCCGC
GCCTCCTGGT ACGGCACCAA GTTCCACGGC CGGCGCACGT CCAGCGGCAC GCCCTACGAC
ATGTACGCCA TGACCGCCGC CCACCGCGAG CTGCCGCTAC CGACCTGGGT GGAGGTCGTC
AATCTGGAGA ACGACCGCCG CGCCGTGGTC AAGGTCAACG ACCGTGGCCC CTTCGTCGAC
CCCGACGAAC GCATCCTCGA CCTCTCCTAC GCCGCTGCAG TGCGCCTCGA CATCGCCGAC
CAGGGGACCG CACCGGTCCA GATCCGCGTG GTCACCCCGG ACGACCCGCC CCAGAGTCCG
GCGAAGGACG CCGATGCCGA GGCCGGCGAC CCACCGGCGG AGGGCGGTGT CGAGACCTTC
AGTGTGGACG ACGAGCAGGA GCCCGACGCC ATCGAGGAAC TGCTCGCCGC GCAGGAGGAG
GCCGACGCCG AACCCGGGGA CGAGACCCCC TCCAGCGAGC CGCCGGCCGG GGTCCGGGTC
CGCACCACCG ACCGACTGCT GCAGGACGCC GAGGCGGTCC AGCCCATCGC CGTCTACTTG
CAGACCGGGG TCTTCGGTCA GCGCGAGAAC GCCGAGCGGA TGGAGGCACG GCTCGCTGAC
CTCGACCTCG AGGCCGAGGT CAGCGTCGAG GAGATCGGCG ACGCCGACGG CTCCCTCCAC
CGGGTTCGCC TGGGCCCGCT GGAGAACCTC GCCGAGATCG ACCGAGTGGA GCAGGGTCTG
GACGAGGCGG GGATCGATCA CTATAGGGTC AGCCCGTAA
 
Protein sequence
MRSLATALLG LAGGAALLAG CAAPERDELE PPRPDEDPAE RVATDGASDR DPEELARLPD 
AVPQDVPPSR YGNPEQYEVF GETYATMSRE EAEGFTQRGR ASWYGTKFHG RRTSSGTPYD
MYAMTAAHRE LPLPTWVEVV NLENDRRAVV KVNDRGPFVD PDERILDLSY AAAVRLDIAD
QGTAPVQIRV VTPDDPPQSP AKDADAEAGD PPAEGGVETF SVDDEQEPDA IEELLAAQEE
ADAEPGDETP SSEPPAGVRV RTTDRLLQDA EAVQPIAVYL QTGVFGQREN AERMEARLAD
LDLEAEVSVE EIGDADGSLH RVRLGPLENL AEIDRVEQGL DEAGIDHYRV SP