Gene Hhal_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0104 
Symbol 
ID4709659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp120005 
End bp121528 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content69% 
IMG OID639854562 
Productsporulation domain-containing protein 
Protein accessionYP_001001700 
Protein GI121996913 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3267] Type II secretory pathway, component ExeA (predicted ATPase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCGC AAACCGGCAC CCCGCTCACT CTGCCCGAGG CGGCCCTGTA CCGGCTCGGG 
CTCACCGCGC AGCCGTTCGT CGGCACCCCC GAGCCCCCCT TCGAGGACAG TGCTCGCGTC
ACTCAGCTGA ACGTCACGCT GAGCCTGCTG CAGAGTGGCG AACGCATCGT GCTGATCAAC
GGCGATGCGG GCCTGGGTAA GAGCACATTC CTCTGCCGCC TGGCGGCGCT GCAGCCCCCC
GGCCTGAGCA TGCAGCGGGT CGATGGCCAC AACGCCGGTC TCGACACTCT GTGGGGGGCT
CTGGTGGCCG CCGCTGAAAG CGAAGAAGGC GCGCAGGCAT CCAGAACCCG CGAACAAGCC
CTGAACTACG TCCGCAGCGC ACGCCGCGGC GGCATCCGCC CCGCCCTGCT CCTGGACGAT
GTCGACGCCC TGCCACCCGG GCACATCGAA GAACTGCTCG AGCTCTGGGC CGAGCTCAGC
CAGGACGACG AAGCCTTCAG CCTGGCCATG GCCCTCGACC CGGGGGAGCT ACAGAAGCTC
CCGGCGACGC TCTCAGACGA GCGCTTCCAC ACCACGACCC TGTATCCACT CGACCAAGAG
CAGACGGCCG CCTATCTCGA CCACCGCGTC CGGAGCGCCG GCGCCGAGCA GGCCCTCTTT
GACCGCGAAA CGGTTCGGGA GATCTTTCAC CGCTCGGGCG GACACCCGGA GCGCATCAAC
GAAGAGGCGC ATCGACGCCT GACCGCTCGC CTGGCCAACC CCGGCGAGGT GCCGCCGGCA
CCCCGACCCG TACTGCCGGC ACACCCCGGC CGGCGAGGGC TCCGGTGGGC TCTCGCGGGG
GTCAGCAGTG TCGCCGCGGC AGTGGCTGGC ACGTACTGGT TGATTGCCCA CCAGCTGCCG
AGCGGACCGT CGACGGATGA ACTCGCCATC GAAGATTTCG AGGAACCCGA GGAGGAGACG
GTAGCGGCGG AGAGCGACGA GATCGAACCG GCCAGCGACA CCCCGTTCGG GCTGGAGTTG
CCGGGGCGCT ACAGCTTCCG CGATGACCAC GAGGAGGCGG ACACCCCCTC CACGCCGGAC
CAGCCCACCG AGACCCTGCA ACTCCTCGAG ATACCTTCCG GCCCCGCCCA AGAGCCCTCA
CCCACCGAGG ACGCCCCGAC GATCGACCAT GAGGCGGACG AGGAAGCCGC GCCGACCGCG
GAGCAGGATG AGGATGATTG GGCTGCCGGC GCGGCCTGGG TCCTGAATGA GGAGGCGGAT
CGCTACACCA TCCAGGTACT GGCGGCCTCC CAGCCCACCA CCCTGGAGGG CTATGCGGAG
CAACACGACC TGGGCGATCC GACCCACGTG GTCTCCACCG AGCGGGGAGA CGGCGACCCC
TGGTTCCTGC TGCTGCACGG TTCCCACGAG GACCGCGAGG CCGCTCACGC GGCCCTGGCG
GCGTTGCCCG AGGAGATCGC CGAGCGCGGG GCGTGGGTCC GCTCTTTTGA GTCGGTGGAA
GAGGGCCTGG TAACCGACGA CTAA
 
Protein sequence
MAPQTGTPLT LPEAALYRLG LTAQPFVGTP EPPFEDSARV TQLNVTLSLL QSGERIVLIN 
GDAGLGKSTF LCRLAALQPP GLSMQRVDGH NAGLDTLWGA LVAAAESEEG AQASRTREQA
LNYVRSARRG GIRPALLLDD VDALPPGHIE ELLELWAELS QDDEAFSLAM ALDPGELQKL
PATLSDERFH TTTLYPLDQE QTAAYLDHRV RSAGAEQALF DRETVREIFH RSGGHPERIN
EEAHRRLTAR LANPGEVPPA PRPVLPAHPG RRGLRWALAG VSSVAAAVAG TYWLIAHQLP
SGPSTDELAI EDFEEPEEET VAAESDEIEP ASDTPFGLEL PGRYSFRDDH EEADTPSTPD
QPTETLQLLE IPSGPAQEPS PTEDAPTIDH EADEEAAPTA EQDEDDWAAG AAWVLNEEAD
RYTIQVLAAS QPTTLEGYAE QHDLGDPTHV VSTERGDGDP WFLLLHGSHE DREAAHAALA
ALPEEIAERG AWVRSFESVE EGLVTDD