Gene Hhal_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2370 
Symbol 
ID4709225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2604072 
End bp2606084 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content68% 
IMG OID639856845 
Productgeneral secretion pathway protein D 
Protein accessionYP_001003935 
Protein GI121999148 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02517] general secretion pathway protein D 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGGTC AATGTTTCCC TGATTCGCGG ACGGAACGGG GTACTCAGTC AGCCCGCCCC 
CTCCTGCCCC GCATCCTGTT CATCGCGGCA CTGCTCGCGC CGATGCTCGT TGTCGCCGAC
GATGACGCCT GGGCGGAGCG CGGCGACGGC ATTACGTTGA ACTTTGAGGA TGCGGATATC
AAGTCGGTGA TTGCGCTCGT CTCCGAGCGC ACCGGGCGGA ACTTTGTGGT TGACCCCCGG
GTCCGCGGCG AACTCACCGT CATCTCCAGC CAGCCGGTGG ACGACGACCA GCTCTACCAG
GTCTTCCTCT CCGCCCTACA GATCCACGGC TTCGCCGCCA TCCCCGCGGA CGGGGCCATC
CGCATCGTGC CCAAGAACAT CGCCAAGCGC GACCAGACCC CCGTCCCCGA GCCCGGCACT
GCGGAGAGCG GCCACAACTT CGTCACCCAG GTCATTCCCA TTGAGCACGT CGAGGCCTCG
GAGCTCATTC CCCTTCTGCG ACCGCTGATC TCGGACGAGG CCGAACTAGC CGCCTACTCG
GAGACCAACA CGCTGATCGT CTCTGAGACC GCTGGCAACA TCGGGCGGCT CAAGCGCTTG
ATCGACCGCG TCGACCAGGA CACCACCGGC GTCACGGAGG TGGTCCCCCT TGAACACGGC
TCGGCGAGCG AGATCGTCGA GATGGTCGAG GCCATCGAGC CGGAGAAGCG CGCCGGCCGC
CGCCTACTGC TTGCCGCCGA CGACCGAAGC AACAGCGTTC TGGTCGGCGG CGACCCGGCC
CGGCGACCCA GCGTCATGGA GCTGATCCAG CGCCTCGACG CCGAGCTCGA GGACGAGGAA
GGCGCTGCAG TGATCTACCT GCGCTATAGC GATGCCGAGA GCATCGTCCC CATCCTGGAG
GGCATGGCCG AGGGGATGGC GCGCGGCCCG GAGGGTGAGA CCGGCGTCAG CATCCACGAC
CACGAGGCGA CCAACGCCCT GATCATCAAT GGCCCGCCCG ATCTGGTCGC CAAGCTGCGC
GGGGTAGTGA ACCGGCTCGA CGTGCGTCGA GCACAGGTCC TGGTGGAGGC GATCATCGCC
GAGGTCTCCG CCGAGCGCAG CCAGGAGCTG GGCATTCAGT GGGGCGCCCT CGGCGATCAG
GGGGTGGGCT TGGTCAATTT CGACGCGGCC GGAGGCGGAT CTGTGACCAA CATCGGGCGC
GCAGCGGCGG GAGGCACCGA TGCGCTGGGT AACCTGTCCC TCGGATCCGG CCTCACCGCC
GGCGCCGCCA CCCGCGGCGG CGAGCTGGGT GTGCTGCTAC GGGCGCTCTC CAGCGAATCG
GACAGCAACA TCCTCTCGAC CCCCTCGGTG ATGACCATGG ACAACGAGGA GGCGGAGATC
GTCGTCGGTC AAAACGTCCC CTTCGTCACC GGGCGCGAGG TCGGCGACAC CCGAGACTTC
CAGTCGATCC AGCGCGAGGA CGTCGGCGTG CAGCTGCGCA TCCGCCCCCA GATCAACGAG
GGCGACAGCC TCAAGCTGGA CATCGAGAAG GAGGTCTCCG ACGTCCAGGA GCGGGGCGAG
GCGGAGGATA TCGTCACCAG CATGCGCTCG ATCACCACCA GCGCCATGGT CGACGACGGC
GAGATCATGG TCCTTGGCGG CCTCATGGAC GAGCAGGCCG AGAGCCAGAC GGACCGGGTC
CCGGGCCTGG GCAGCATCCC GGGCCTCGGT TGGCTCTTCC GCTACGAGAG CAGCGCGGCA
CAGAAGCAGA ACCTGATGGT CTTCCTGCGC CCGCGGATCA TCGAGAACCG CGACGATGCC
CGGGAGCTGA CCAGCCCCAA GTACAACCTG ATCCGCAATC GTCAGCTCGC CTCCCGGGCC
CGCGGCATGC GCTTCCTCGA CGACGAAGAC ATCCCCGTTC TGTCGCAGCG GCGGGCGTTC
ATGGAGCTGC CGCCGGAGTT CGGGGATCGC GCCGGGGCGC CGCGACAAAG CGGCAACCTC
GACGCCCCCC CGCGCCGGCC GGACCTGTTC TAG
 
Protein sequence
MHGQCFPDSR TERGTQSARP LLPRILFIAA LLAPMLVVAD DDAWAERGDG ITLNFEDADI 
KSVIALVSER TGRNFVVDPR VRGELTVISS QPVDDDQLYQ VFLSALQIHG FAAIPADGAI
RIVPKNIAKR DQTPVPEPGT AESGHNFVTQ VIPIEHVEAS ELIPLLRPLI SDEAELAAYS
ETNTLIVSET AGNIGRLKRL IDRVDQDTTG VTEVVPLEHG SASEIVEMVE AIEPEKRAGR
RLLLAADDRS NSVLVGGDPA RRPSVMELIQ RLDAELEDEE GAAVIYLRYS DAESIVPILE
GMAEGMARGP EGETGVSIHD HEATNALIIN GPPDLVAKLR GVVNRLDVRR AQVLVEAIIA
EVSAERSQEL GIQWGALGDQ GVGLVNFDAA GGGSVTNIGR AAAGGTDALG NLSLGSGLTA
GAATRGGELG VLLRALSSES DSNILSTPSV MTMDNEEAEI VVGQNVPFVT GREVGDTRDF
QSIQREDVGV QLRIRPQINE GDSLKLDIEK EVSDVQERGE AEDIVTSMRS ITTSAMVDDG
EIMVLGGLMD EQAESQTDRV PGLGSIPGLG WLFRYESSAA QKQNLMVFLR PRIIENRDDA
RELTSPKYNL IRNRQLASRA RGMRFLDDED IPVLSQRRAF MELPPEFGDR AGAPRQSGNL
DAPPRRPDLF