Gene Hhal_0504 details


Gene Information

Locus tag: Hhal_0504
Symbol:
ID: 4710310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 572518
End bp: 573948
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 67%
IMG OID: 639854962
Product: flagellar hook-associated 2 domain-containing protein
Protein accession: YP_001002093
Protein GI: 121997306
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
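
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(572518, 573948)
print(length)                     # 1431
print(protein_length_aa(length))  # 476
```

Under these assumptions both values agree with the record: 1431 bp and 476 aa.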


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCTCAC CACTGGATCA GATGCCGAAT ATGCCGAGCC AGATGGACGT CGGCTCCGGC 
ATCGATACCA ACAAGATGGT CCAGGATCTA GTGCGAGCCG AGCGGGCTCC CACCGAGCAG
CGCTTGGATC GGCGTGAGCA AGAGCTCCAG GAGAAGCTCG AGGCCCTCGG GCAGATGCGC
GGGACCATCG GCGAACTGCA GGAGGCCGTG CAGGGGTTGG GGGATCCGAG TGCCTACTCC
GGCATCGACG CCGAGTCGAG CAATGCCGGT GTGGCGGCGG TCTCGGCCAG CGAAGAGGCG
CGCCCCGGTC AGTACGACGT GGAGGTCGAG CAGCTGGCGC GGACCCAGCG CCTCGCCACG
GCCAGCGGTG CCTTCGAGGA CAGCGCCGAT GCGGTGGGCA CTGGCCGGCT GGTGATCACC
GACGGCGAGG GCAACGAGCA GGCAGTGACC ATCGACGAGG AGTCGGGCAC GCTGCTCGGC
ATCCGCGACG CCATCAACGC CCAGGCCGAA GGGCTTCGCG CCTCGGTGGT GGACGACGGG
GCGGGGCCGC GACTGGCGAT CGCCACCGAG CAGACCGGCC GGGCGAACGC CATCGCCCAG
ATCCGTGCCG AGCAGGACCC GGAGGACGAT CAGGGCAACC TGTCAGCTCT GCAGTACAAC
GTCGCGGACC CCCAGAGTGG CGAGCCCATG GGCGCCTTTC AGGAGGTCCG GCCAGCCAGT
GATGCCGTTG TGACCATCGA CGGCATGCAG ATCACGCGAC CCGAGAACCG CATCGAGGGG
GCCATCGAGG GCGCCACCCT CAGCCTCAAG GAGGAGGGCC GCAGCCGCGT CTCCATCGAG
CAACAGACGG GGCTGGCCGA AGAGAACATC CAGCGTCTGG TGGACTCGTT CAACCAGGTG
CGGGCCCAGC TCAACCAGCT CTCCGACTAC GACCCCGAGG CCGAGAAGGC GGGTCCGCTG
CAGGGGGATC ACACCCTGCG TAACCTCCTC TCGCAGCTCA GTCGGGCCGT CAACGAACCA
GTGGAGGCGC TGGACGGGGC GCCCATTTCC TCCCTCGGCG ACCTCGGTGT GCGCACCAAC
CGTGACGGCA CCCTGGATCT TGATGGTGAG CGCATGCAGC AGATGGTCGG TGAGCACTCA
GAGCTGGTGA CGCGCATGAT GACCGACCCG GAGAGCGGGG TGATGTCGCG GCTTGAGGGG
GTGCTTGAGA ACGCCCTCGG CCGGGATAGC GTGATCGACA TGCGGACCGA CGGGGTCGAG
AGTCGGCTCG ACCGCATCGC CGATGATCGC GAGCGCCTGG ATCGGCGCAT GGAGCGGCGC
GAGGACCAGC TGCGGAGCGA GTTCTCGCGC ATGGACTCGA GGGTGGCTGA GCTCAATCAG
ACCTCGGAGT TTCTTGAGCA GCGCCTGGCT GCCATGAACA GCAGGGATTA A
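
The GC content field in the gene information can be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as text in 10-base blocks as shown; `gc_content` is a hypothetical helper, not a function of the source database.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_content("ATGGTCTCAC"))  # 50.0
```

Applying the same function to the full 1431 bp sequence gives a value that can be compared with the reported 67%.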
 
Protein sequence
MVSPLDQMPN MPSQMDVGSG IDTNKMVQDL VRAERAPTEQ RLDRREQELQ EKLEALGQMR 
GTIGELQEAV QGLGDPSAYS GIDAESSNAG VAAVSASEEA RPGQYDVEVE QLARTQRLAT
ASGAFEDSAD AVGTGRLVIT DGEGNEQAVT IDEESGTLLG IRDAINAQAE GLRASVVDDG
AGPRLAIATE QTGRANAIAQ IRAEQDPEDD QGNLSALQYN VADPQSGEPM GAFQEVRPAS
DAVVTIDGMQ ITRPENRIEG AIEGATLSLK EEGRSRVSIE QQTGLAEENI QRLVDSFNQV
RAQLNQLSDY DPEAEKAGPL QGDHTLRNLL SQLSRAVNEP VEALDGAPIS SLGDLGVRTN
RDGTLDLDGE RMQQMVGEHS ELVTRMMTDP ESGVMSRLEG VLENALGRDS VIDMRTDGVE
SRLDRIADDR ERLDRRMERR EDQLRSEFSR MDSRVAELNQ TSEFLEQRLA AMNSRD
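
The protein sequence follows from the gene sequence under translation table 11. The sketch below builds the codon table from the compact NCBI-style amino acid string for that table. It assumes an unspliced forward-strand reading frame starting at the first base, and it does not model alternative start codons; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acid string for translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGTCTCAC CACTGGATCA GATGCCGAAT"))  # MVSPLDQMPN
```

The final codons of the gene sequence (`GCC ATG AAC AGC AGG GAT TAA`) translate to `AMNSRD` followed by a stop, which matches the end of the protein sequence.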