Gene Hhal_0226 details


Gene Information

Locus tag: Hhal_0226
Symbol:
ID: 4709289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 260467
End bp: 261507
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 69%
IMG OID: 639854685
Product: OmpA/MotB domain-containing protein
Protein accession: YP_001001822
Protein GI: 121997035
COG category: [N] Cell motility
COG ID: [COG1360] Flagellar motor protein
TIGRFAM ID:
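
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with each other. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (variable names here are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 260467
end_bp = 261507

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the last codon is assumed to be a stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1041 346
```

Both values match the Gene Length (1041 bp) and Protein Length (346 aa) fields.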


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.174216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGAGG AGAACAAGAC CCGCCCGGTC GTCATCAAGA AGGTGGCGAA GCACGGCGAC 
GATCACCACG GTGGCTCGTG GAAGATCGCC TTCGCTGACT TCATGACGGC GATGTTCGCC
ATCTTCCTGG TCCTGTGGCT GCTGCTCGCC CTGGATGACG ATCAGCGCCA GGGCATCGGG
CAGTACTTCC GCGACCCGCA GGCGGCGCAT CCGCCCGCCT CGCGGGATAT CATCGACTTT
GAGGGCGAGC GCCGCGCTCC CATTGATCTC AGCGGGCTGC CCATGGGTCA GGGCGGTTTT
ATCCCCACCG AAGAGATGCA GGAGCTGGCC GAGCAGTTCC AGGACGCCGT GCTTGACGAC
CCGGACCTGG CCGAGTACGC CGATCAGATC CTGCTGGAGA TCACCGACGA CGGGCTGCGC
ATCCAGCTTG TCGACCACGA CGGGCGGCCG ATGTTCGAGC TCGGCAGCGC CGACCCCAGG
GAGCATACCG AGGAGATCCT CCGCGCCCTG GCCCGGGTGC TGGAGGATGT GCCGAATCCG
GTCTCGCTCT CCGGCCACAC CGACGCCCGA CCCTTCGCCC GCGACGATTA CGACAACTGG
TCGCTGTCCA CCGATCGGGC CAACGCGGCC CGGCTGACCC TGCTCGACGG CGGGTTGCCC
GCCGAGCGCA TCGGTCAGGT GGTCGGCTAT GCCGATACCG TACCCTTCGA CCCGGACGAC
CCCCGCGCCG ATATCAATCG CCGGATCTCC GTGGTGCTGC TCAGCCGCGA GGCGGTGCAG
GGCATCGCCG AGCGCGAGCG GCGCATCGAC CCCGATGAGC AGACTCTCGA CGCCCTGCCG
CGCCGTCCGC GGGAGCTGCT CACCCCGGAG GAGCGGCGGA TCGAGGAGGG GCTCGATGAG
GTCGAGGAGA CCGTCCCCGA GACCGGGGAC GAGGCGCCGG AGGCCGACGA CGCGGCCGAG
GAAGCGCCGG ACGACGACGA GGCGGCCCCG GAGGTGGAGA TGCCCGACCT GGAGCCGCCG
GCCGAGCCGG AGACCTGGTA A
 
Protein sequence
MVEENKTRPV VIKKVAKHGD DHHGGSWKIA FADFMTAMFA IFLVLWLLLA LDDDQRQGIG 
QYFRDPQAAH PPASRDIIDF EGERRAPIDL SGLPMGQGGF IPTEEMQELA EQFQDAVLDD
PDLAEYADQI LLEITDDGLR IQLVDHDGRP MFELGSADPR EHTEEILRAL ARVLEDVPNP
VSLSGHTDAR PFARDDYDNW SLSTDRANAA RLTLLDGGLP AERIGQVVGY ADTVPFDPDD
PRADINRRIS VVLLSREAVQ GIAERERRID PDEQTLDALP RRPRELLTPE ERRIEEGLDE
VEETVPETGD EAPEADDAAE EAPDDDEAAP EVEMPDLEPP AEPETW
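
The GC content and protein sequence reported in this record can in principle be re-derived from the gene sequence. The following is a rough sketch only: the helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meaning). The example input is just the first three 10-base blocks of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, as blocked in the record.
first_block = "ATGGTCGAGG AGAACAAGAC CCGCCCGGTC"
print(translate(first_block))  # MVEENKTRPV
```

Passing the full gene sequence to gc_fraction and translate would allow a comparison with the GC content and Protein sequence fields; the sketch assumes the sequence contains only A, C, G and T.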