Gene Hhal_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0517 
Symbol 
ID4709646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp589419 
End bp590675 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content65% 
IMG OID639854975 
Productflagellar basal body FlaE domain-containing protein 
Protein accessionYP_001002106 
Protein GI121997319 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTCA ACATCTCGCT TACCGGGATC AACTCCGCCT CCAAGGATCT GGAGACCACC 
AGTAACAACC TGGCCAACGC CGGTACGACG GGCTTCAAGG AGTCCCGGGC CGAGTTCAAT
GACCTCTTCG CCATGGGGCC GATGGGTATC CCGCAGCTGG CTGTCGGACA GGGCTCGCGG
CTGGCCAATG TCGGCCAGAT GTTCAGCCAG GGGTCCTTCG ACTTCACGGA GCGCAGTCTC
GACCTGGGCA TCGAGGGGCG CGGCTTCTTC CGCATGGAGG ACGATGGCGA GGTGAGCTAC
ACCCGGGCCG GTCAGTTCGA GGTGGACCGT GACGGCTACA TCGTCAACAA CACCGGTAAG
CGCCTGACCG GTTTCCAGAC CGACGAGGAC GGCAGTCGCA TCGGCGATGG TCGCGACCAG
CTCCAGCTAC CCACCGACGG CATTCCGGCC CGGGCTAGCG AGAACGTCGA GATTGCGGCC
AATCTCAGCG CGGACGCCGA CGTGATCGAC GAGGGGGTGG CCTTCGATCC GGACGACAAC
GAGACCTTCA CTGAGTCCAC AACGACCACG CTCTACGACT CCCAGGGATC GGCCCGGGAT
GCGACCTTCT ACTTCCGCAA GGTCGGCAAT AACGAGTGGG ACGTCTACAC CCAGGTCGAC
GGGGTGGACT ACGAGCAGGC CGATACCGAG GGCGACTACT TCGGCCCGCA CCGGCTCTCC
TTTGATACCT CCGGGTCCCT GGTCGATGCC GAGGGCGATG ACGAAGGGCG GATCGCCAGC
CTCGAGGATG TCCCGCTGCT CGCCGAGGTG GACGACCTGG ACCTCGACAT CGACTTCGGT
GAGATGACCC AGTTCGCCCG GCCGTTCAAC GTCACCAACG TCTCCCAGGA CGGCTACGCC
GCCGGGGAGT TCGAGAACAT CAACGTCGAG GGCGACGGCA CCATCCTGGC CCGTTACAGC
AACGGCGAGG CCCAGGCCGT GGGTCAGGTG GCGCTGACCA GCTTCCCGTC GGAGGAGAAG
CTCCAGTCCG TCGGTGAGAC CTCCTGGCAG GCCACCCGCG ACGCCGGCGA TCCGCTGATC
GGTGTCCCCG GCCAGGGGCA GTTCGGTCGG GTGGAGAATG GCGCCCTGGA GCAGTCCAAC
GTGGAGGTCT CCGATCAGCT GGTGAACATG ATCACCGCGC AGCGCAACTT CTCCGCCAAT
GCGCAGATGG TCAGCACCCA GGACCAGGTG ACCCAGGAGA TCCTCAACAT CCGCTAA
 
Protein sequence
MSFNISLTGI NSASKDLETT SNNLANAGTT GFKESRAEFN DLFAMGPMGI PQLAVGQGSR 
LANVGQMFSQ GSFDFTERSL DLGIEGRGFF RMEDDGEVSY TRAGQFEVDR DGYIVNNTGK
RLTGFQTDED GSRIGDGRDQ LQLPTDGIPA RASENVEIAA NLSADADVID EGVAFDPDDN
ETFTESTTTT LYDSQGSARD ATFYFRKVGN NEWDVYTQVD GVDYEQADTE GDYFGPHRLS
FDTSGSLVDA EGDDEGRIAS LEDVPLLAEV DDLDLDIDFG EMTQFARPFN VTNVSQDGYA
AGEFENINVE GDGTILARYS NGEAQAVGQV ALTSFPSEEK LQSVGETSWQ ATRDAGDPLI
GVPGQGQFGR VENGALEQSN VEVSDQLVNM ITAQRNFSAN AQMVSTQDQV TQEILNIR