Gene Hhal_0507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0507 
Symbol 
ID4709960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp576316 
End bp577725 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content67% 
IMG OID639854965 
Productflagellin domain-containing protein 
Protein accessionYP_001002096 
Protein GI121997309 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAGG TCATCAACAC CAACATTGCG TCCCTGACCG GTCAGCGGCA CCTGAGCAGC 
AGCCAGGCCG AGCAGCAGCA GGCCCTGGAG CGGCTCTCCT CGGGGCAGCG GATCAACTCC
GCGGCCGACG ACGCCGCCGG CCTGGCGATC AGCGAGCGCT TCACCTCGCA GATCGGTGGC
ATGAACCAGG CGGAGCGCAA CGCCAACGAC GGCATCTCCT ACGCCCAGAC CGCCGAGGGG
GCCATGGAGG AGATGGGCAA CCTCCTGCAA CGGGTCCGTG AGCTGGCGGT GCAGTCGGCC
AACGACACCA ACACGGCCGA AGACCGTCAG GCCCTGGAGG CCGAGGTGCA GCAGGCGGTG
CAGGAGATCG ACCGGATCGC CTCCAGCACC CAGTTCAACA ACCAGAACAT CCTGGACGGC
TCGCTGGATG AGCTGGTCTT CCAGGTGGGC GCCAACCGTG CGCAGAGCAT CAACACCGGC
GGTGTCGATG TGCGCGGCCA CAACCTGGGT GCCGAGATCG GTGAGGGGCA GGCCGTGCAG
CGGGCCCTGG ACGAGAACGG TGACTACGGC GATCTCGACC TGGACGGCTC GATCAACATC
AACGGGCTGG ATGTTGATGT CAGCGGCTCG CGGAGCGTCT CCGACGCCAT GGACGCCATC
AACGCCCAGT CCCGTGCCAC GGGCGTGACG GCCTTCCGGG CTGACCGCGC TACCACCGAG
GCGTTCGACT TCAACAACGA CGGCGGCTCC AGCCTGGAGA TCAACGGCAC CACCGTCAGT
GTGGGCGAGG ACGCCGGGGT GGGTGAGTTC GTCGACGAGG TGAACGCGGC CTCGGGCAAC
ACGGGTGTGC GGGCCGAGAT GGTCGGCGAT GACCAGGTGC GCTTCGTCTC CGAGTCCGAC
TTCCGCATCG AGCCGGGTGA CAACAGCCCG ATCGGTGATC TGGGTCTCGA GGCTGAAGAA
TCGGGGATGC GCTTCGAGCG GGGTGTCCAG CTCTCCACCG ATCTGGGGCA GCGCCTGGAT
GTCAATGGGG ATGCGGACAC ACTGGCGGCT CTGGGCATGA GCGACGAGCA GATGGACATG
AGCCGTCACC GGGTCAGCGG GCCGGATGCG CTGAGCGTGG CCACCCGCAC CGATGCCGAT
GACGCCATCC GCACGGTGGA CTTCGCCCTG GGGCAGATCA ACGACGCCCG GGCCGACCTG
GGTGCGGTGC AGAACCGCTT CGAGGCCACC ACCAGCAACC TGCAGAACGT CTCCGAGAAC
ATGGAAGCCT CCCGTTCCCG GATTCTGGAT GCGGACTTCG CCGCCGAGAC CGCCGCCATG
ACCCGCGCCC AGGTGCTCCA GCAGGCCGGC ACCTCGGTCC TGGCCCAGGC CAACGAGGCA
CCGCAGAACG TCCTGACCCT GCTGCAGTAA
 
Protein sequence
MAQVINTNIA SLTGQRHLSS SQAEQQQALE RLSSGQRINS AADDAAGLAI SERFTSQIGG 
MNQAERNAND GISYAQTAEG AMEEMGNLLQ RVRELAVQSA NDTNTAEDRQ ALEAEVQQAV
QEIDRIASST QFNNQNILDG SLDELVFQVG ANRAQSINTG GVDVRGHNLG AEIGEGQAVQ
RALDENGDYG DLDLDGSINI NGLDVDVSGS RSVSDAMDAI NAQSRATGVT AFRADRATTE
AFDFNNDGGS SLEINGTTVS VGEDAGVGEF VDEVNAASGN TGVRAEMVGD DQVRFVSESD
FRIEPGDNSP IGDLGLEAEE SGMRFERGVQ LSTDLGQRLD VNGDADTLAA LGMSDEQMDM
SRHRVSGPDA LSVATRTDAD DAIRTVDFAL GQINDARADL GAVQNRFEAT TSNLQNVSEN
MEASRSRILD ADFAAETAAM TRAQVLQQAG TSVLAQANEA PQNVLTLLQ