Gene Hhal_0506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0506 
Symbol 
ID4709959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp574515 
End bp575993 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content66% 
IMG OID639854964 
Productflagellin domain-containing protein 
Protein accessionYP_001002095 
Protein GI121997308 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAGG TGATCAACAC CAACGTTGCA TCGCTGAACG CGCAACGGCA CTTGAATTCT 
TCCCGGGGCG ACCAGGAGGT GGCCCTGGAG CGGCTCTCCT CGGGGCTGCG TATCAACAGC
GCCCGGGACG ATGCCGCCGG TCTGGCGATC AGTGAGCGCT TCACCGGCCA GATCAACGGC
ATGGATCAGG CCGCCCGCAA CGCGAATGAC GGCATCTCGT TTGCCCAGAC GGCGGAAGGC
GCCATGGAAG AGATGAGCAA TCTGCTCCAG CGGGTCCGCG AGCTGGCGGT GCAGTCGGCC
AACGACACCA ACTCGCCGTC GGACCGTGCC GCTCTGGACC GCGAGGTGCA GGCTGCCGTG
CAGGAGATCG GCCGGATCGC CGAGAGCACC CAGTTCAACC AGCAGAACGT GCTCAACGGC
ACCCTGCGCG AGCTGGTCTT CCAGGTGGGG CCGAACCGCG GGCAGACGAT CAATGCCGGG
GGCGTTGACG TGCGTGCCGA GAACCTCGGG GCCAATGTGG CCGAGGGTCG TGCCGTGCAC
CAGACCGCCG GCGGCGAGGG AGTGCAGCTC CCGAGCGGCC TGCAGGTCAA CGGCCAGGAA
ATCGATCTCG GCGACGCCCG CGAGCTTAAC GACGTGGCGA GCGAGATCAA CGAGCGTCAG
GCCGAGACCG GGGTCTCCGC CATGCGCGCC GACCGTGCCG AGACCCAGGC GGTGGAGTTC
GACGGTCTCG CGGAAGGGGA GCGTGCCCAG CTGCGGATCA ACGACCACGC CATTGAGCTC
GATGGCGACA TGGAGGACAT GAGCGACTTT GCCGCTCGGG TCAACGACCA GGCCTCGGAG
ACCGGGGTGC GTCTGGAGAA CGGCGAGAAC GGCTGGTCCT TCGTCTCCAA CAGCGACTTC
GAGCTGGAGT ACATCTCGGA CGATGCCGAG GGAGCGCTCT CCGTCGGTGG CACCACGGTC
GGACAGGGCC TGGATCGCAC CGACGAGGAG AGCACCGGGC TGATTGTCGA GCGGGGCATC
ACCCTGTCCA CGGAGATCGG TGGTGAGCTC CGGGTCGATC CGCTGGAGGG GGACGACGAC
GCCGACCTCG GGGCGATCGG GCTCAAGAAC TGGCAGCCCG GCGGCGACTA TGAGGATCTT
CAGGCTGAGG CGTACACGGT GGGTGGCGTG GATCCGGTGG ATGTGCGCAC CCGGGAGACT
GCGTCGGATA CCATCGTGGC GGTGGACTTC GCCCTGCAGC AGATCAACAA CACCCGGGCT
GATCTGGGTG CGATCCAGAG CCGGTTCGAT GCCACGATCA ACAATCTGAA CATCTCCTCG
GAGAACCTGA GCGCATCCCG CTCGCGGATC CTGGATGCCG ACTTCGCCGA GGAGACGGCC
GAGATGACCA GGACGCAGAT CCTGCAGCAG GCGGGCACCT CGGTGCTGGG TCAAGCCAAC
GAGATCCCGC AGCAAGTGGC ACAGCTGTTG CAGCAGTAA
 
Protein sequence
MAQVINTNVA SLNAQRHLNS SRGDQEVALE RLSSGLRINS ARDDAAGLAI SERFTGQING 
MDQAARNAND GISFAQTAEG AMEEMSNLLQ RVRELAVQSA NDTNSPSDRA ALDREVQAAV
QEIGRIAEST QFNQQNVLNG TLRELVFQVG PNRGQTINAG GVDVRAENLG ANVAEGRAVH
QTAGGEGVQL PSGLQVNGQE IDLGDARELN DVASEINERQ AETGVSAMRA DRAETQAVEF
DGLAEGERAQ LRINDHAIEL DGDMEDMSDF AARVNDQASE TGVRLENGEN GWSFVSNSDF
ELEYISDDAE GALSVGGTTV GQGLDRTDEE STGLIVERGI TLSTEIGGEL RVDPLEGDDD
ADLGAIGLKN WQPGGDYEDL QAEAYTVGGV DPVDVRTRET ASDTIVAVDF ALQQINNTRA
DLGAIQSRFD ATINNLNISS ENLSASRSRI LDADFAEETA EMTRTQILQQ AGTSVLGQAN
EIPQQVAQLL QQ