Gene Hneap_1766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1766 
Symbol 
ID8534924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1900498 
End bp1901439 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content51% 
IMG OID646384148 
Productflagellar hook-associated protein 3 
Protein accessionYP_003263636 
Protein GI261856353 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTTT CTACTTCCAT GGTGTTTGCA CAGGGCCTGA GCAATCTGCA ACAGCAGCAG 
TCCGCGATGT TGCAGTCTCA GCAAGAAATC GCAACGGGTG TGAAACTGAC CAACCCGGCG
CAAGATCCGG TCGCCTTTTC GACGGCTAGC GATCTGTCTG TGCTCAATAG CAAGCAGAAT
CAGTACAGTA CGAATATCGA CAATGCGACC GGTAAGATTC AGGTGCAGGA ATCGACCTTG
GGTTCAATTA CCACGATGCT GCAAAGCGTG CGTGATGTTG CCATCCAAGC GAACAACGCT
GCGCAAAATG GCATGTCACT GTCCGCACTG ACGGATCAAC TGGATCAACT GCAAAAGGCT
TTGGCTGGTC AAATGAATGC CACGGACGAG CGCGGGGAGT ATCTGTTTTC CGGTACGGTC
GCACGTGAAA AACCCTATGA CGCCAGCGGT CAGCTTAATC CCGCCTTGGA TTCCACCAGT
CCGTCTTTTC AAAACGTCAC AAGTGTCAAG TTGGCCATTT CCGATCAGCA GTCCGTGGCC
ATTAATCAGC CAGCCGGGCA GATTTTCCAA CTCTCATCCA GTGCAACGAC AGGCGGAAAT
GCCAGCATTC TGCAAGTCAT TGATCAACTG AAAACGGCTA TTACGACCCA GCCCGCAAAT
CTCCAGACTA TTTATCAAAA TGCGCAAAAA GATATTGATG CTGTGATGAA CCAGGTGACG
GACGCGCGCG GCAGCATGGG TAATGCGCTC AATACGCTGA GCACGGCTAA AAACGATAAC
GCCGCACAAA ATGTGCTTAC CCAACAAACG CTTTCCGGTT TGCGCGATAC CGATGTCGCC
AGCGCCATTA CCAAATTGAA TCAAAGCTAC CTCAATTTAC AGGCGACCCA GCAGAGCATG
GTGAAAATCC AAAGTCTGTC CCTGTTTAAC TATATTCGTT GA
 
Protein sequence
MRVSTSMVFA QGLSNLQQQQ SAMLQSQQEI ATGVKLTNPA QDPVAFSTAS DLSVLNSKQN 
QYSTNIDNAT GKIQVQESTL GSITTMLQSV RDVAIQANNA AQNGMSLSAL TDQLDQLQKA
LAGQMNATDE RGEYLFSGTV AREKPYDASG QLNPALDSTS PSFQNVTSVK LAISDQQSVA
INQPAGQIFQ LSSSATTGGN ASILQVIDQL KTAITTQPAN LQTIYQNAQK DIDAVMNQVT
DARGSMGNAL NTLSTAKNDN AAQNVLTQQT LSGLRDTDVA SAITKLNQSY LNLQATQQSM
VKIQSLSLFN YIR