Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0366 |
Symbol | |
ID | 8533487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 372792 |
End bp | 377507 |
Gene Length | 4716 bp |
Protein Length | 1571 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646382750 |
Product | protein of unknown function DUF637 hemagglutinin putative |
Protein accession | YP_003262276 |
Protein GI | 261854993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGGTCA AGGCCGTTGA GTTTGGCTGT GCCGATCAGG GTTTGAATGG CCGCCGCTCT TTTGCCTGCC CGCTCGCTAC CGGCAAACAG CCAGTTCTTT TTGCCCACCG CAATCGGCCG AATGGTGTTT TCCACTGGGT TGTTGTCAAT CGGCAGGTGA CCGGTTTCGG CGTAGCGGAT CAATGCCGGC CAGCGTTTGA TGGTGTAGTC CATGGCCCTG GCTGTGCCAC TGCCCGGCGC CACGCTCTTG CGGGTGGTGG TGAGCCAGTG ATTGAATTCG TCCAGTGCGC CCCGGGCTTG CTGGCGCAGG GTTTGTCTTT CACCCAAGGA AGCCGCCGTG GCTTCCGACT CCAGGGCATA CAAACGGGCA ATGTATCTCA GTGCGGTTTC TGCAATGGGG CTCTGGTTGG CCTGGTGCAA TTCATAGAAC TTGCGCCTGG CATGCGCCCA GCAGGCCAGC TCGGTAATCC CCCGCTCGGC AAACAGATGC TTGTAGCCAC TGTAGTCATC CACCATCAGA TGACCGTTCC AGTGCCCGAG GAAACTATCG GCATGTTTAC CTGCCCGGCT GGTCTGGTAG TCGAATACCA CAATGGGAGG ACCTCGATCC CAATCATTGC TGCGCCAGGT CCACAGGTAG GCTCGTCTGG TTTTGCCGGC ACCGGGATCC AGTTGGGCAA CCGGGGTTTC ATCGGCACCG GTCGCCGCTT CCTCACCGGC TACAGCAGTG ATGAAGCCCA ATTCCAGGCC CTGATGAACG CCGGCGTGGC CTACGCCAAG CAATGGCACC TCATTCCCGG TGTCGCCCTG ACCGCCGCAC AGGTAGCACA ACTTTCCAGC GACATGGTCT GGCTGGTCGC CAAGGACGTG CAACTGCCCA ACGGTAAAAC CGAACGCGTT CTGGTGCCAC AGGTCTACGC GGCCTCCAAC CCGAACGACC CCGGTAGTAA CGGCAGCCTC ATCAGCGCCA ACCGCATTCA AACCCTGGGC CAAGGCACAT GGACCAACAG CGGCACCATC GCCGGTCGCC AACTCGTGGC TGTGGGTGCT GATGACATCA ATAACCTCGG TGGCAGGATC CTCGGCCAAT CCGTATCGCT GTCTGCCCGC AATGACATCA ATAACATTGG CGGCAGTATT GTCGCGGGCG ATGCCCTCAG TTTGAATGCC GGACGCGACA TCAATATCGT TTCCACTCAC CAACACGTCA GCAATACGGT GGGCGCCAGC CAGTTCAGCC GCGATAGCAT TTTGCAAACT GCCCAACTCA AGGTTCGCAA CACCGACGGC ATGTTACTCG CCTCGGCCGG ACGCGACTTG GTGCTGACCG CCGCCAACGT TGATAACGCC GGTACCGGTA CAACTGGATC AGAAGGCCCT GCTACTCAAC TGCAAGCGGG CCACGATATT CGGCTCACCA CCCTGACTAC CGGTCAGGAA AACCGCGTCG TTTGGGATGC CGACAATCAC GATACCCATG GCAACCAACA AGACGTCGGC ACCCGCATCC ACAGTGCGGG CGATCTCGCC ATGATCGCGG GGAACGACAT TCAGGCCAAG GCCGCCGGAA TCCATAGCGA TAACGGCACC TTGAACCTGC AAGCCGGTCA CGACATTCAA CTGACAGATG GCCAGAGCAG CACCCTCTGG GATGAAGCGC ACAAGCTCAG CAGCAGTGGC CTATTCTCCA GCAGCACGAG CATCTCACGT GACAATGTCA GCGACAGCCG CAGCGTCGCC ACCAAACTGA GCGGCCAGAG CGTGCAGCTC TCCGCCAACC ACGACATCGC CCTTCAAGGC AGCCAAATCG ATGCAAACGG TAACATCGCC CTGCTCGCAG GTGACAACAT CTCCCTGTTG GCTGGTCGTA ATCAGCACAG CGAAAGCCAC TATCGCGAAG AACGCAAATC CGGCTTTACC ACCAAGGGTT ATGGCAAATC CCACACAATC GATACCGTGG ACATGGCCTC CCTCCTAAAC GATGGCAGTA GCCTGCACAG CACTCAAGGC AACATCACAC TAGCAGCGAA TCTGGCTGAA CAAAACCAGG CCGATCAAGG TGCCGTGCTG ATCGAAGGCA GCCAGGTCCA CGCCGACCAC GGACGGGTGG ATGTCAGCGG CAAGCACGTT ATTCTCTCCA CCAGCGAAGA CCAACAGCAA TACAGCCACA GCCATCAAAG CACCCACAGT AACTGGGCTT TTATCACCGG CCTGCCTGAT GGCATGCGCG ACGGACTCGA CACTCAGCAA CAAACAATCA CCGTCAACGG CAGCACGCTG CAGGGGCAGA AAGGGGTGGG TGCACAGGCC ACCGGCCTGG TGGACATGAC CGCCGCCCAC CTCAAGGCCA GCCAGGGCGA CATCGACATC AGTGGTGCTC AAGTCGCCAT TCGCAGCGGC ACCAATCAAC AAGCCAGCAG CAGCCGTGAA ACACACAAGA AATCCGGTAT CAGTTGGCGT GATCTGACCG GCCTGTTCAC ACCCGGCAAG GGCGTGGGCT ATGACGCCAC CCTCGACAAG AAAAGCAGTA AAACCACCGT GGCCCATGCC ACGCTGGAAG GTCAAAACAT CCACATCCAG GCCAACCAAG GCGACCTCAC CCTCGCAGCC GTTCAGGCCA AGGCCACCGG CACTTCTGAA CCATCGGACG GATCGGACGG GCCGGTTCAT TCGCCTGGGC AAATAAGCCT CAAGGCCGCA GGCAATATCA ATCTCGCCAG CGTCAGTACC GAAAGCTACC AGCGGACCGA TGAGAAGCAT AAAGATAAAG CCTGGCAGGA AACCCACGGT GAAGGCAATT ACGATCAGCA AACCCACTAC AACCAACTAA CCGCCGGACA GCTCGATCTT CAGGCCGGTG GCAGCATCAC CGCCGACATG AGCGTGCGTG ACAGCGCCGC CATGCTGGCC CAGTCACCCG ACATGGCCTG GCTGCGCCAG TTGCAACAGA ATCCGAAACT GGTCGGCAAG GTCGATTGGC AACAGATCGA AGAAGCCCAT CAACATTGGG ACTATAAACA CCAGGGCCTG ACCCCGGCGG CATCCGCCGT CGTGGCCCTG GTTGTTGCGT ACTTCACGAT GGGTGCCGGT TCGGCCATCG TCAATACGGC TGCTGGATCC ACTACGGCTG CAGCCAGTGG CGCCGGTGCC GTCGCGGCAG GCATGACCCA GGCTGCGGTC AGCACCATGG CCAGCCAAGC GGCCGTCAGC TTCATCAATA ACGGTGGTGA CCTCAGCAAG ACCCTGAACG ATCTGGGCAG CAGCCAGAGC ATGCGCCAAC TGGCCACAGC CGTTGTCACC GCCGGGGTGC TTAGCAGTAT TGGTCAAGTC ACCTTCGGCG AAGGCAAGAA TGCCTTCCGG CTGAACGATG TCAAGGTAAG CGATGGCCTG GTACCGAACA TCGGCAAAAA CCTGATCGAC GGCGTTGCCC GAGCCACCGT CAACAGCGCC ATCACCGGCA CCGACCTTCA AACCAATATC CGCACCAATG TGGTGGCTGG CATCCTGGGT GCCGCCGAAC AACAAGGTGC TAATTGGATC GGCAACCAGA CCCTGCTGGG CGGGGACTTC AACACCAACG GCAACGTCAA CGAATTCGCC CATGAATTCG CCCATGCCAT CGTCGGTTGT GCCGCCGGAG TGGCCGGTGC CAGTGCATCG GGCAGTGGTG CCAGTACCGG TCAAGGTTGT AGTGCTGGAG CCTTGGGTGC CGTGGTGGGT GAACTATCCG CCCAATTCTA TGGCGGTACC GATCCGAACC AGACCATCGC CTTCGCCCAG ATGATGGGCG GCATCGCCGC TGCTGCGGCG GGGCTTGGTT CCGAAGGCGT TGCCATCGCC GCCAATACCG GTGCCAATGC GGCGCAGAAC AACTACATGG CGCATTACGA CACGTATGAA GCGGATCTGA AGGACTGTCA GCAGAATCCG GGCGGTGTGA ACTGCGGTGC CATCTTAAGT TTGACCGAAG GTACGAACGC TCGTTATCTC GGTATGACCC AAGGCGGTTA TCGGGTTGCG GCCAACATGG GTAAAGACGG CGCTGCAAGC TATACCGTCG TTAGCCCCAA TGGCGAAATG ATGGTGATGC AGCCAACCGA ATGGGCCTAC TTCTCCCAGA TGACATCCGG ACAGCAGGCG ACGATATTTG CCGGATCACA ATGGCAACTG GACCTGACTT CGGCAACCGA GTATGGTTTG GCAGGCGACA CGACAGCCGC CATGGCAAAC TATGCGCACA TGCTGACCCA ACCGGATTAC TGGATTGGCA TGGGGGCCGC GTTCCTACCT GCTGGTGTGT TGGGTCGGGC GGCGACCGCC ACAGATGAGG CGGCCTTGCT TGGGCGAGGG GCGGCTGGCG ATGCTACAAG GGGAATTCCG CGAAACAAGT CGTTGGTTCC ATTCAACCCG CCCAATGACG GATTCCTTGG TGCACCGCTA GATTTCACAC TTATACTAGG AACCAGAATA GATCGAATCG GATTTGATGG TGGGAGGTTT TTGTCACCCG CCGGGACACC AATTCCGATG CGTGCATTAC GTCCTGGAAC TGAAACGCGT CCGTTAAGAA CCTTCGAAGT TATTAGGCCC CTGGATGTCC GTGCTGGTGA AATTGCACCA GCGTTCGGAC AGCCCGGACT AGGCACACAA TTCGTTACAG ATCGACCGGT TCGGGATTTG CTCCGAGAAG GATTCTTAAG GAGCGTAGAC CCATGA
|
Protein sequence | MWVKAVEFGC ADQGLNGRRS FACPLATGKQ PVLFAHRNRP NGVFHWVVVN RQVTGFGVAD QCRPAFDGVV HGPGCATARR HALAGGGEPV IEFVQCAPGL LAQGLSFTQG SRRGFRLQGI QTGNVSQCGF CNGALVGLVQ FIELAPGMRP AGQLGNPPLG KQMLVATVVI HHQMTVPVPE ETIGMFTCPA GLVVEYHNGR TSIPIIAAPG PQVGSSGFAG TGIQLGNRGF IGTGRRFLTG YSSDEAQFQA LMNAGVAYAK QWHLIPGVAL TAAQVAQLSS DMVWLVAKDV QLPNGKTERV LVPQVYAASN PNDPGSNGSL ISANRIQTLG QGTWTNSGTI AGRQLVAVGA DDINNLGGRI LGQSVSLSAR NDINNIGGSI VAGDALSLNA GRDINIVSTH QHVSNTVGAS QFSRDSILQT AQLKVRNTDG MLLASAGRDL VLTAANVDNA GTGTTGSEGP ATQLQAGHDI RLTTLTTGQE NRVVWDADNH DTHGNQQDVG TRIHSAGDLA MIAGNDIQAK AAGIHSDNGT LNLQAGHDIQ LTDGQSSTLW DEAHKLSSSG LFSSSTSISR DNVSDSRSVA TKLSGQSVQL SANHDIALQG SQIDANGNIA LLAGDNISLL AGRNQHSESH YREERKSGFT TKGYGKSHTI DTVDMASLLN DGSSLHSTQG NITLAANLAE QNQADQGAVL IEGSQVHADH GRVDVSGKHV ILSTSEDQQQ YSHSHQSTHS NWAFITGLPD GMRDGLDTQQ QTITVNGSTL QGQKGVGAQA TGLVDMTAAH LKASQGDIDI SGAQVAIRSG TNQQASSSRE THKKSGISWR DLTGLFTPGK GVGYDATLDK KSSKTTVAHA TLEGQNIHIQ ANQGDLTLAA VQAKATGTSE PSDGSDGPVH SPGQISLKAA GNINLASVST ESYQRTDEKH KDKAWQETHG EGNYDQQTHY NQLTAGQLDL QAGGSITADM SVRDSAAMLA QSPDMAWLRQ LQQNPKLVGK VDWQQIEEAH QHWDYKHQGL TPAASAVVAL VVAYFTMGAG SAIVNTAAGS TTAAASGAGA VAAGMTQAAV STMASQAAVS FINNGGDLSK TLNDLGSSQS MRQLATAVVT AGVLSSIGQV TFGEGKNAFR LNDVKVSDGL VPNIGKNLID GVARATVNSA ITGTDLQTNI RTNVVAGILG AAEQQGANWI GNQTLLGGDF NTNGNVNEFA HEFAHAIVGC AAGVAGASAS GSGASTGQGC SAGALGAVVG ELSAQFYGGT DPNQTIAFAQ MMGGIAAAAA GLGSEGVAIA ANTGANAAQN NYMAHYDTYE ADLKDCQQNP GGVNCGAILS LTEGTNARYL GMTQGGYRVA ANMGKDGAAS YTVVSPNGEM MVMQPTEWAY FSQMTSGQQA TIFAGSQWQL DLTSATEYGL AGDTTAAMAN YAHMLTQPDY WIGMGAAFLP AGVLGRAATA TDEAALLGRG AAGDATRGIP RNKSLVPFNP PNDGFLGAPL DFTLILGTRI DRIGFDGGRF LSPAGTPIPM RALRPGTETR PLRTFEVIRP LDVRAGEIAP AFGQPGLGTQ FVTDRPVRDL LREGFLRSVD P
|
| |