Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0370 |
Symbol | |
ID | 8533491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 378823 |
End bp | 386088 |
Gene Length | 7266 bp |
Protein Length | 2421 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646382752 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_003262278 |
Protein GI | 261854995 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATTG AACAGAAATG CAACTCAATC GCTCGCACCG CCCAACAACG CCAGAGTGTC CCGGCAGTGG CGCCCATTGG CTACAGAGTG CTGGTCTTCC TGCTAAGTGC AATCATACCG CCCAGTGCCA ACGCATCAGG GATTGCAGCA GATCATCAAG CGCCTGCCTA CCAGCAGCCC ACCATTCAGG TCACTGCGAA CGGCACACCG GAAGCCAACA TCCAAACACC CGGTGTGGAT GGCGTTTCCC ATAACGTATA CAAACAGTTC GACGTGGGAT CAAAAGGGGT CATCCTCAAT AACGCCCGTA CCAATGCCCA AACACAACTC GGTGGCTGGA TTCAGGGTAA CCCGAATCTA GCGAATGGTT CGGCGCGGGT CATTCTCAAT GAGGTCAATA GCAGTCAGCC CAGTCAATTG AATGGCTACG TCGAAATCGG TGGCCAACGG GCCGAACTCG TCATTGCCAA CCCTTCGGGT ATCAATGTTG ATGGCGGTGG GTTCATCAAT GCCAGTCGCG CCACACTGAC CACCGGCAAA CCCGTATTCC AGAATGGCAC TCTGGCCGGT TATCAGGTCT CCAACGGTAG CATCAACATC AATGGCGCAG GTTTGGACGC CAGTTCGACC GATTACACCG ACATCCTGGC TCGAGCCATC AATGTCAATG CTGGCATCTG GGCAAATCAG CTCAAACTGG TCACCGGCAG CAACGACATC AGTCCAGACG GGACTCAGAT CACGCCCACG TCTGGTGATG CAAATAAACC CGGTTTCGGC ATCGACGTTG CTCAACTGGG AGGCATGTAC GCGGGCAAAA TCACCCTGAT CGGCACTGAA GCCGGCGTGG GTGTCCGCAA CGCCGGACAA ATCGGCGCCA GCGATAACCT GGTTCTAAAT GCCAATGGCG AACTGACGAA CAGCGGGCAG ATCACCAGTC AGGGCACCAC CACCGCCCAG ACCAACGTCA TCGACAACAG CGGCACGATT TACGCTCAGG ACAATGCGCA ATTGAACGCT AATGGTGACA TCACCAACAG TGGCCGCATC CTGGCTCAAG GAAACTTGGG ACTCGATGCC GCCACCGACG TGGTCAACCT CGACGGCGCC ATCCTCGGCG CCGGCGTCCA GCCCGACGGC AGTCTGGGCG GCGCCCGAAA CCTCACCATC GCTTCTGGAC AGAATGCGCG CCTTCAGGGC AAGAACCTCG CCTCAGGCAA CCTGACGGCA AAAGGAAACA CTGTCGACCT GAGCCATAGC CAGACACAAG CCGACAGCAT CACTCTGACG GCAACTCACG GTGATATTGA TGCATCGAGC GCCACTATCA CTGCGCAACA CCAACTCCAA GCCAACGCCA GCGGTTCGTT CATCAACGAT CAAGCGCAAA CCGCCGCAGG CACGCTCCAA CTGAAAGCCG CGAATCTCAG CAATCTCGAT GGCCTGATTC TGCTGGGCGA TTCGTCCGGT GCCAGTGATA TAACCCTGAG CAACTTGTTC GACAACCGCG GTGGCTTGCT GCACACAACT GGGGAGATGC AGATCACTGC AAACCAAGTG GATAACCGTA ACACGACCCA GTCTGCACAA GGCATTCAGG GTAACGACGT CACCATCACC GCTGATCGCA TCAATAATGG TTCCGGGCAC ATCCTGGCTA ACCAGAACAT CCAGTTGAAC AGCGCAGGCC AGATCGACAA CAGCCAAGGT CTCGTCTCCG CCGGCGGCAA TCTGAGCGCT CAGGACGGCC AGGCTACGCG GTCCCTCGCC ATTACCAACA CGCAAGGTAC CTTCGTTGCC GGTCAGAACC TTGCTGTGAC CAGCGCTACC TTGAGTGGTG ACGGCCAACT GTATGCGCAA CAGGACATCC ACCTCCTGTT GGACGGTGGC TTCGACAATA GTGGCACCCT CATCGCTAAC CACGACACCG ATCTTAATAT CGGTGGCATC CTTACCAACA GCGGCACGCT GCAGGCCGGC AACACCCTTA CCGCTCACGC CGGCACTATC GATAACCAAG CAAGTGGGCA AATCAGCGCA GCAGAAGACC GTATCAACGC CACGGGCAAA CTCACCAACC GTGGTCTGAT CGATGGCGGA ACGACTCGTC TACAGGCGAA CACCATCGAC AACCTCGGTA CCGGCAGGAT CTACGGCGAC CACATCGCCA TTGGGGCCAA CGACCTGACC AATGAAAATG AAAACGGTAC TGCTGGTACC ATCGCTGCGC GCAACCGGCT CGACATCGGG GTGCAGACAC TCGAAAACAA GGAACATGCC CTGATTTACA GCGCCGGTGA CATGAGCATC GGCGGCGCAC TGGATGCCAA CGATCAAGCT ACCGGCCAGG CGGTGCACCT CACCAACAGC AGTGCGACCA TTGACGCCGA TGGCCAGATT CAGCTCAGCG CGCAAACCAT CGACAATTTG AATGCGCACC TTGTCACCAG TCAGGTCGAT GACGCAACCG TTACCCACGA GTATGTTCAA CCACGCGGGC AATCTCAGCT TTTCGACATC AGCGCATGCT CAGGTATTGG CGGGGGGCAA GATAAGAACA GCTGCAATGG CTATCCTGGC ACCTTCGAGG ACTATACGCT CTTCATTGTC CACAGCACGC CTTCGCACAC TATTGTGGTG TCATCTGACC CCGGGAAAAT CCAGGCCGGT GGCGACATCG TCATCAATGG CGCGCTCAAC AATCGTGACA GCGCTGTTAT TGCAGGCGGT GACCTGGTGG CTTCCCAACC GGTCAGCAAC ACAGCAACCA AAGGGCAAGA CATCACCCGT TCCACGGGGA CAGCCGAGCT AACCACCGTG GAGAGTTGCG GTCTGTTCGG CAGTGACCAC TGCCGGGAAT GGCATGGTCA ATCCGCCTAC AACCCCGCCC CGGTCTACGG TACGCCTTAC GATCTCCCCA CGCTGACCTA TCAAACCCAC ACATTGATAG GCGGCACCGC GCCAACTATC GGTAACCGCT CGGCGACCGG TACGAGTCAG CCCGTCAGCC AGCCGATCAA TACCTCTGGC GGTTACGCCT GGACTCAGCC CAATATCACG CTGCGCAGCA GCAGTCTGTT CCAGGTACAC GACGACAATG GCAACCATCC GCTGATTGAG ACCGATCCAC GCTTTGCCGA TTATCGGCAA TGGCTCAGCT CTGACTACAT GATCAGCGCC TTGTCATTAC AGCCGGACAG CCTGCTCAAG CGACTCGGCG ACGGTTACTA CGAACAGAAG TTGGTGCAAG ATCAGGTTGC AGAACTCACC GGTCGCCGAT TCCTCACCGG CTACAGCAGT GATGAAGCCC AATTCCAGGC CCTGATGAAT GCGGGTGTAA CCTATGCTAA GCAATGGCAC CTCATTCCCG GCGTCGCCCT GACCGCCGCA CAGGTAGCAC AACTTTCCAG CGACATGGTC TGGCTGGTCG CCAAGGACGT GCAACTGCCC AACGGTAAAA CCGAACGCGT TCTGGTGCCA CAGGTCTACG CGGCCTCCAA CCCGAACGAC CCCGGTAGTA ACGGCAGCCT CATCAGCGCC AACCGCATTC AAACCCTGGG CCAAGGCACA TGGACCAACA GCGGCACCAT CGCCGGTCGC CAACTCGTGG CTGTGGGTGC TGATGACATC AATAACCTCG GTGGCAGGAT CCTCGGCCAA TCCGTATCGC TGTCTGCCCG CAATGACATC AATAACATTG GCGGCAGTAT TGTCGCGGGC GATGCCCTCA GTTTGAATGC CGGACGCGAC ATCAATATCG TTTCCACTCA CCAACACGTC AGCAATACGG TGGGCGCCAG CCAGTTCAGC CGCGATAGCA TTTTGCAAAC TGCCCAACTC AAGGTTCGCA ACACCGACGG CATGTTACTC GCCTCGGCCG GACGCGACTT GGTGCTGACC GCCGCCAACG TTGATAACGC CGGTACCGGT ACAACTGGAT CAGAAGGCCC TGCTACTCAA CTGCAAGCGG GCCACGATAT TCGGCTCACC ACCCTGACTA CCGGTCAGGA AAACCGCGTC GTTTGGGATG CCGACAATCA CGATACCCAT GGCAACCAAC AAGACGTCGG CACCCGCATC CACAGTGCGG GCGATCTCGC CATGATCGCG GGGAACGACA TTCAGGCCAA GGCCGCCGGA ATCCATAGCG ATAACGGCAC CTTGAACCTG CAAGCCGGTC ACGACATTCA ACTGACAGAT GGCCAGAGCA GCACCCTCTG GGATGAAGCG CACAAGCTCA GCAGCAGTGG CCTATTCTCC AGCAGCACGA GCATCTCACG TGACAATGTC AGCGACAGCC GCAGCGTCGC CACCAAACTG AGCGGCCAGA GCGTGCAGCT CTCCGCCAAC CACGACATCG CCCTTCAAGG CAGCCAAATC GATGCAAACG GTAACATCGC CCTGCTCGCA GGTGACAACA TCTCCCTGTT GGCTGGTCGT AATCAGCACA GCGAAAGCCA CTATCGCGAA GAACGCAAAT CCGGCTTTAC CACCAAGGGT TATGGCAAAT CCCACACAAT CGATACCGTG GACATGGCCT CCCTCCTAAA CGATGGCAGT AGCCTGCACA GCACTCAAGG CAACATCACA CTAGCAGCGA ATCTGGCTGA ACAAAACCAG GCCGATCAAG GTGCCGTGCT GATCGAAGGC AGCCAGGTCC ACGCCGACCA CGGACGGGTG GATGTCAGCG GCAAGCACGT TATTCTCTCC ACCAGCGAAG ACCAACAGCA ATACAGCCAC AGCCATCAAA GCACCCACAG TAACTGGGCT TTTATCACCG GCCTGCCTGA TGGCATGCGC GATGGACTCG ACACCCAGCA ACAAACCATC ACCGTCAACG GCAGCACTCT GCAGGGGCAG AACGGTGTGG GTGTACAGGC CACCGGCCTG GTGGACATGA CCGCCGCCCA CCTCAAGGCC AGCCAGGGCG ACATCGACAT CAGCGGTGCT CAGGTCGCCA TTCGCAGCGG CACCAATCAA CAAGCCAGCA GCAGCCGTGA AACACACAAG AAATCCGGCA TCAGTTGGCG TGATCTGACC GGCCTGTTCA CACCCGGCAA GGGCGTGGGC TATGACGCCA CCCTCGACAA GAAAAGCAGT AAAACCACCG TGGCCCATGC CACGCTGGAA GGTCAAAACA TCCACATCCA GGCCAACCAA GGCGACCTCA CCCTCGCAGC CGTTCAGGCC AAGGCCACCG GCACTTCTGA ACCATCGGAC GGATCGGACG GGCCGGTTCA TTCGCCTGGG CAAATAAGCC TCAAGGCCGC AGGCAATATC AATCTCGCCA GCGTCAGTAC CGAAAGCTAC CAGCGGACCG ATGAGAAGCA TAAAGATAAA GCCTGGCAGG AAACCCACGG TGAAGGCAAT TACGATCAGC AAACCCACTA CAACCAACTA ACCGCCGGAC AGCTCGATCT TCAGGCCGGT GGCAGCATCA CCGCCGACAT GAGCGTGCGT GACAGCGCCG CCATGCTGGC CCAGTCACCC GACATGGCCT GGCTGCGCCA GTTGCAACAG AATCCGAAAC TGGTCGGCAA GGTCGATTGG CAACAGATCG AAGAAGCCCA TCAACATTGG GACTATAAAC ACCAGGGCCT GACCCCGGCG GCATCCGCCG TCGTGGCCCT GGTTGTTGCG TACTTCACGA TGGGTGCCGG TTCGGCCATC GTCAATACGG CTGCTGGATC CACTACGGCT GCAGCCAGTG GCGCCGGTGC CGTCGCGGCA GGCATGACCC AGGCTGCGGT CAGCACCATG GCCAGCCAAG CGGCCGTCAG CTTCATCAAT AACGGTGGTG ACCTCAGCAA GACCCTGAAC GATCTGGGCA GCAGCCAGAG CATGCGCCAA CTGGCCACAG CCGTTGTCAC CGCCGGGGTG CTTAGCAGTA TTGGTCAAGT CACCTTCGGC GAAGGCAAGA ATGCCTTCCG GCTGAACGAT GTCAAGGTAA GCGATGGCCT GGTACCGAAC ATCGGCAAAA ACCTGATCGA CGGCGTTGCC CGAGCCACCG TCAACAGCGC CATCACCGGC ACCGACCTTC AAACCAATAT CCGCACCAAT GTGGTGGCTG GCATCCTGGG TGCCGCCGAA CAACAAGGTG CTAATTGGAT CGGCAACCAG ACCCTGCTGG GCGGGGACTT CAACACCAAC GGCAACGTCA ACGAATTCGC CCATGAATTC GCCCATGCCA TCGTCGGTTG TGCCGCAGGA GTGGCCGGTG CCAGTGCATC GGGCAGTGGT ACCAGTACCG GTCAAGGTTG TAGTGCTGGA GCCTTGGGTG CCGTGGTGGG TGAACTATCC GCCCAATTCT ATGGCGGTAC CGATCCGAAC CAGACCATCG CCTTCGCCCA GATGATGGGC GGCATCGCCG CTGCTGCGGC GGGGCTTGGT TCCGAAGGCG TTGCCATCGC CGCCAATACC GGTGCCAATG CGGCGCAGAA CAACTACATG GCGCATTACG ACACGTATGA AGCGGATCTG AAGGACTGTC AGCAGAATCC GGGCGGTGTG AACTGCGGTG CCATCTTAAG TTTGACCGAA GGTACGAACG CTCGTTATCT CGGTATGACC CAAGGCGGTT ATCGGGTTGC GGCCAACATG GGTAAAGACG GCGCTGCAAG CTATACCGTC GTTAGCCCCA ATGGCGAAAT GATGGTGATG CAGCCAACCG AATGGGCCTA CTTCTCCCAG ATGACATCCG GACAGCAGGC GACGATATTT GCCGGATCAC AATGGCAACT GGACCTGACT TCGGCAACCG AGTATGGTTT GGCAGGCGAC ACGACAGCCG CCATGGCAAA CTATGCGCAC ATGCTGACCC AACCGGATTA CTGGATTGGC ATGGGGGCCG CGTTGATACC TACTAGCGTA CTGGGACGAA CTGCCGGCAT GACCAACCAG CTAGCAACCG GAACGAACAA GGCAATGTTC TGGTCCGGCC TTGGCCGGGG TGGCGACAAA ATTGCCGCAA GAGCTGCCGC TCAACAAGGA AAAATGACTC TTGAATCTAC GCTTGCCGCT CGCGGTATCA AATTACCAAA ATGGGACCCA GACAATCCGG AAGTAGTATC AGCCTGGAGA CAGGCTTCCC ACGATTTTGC GGCGGGCGCA AGGGGAAATG TGCGAGTGTT ACAAGGTGAC GTTTTACGGG GGAACTCAGT CTGGGCAGAA GTTGAATTCC CAGCACTGAA AGCAAATCCC ACTGTGAAAT CTATTACCTC AATAGATGCC GCAACGGGTA AAGAGGTATT GCTGTGGTCA AGATAG
|
Protein sequence | MKIEQKCNSI ARTAQQRQSV PAVAPIGYRV LVFLLSAIIP PSANASGIAA DHQAPAYQQP TIQVTANGTP EANIQTPGVD GVSHNVYKQF DVGSKGVILN NARTNAQTQL GGWIQGNPNL ANGSARVILN EVNSSQPSQL NGYVEIGGQR AELVIANPSG INVDGGGFIN ASRATLTTGK PVFQNGTLAG YQVSNGSINI NGAGLDASST DYTDILARAI NVNAGIWANQ LKLVTGSNDI SPDGTQITPT SGDANKPGFG IDVAQLGGMY AGKITLIGTE AGVGVRNAGQ IGASDNLVLN ANGELTNSGQ ITSQGTTTAQ TNVIDNSGTI YAQDNAQLNA NGDITNSGRI LAQGNLGLDA ATDVVNLDGA ILGAGVQPDG SLGGARNLTI ASGQNARLQG KNLASGNLTA KGNTVDLSHS QTQADSITLT ATHGDIDASS ATITAQHQLQ ANASGSFIND QAQTAAGTLQ LKAANLSNLD GLILLGDSSG ASDITLSNLF DNRGGLLHTT GEMQITANQV DNRNTTQSAQ GIQGNDVTIT ADRINNGSGH ILANQNIQLN SAGQIDNSQG LVSAGGNLSA QDGQATRSLA ITNTQGTFVA GQNLAVTSAT LSGDGQLYAQ QDIHLLLDGG FDNSGTLIAN HDTDLNIGGI LTNSGTLQAG NTLTAHAGTI DNQASGQISA AEDRINATGK LTNRGLIDGG TTRLQANTID NLGTGRIYGD HIAIGANDLT NENENGTAGT IAARNRLDIG VQTLENKEHA LIYSAGDMSI GGALDANDQA TGQAVHLTNS SATIDADGQI QLSAQTIDNL NAHLVTSQVD DATVTHEYVQ PRGQSQLFDI SACSGIGGGQ DKNSCNGYPG TFEDYTLFIV HSTPSHTIVV SSDPGKIQAG GDIVINGALN NRDSAVIAGG DLVASQPVSN TATKGQDITR STGTAELTTV ESCGLFGSDH CREWHGQSAY NPAPVYGTPY DLPTLTYQTH TLIGGTAPTI GNRSATGTSQ PVSQPINTSG GYAWTQPNIT LRSSSLFQVH DDNGNHPLIE TDPRFADYRQ WLSSDYMISA LSLQPDSLLK RLGDGYYEQK LVQDQVAELT GRRFLTGYSS DEAQFQALMN AGVTYAKQWH LIPGVALTAA QVAQLSSDMV WLVAKDVQLP NGKTERVLVP QVYAASNPND PGSNGSLISA NRIQTLGQGT WTNSGTIAGR QLVAVGADDI NNLGGRILGQ SVSLSARNDI NNIGGSIVAG DALSLNAGRD INIVSTHQHV SNTVGASQFS RDSILQTAQL KVRNTDGMLL ASAGRDLVLT AANVDNAGTG TTGSEGPATQ LQAGHDIRLT TLTTGQENRV VWDADNHDTH GNQQDVGTRI HSAGDLAMIA GNDIQAKAAG IHSDNGTLNL QAGHDIQLTD GQSSTLWDEA HKLSSSGLFS SSTSISRDNV SDSRSVATKL SGQSVQLSAN HDIALQGSQI DANGNIALLA GDNISLLAGR NQHSESHYRE ERKSGFTTKG YGKSHTIDTV DMASLLNDGS SLHSTQGNIT LAANLAEQNQ ADQGAVLIEG SQVHADHGRV DVSGKHVILS TSEDQQQYSH SHQSTHSNWA FITGLPDGMR DGLDTQQQTI TVNGSTLQGQ NGVGVQATGL VDMTAAHLKA SQGDIDISGA QVAIRSGTNQ QASSSRETHK KSGISWRDLT GLFTPGKGVG YDATLDKKSS KTTVAHATLE GQNIHIQANQ GDLTLAAVQA KATGTSEPSD GSDGPVHSPG QISLKAAGNI NLASVSTESY QRTDEKHKDK AWQETHGEGN YDQQTHYNQL TAGQLDLQAG GSITADMSVR DSAAMLAQSP DMAWLRQLQQ NPKLVGKVDW QQIEEAHQHW DYKHQGLTPA ASAVVALVVA YFTMGAGSAI VNTAAGSTTA AASGAGAVAA GMTQAAVSTM ASQAAVSFIN NGGDLSKTLN DLGSSQSMRQ LATAVVTAGV LSSIGQVTFG EGKNAFRLND VKVSDGLVPN IGKNLIDGVA RATVNSAITG TDLQTNIRTN VVAGILGAAE QQGANWIGNQ TLLGGDFNTN GNVNEFAHEF AHAIVGCAAG VAGASASGSG TSTGQGCSAG ALGAVVGELS AQFYGGTDPN QTIAFAQMMG GIAAAAAGLG SEGVAIAANT GANAAQNNYM AHYDTYEADL KDCQQNPGGV NCGAILSLTE GTNARYLGMT QGGYRVAANM GKDGAASYTV VSPNGEMMVM QPTEWAYFSQ MTSGQQATIF AGSQWQLDLT SATEYGLAGD TTAAMANYAH MLTQPDYWIG MGAALIPTSV LGRTAGMTNQ LATGTNKAMF WSGLGRGGDK IAARAAAQQG KMTLESTLAA RGIKLPKWDP DNPEVVSAWR QASHDFAAGA RGNVRVLQGD VLRGNSVWAE VEFPALKANP TVKSITSIDA ATGKEVLLWS R
|
| |