Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0019 |
Symbol | |
ID | 8533132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 20656 |
End bp | 24519 |
Gene Length | 3864 bp |
Protein Length | 1287 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646382398 |
Product | putative type 4 fimbrial biogenesis protein PilY1 |
Protein accession | YP_003261932 |
Protein GI | 261854649 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.582284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAAT CAAATAAAAA CACGCAAATT AAACGCAGCA AGCGGCACAT GAGATACGCA AGGGGAAATA CGATGAACTT TTTGCACACA ACAATGCAGC AAGTGGGCAG GGTCGCAGTG CGGTTTTTCG CGTTTGCATT GCTCACAGCC ATGCCGATGA TAACCGCCCA AGCAGCAGTT GACATTGACC AAAGCCCTTT GATTCTGCAA AAACCCTTGC CGCCCAATAT TGTGCTGATG CTGGACGATT CTGGTTCGAT GACGTGGGAT TACATGCCGG ATTTTGGCTA TTTAAAAAAC AATAGTAATA ATGATGCGTT AATAAATTCG GCTAATAATT CGGTTTATTA CGATCCAACC ATAACTTACT TGCCGCCATT AAAAGCTGAT AATACAAGCT ACACCAATCA AACTGATTTT ACCAACACGC CCGTTGATGG CTTTGGAACT ACATCAAATG ATATGATTGA TTTGACCAAT TATAATGGAT ATGACGATAC TGGGAATATT AATTATTCAA AATCCGTGCT AACAAATTCT TCTTATACTA AAACCGGAAT AAGTAGAATT GACTGCTACA TTTTATATTA TAACGATCAA ACTGCATACG ATTATAGCTA CAGTAGAGCG AGAAAAAGCC GGCCATCTAG CTGTACAATT TACACTCAGG TTAGCTCTGG GTATTTTCAA TATTCTACAG GCCCTGCGGC GGGCCCCTAC ACCGTGCATT ATGTGGCAAA CACCAGTTGC GGCACGCAAT CTAATTGCGT AGTTGCTTCC GATAAATCTG GAGTATCGGC ACCTATTGGT GTTGCTGCTG GTAATAATAT CGCCAACTGG TTCGCCTATT ACCATACTCG TATATTAACG ACTAAAACTG GGTTGACATT AGCCTTTTCC GGCCTTGACA AAACCTATCG TTTTGGCTTT GCCTCGATTA ACGGGAGAAA CACGGCGGAT ATCCCATCAC CCCAATACTC TTTCTCAACA AGCAATAATT CAGATAACAA ATTAGCAGAA GTTCAGCCCT TTGGTGATGG TAGTAGCGGC ACTCAAAAAG CCAAGTTCTG GACATGGATA ACCAACATCT CCCCCAATGA TGGCACACCT TTACGCGGCG CTCTGCAGGC TGTGGGTGAA TATTACAAAA CACAGCAGCC GTGGGAGACT TCGAGTTCCG ATACATCAGA ATACGCTTGC CGCCCCAGTT ACACCATTTT AACGACAGAT GGTTTCTGGA ATGGCGATAC CCCAAGCGGG ATTGGCAATG CTGACGGTCA AAATGGGCCG AAAATCACCA CGCCATCAAC CTATCAATAT CTTAAAACCG CTCCGTATCA GGATGGTTAT AGCGATACGC TGGCCGATGT CGCAATGAAG TATTGGGAAA CCGATTTGCG CACATCGGTT GCCAACGAGG TGCCCACCAC CCCGAGTGAC CCCGCTTTCT GGCAGCATAT GACCACCTTT ACCATTGGCT TGGGCTGGGA CCCGACCAAC TTGATACGTT CAACATCAGG TGATGCTAAT TTGACCGTGC CACAAATATT GGCCTGGGCG CGCAGCGGCA ACCCACCCGC TGGCTCTTCG CTTACAAACT CCAGCAATAT CTGGCCAAAA CCTACGTCCA ACAGTATTAA TAATATTGCA GATCTGGCGC ATGCGGCCGT TAATGGCCAC GGCGAATTCT ACTCGGTTAA ATCACCTGAT GATTTGGTTA ATGGCCTGCA ATCTGCCTTG AAAAAAATTG GCGAAAGCCC CGGCGCGGGC AACGCCGTAA CCTTGTCGGA TACGGCGCTA CCGACAGATT CTACGGCGGC CGACGCGCTG TATCGGTTTC GGGGTACGTT TTATACGGGC CAATGGTCTG GAACGCTCAC AGCGGAAAAT TACACGACCA CGACCACGCC CCCCAGTTAT GAGTCGTTTT GGTCCACAAA AGATTTGACC CCCTCATTTT CGACCGTTGG TACGGGCAGC AGCGCGGAGC GGCTAAGTAA CCGCAACGTG TGGACATCCA CGCTCGGCAA CGGCAACAAA ACCTCGTCGG TCGCGTTCCG TGTGGCAACT GATTTAAGCG CAACTCAGCA AACTGACTTG GCGGCTACCA TTGGCAACAG TAGTGTGACC GCTCAAACCA TGGTGAATTA CTTGTTGGGT GATAGCACAT ACGCCCAAGG CAATCCGGGC GGCACGCTGC GTAATCGCGA TAGTTTTTTG GGTGATATTG TTTCTTCGAC ACCGGTATTG ATTGCCGCGC CGCAAGCGGA TTTGTATGCA GGCACTACCT TTACGGGCGC CGATACGTAT TCTACTTTTG TGAACAAAGA GGCCACCCGT GCGCCCATTG TTTATGTGGC CGCCAATGAT GGCATGCTGC ATGCCTTTCG GGTAACGGCG GGCGCGGGGT ATGAGCGCGA TGGCAGCACG GTTAGCCCAG TGGCTGATCA GGCCAAGGGC ACGGAGGTTT ATGCCTACAT GCCTTCGGCG GTGCTCACGC AAACAGGTGA TGCCAGCATT ACCAACCTGG CTAACCCGAA ATATGGCGAT GTGGACCCGG TAAATGGCAC ACAGGCTGTG CCGCATCAAT ATTACAACGA TGGCCGAATT ACCACACAGA ATGTGTATTT TGACAAAGCA TGGCACACCG TTTTGGTGGG CACCACGGGG CGTGGGCCTG CCAAAGCAAT TTACGCGCTG GATATTACCG ATCCTTCGGT GTTGATGAAC CCGGCTACTG CTGATCAGGC GCTATTGTGG GAGCGCTCTG CCGGCGATGG TAAAACAGGC AGCAGTTATA TTGGTGAAAT GGTTGGCACC CCAGTCATTG CGCAAGTTGA TCAAGGCGGT CAACCCTCCT GGGCTGTATT TGTGGGCAAT GGTTATAACA GTGCCGAAGG CAAGCCCGCC TTGCTGCAGT TCGATCTGCA AACGGGTGAT TTGAGCGTTC ACACAACCAC CGGTTCTGTT GCTGCGGATG GCGGCCTTGC CGAACCGGGT TTGATGCAAG GGGATAAAAC AACTGGCATA AGCACTTATG CGTTCGCAGG CGATTTGCAA GGCCATTTGT GGAAGTTTAA TTTAGATTCT GCCAGTAGCA CGGGTTCGGT AGCGTTTAAT GCCGTGGACG ATAAGGGCAA CGCACAGCCG ATTACCTCAC TTGTTACCTT GGCTTACGAT GGCGTCACAA ATAGCACCTT TGCCTTGTTT GGTACCGGCA AATATTTGGC AAGCGCCGAT ACAAAAGACG ATCAAGTGCA AACCTGGTAT GGCGTGCGCG TGGGCATGGG GGCTGATTTG GCAGGCGTGG CATCTGCGAC AACGCCTGTG GCCGATAGCA GCACCGCCCG CAGTGATTTG ACCGAGCGAT TCGCGTTTGA TTTTGCATCG GGTGATCGTG CCACCAGTGC CCAAACCTCA GATACGGATA TGAATAATAA AGCAGGGTGG TTTATGGACT TGCCGCAAAC CGGCGAGCGC ATCGTGAACC GCATCCAGTT AATTAGTGGT AAAGCGGTGG CTACCACGCT TATTCCTAAG GTGAATGACC CCTGTAACAC CGTGCCTGCG GGCGCTGTGA TGGGTGTCGA TCCGTTCACC GGCGCTAATC AGTTGGTGAG CACAGGTAAC GGGTATAATT TGGGCACCAA AATAATTACG GTTGATGGTA AACCGCAGCA AGTTGCGATT AATGGCAAAG TGTTTGACGC AGGCCCAGCC GCTGGTGTCA CTGCGGTACG AAATGCTGAT GGTACGATTT CAATTACCTT CAATACCTTG GGCGGCGGCT TGCAAAGCTT GGGCCCATTA AATTTGGGTG GTAATCAGGC GAGTCGGTTA TCGTGGCGTG AGCTTACAAA CTGA
|
Protein sequence | MNQSNKNTQI KRSKRHMRYA RGNTMNFLHT TMQQVGRVAV RFFAFALLTA MPMITAQAAV DIDQSPLILQ KPLPPNIVLM LDDSGSMTWD YMPDFGYLKN NSNNDALINS ANNSVYYDPT ITYLPPLKAD NTSYTNQTDF TNTPVDGFGT TSNDMIDLTN YNGYDDTGNI NYSKSVLTNS SYTKTGISRI DCYILYYNDQ TAYDYSYSRA RKSRPSSCTI YTQVSSGYFQ YSTGPAAGPY TVHYVANTSC GTQSNCVVAS DKSGVSAPIG VAAGNNIANW FAYYHTRILT TKTGLTLAFS GLDKTYRFGF ASINGRNTAD IPSPQYSFST SNNSDNKLAE VQPFGDGSSG TQKAKFWTWI TNISPNDGTP LRGALQAVGE YYKTQQPWET SSSDTSEYAC RPSYTILTTD GFWNGDTPSG IGNADGQNGP KITTPSTYQY LKTAPYQDGY SDTLADVAMK YWETDLRTSV ANEVPTTPSD PAFWQHMTTF TIGLGWDPTN LIRSTSGDAN LTVPQILAWA RSGNPPAGSS LTNSSNIWPK PTSNSINNIA DLAHAAVNGH GEFYSVKSPD DLVNGLQSAL KKIGESPGAG NAVTLSDTAL PTDSTAADAL YRFRGTFYTG QWSGTLTAEN YTTTTTPPSY ESFWSTKDLT PSFSTVGTGS SAERLSNRNV WTSTLGNGNK TSSVAFRVAT DLSATQQTDL AATIGNSSVT AQTMVNYLLG DSTYAQGNPG GTLRNRDSFL GDIVSSTPVL IAAPQADLYA GTTFTGADTY STFVNKEATR APIVYVAAND GMLHAFRVTA GAGYERDGST VSPVADQAKG TEVYAYMPSA VLTQTGDASI TNLANPKYGD VDPVNGTQAV PHQYYNDGRI TTQNVYFDKA WHTVLVGTTG RGPAKAIYAL DITDPSVLMN PATADQALLW ERSAGDGKTG SSYIGEMVGT PVIAQVDQGG QPSWAVFVGN GYNSAEGKPA LLQFDLQTGD LSVHTTTGSV AADGGLAEPG LMQGDKTTGI STYAFAGDLQ GHLWKFNLDS ASSTGSVAFN AVDDKGNAQP ITSLVTLAYD GVTNSTFALF GTGKYLASAD TKDDQVQTWY GVRVGMGADL AGVASATTPV ADSSTARSDL TERFAFDFAS GDRATSAQTS DTDMNNKAGW FMDLPQTGER IVNRIQLISG KAVATTLIPK VNDPCNTVPA GAVMGVDPFT GANQLVSTGN GYNLGTKIIT VDGKPQQVAI NGKVFDAGPA AGVTAVRNAD GTISITFNTL GGGLQSLGPL NLGGNQASRL SWRELTN
|
| |