Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0941 |
Symbol | |
ID | 8534083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1007521 |
End bp | 1011924 |
Gene Length | 4404 bp |
Protein Length | 1467 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646383326 |
Product | YD repeat protein |
Protein accession | YP_003262830 |
Protein GI | 261855547 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.125138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGGA TATTATTTCT ATTTCTTCTG GCATGCTCGT CACCAAGCTT TGCCGGTTGG TATTGTAATG GTATGGACTA CCCATCCACT ACTATCCCGG CTTTTGCTGC TAGCCCTTCT GGTTGTGCTG ATATAGAAGA GCATTTAGAC GAAAATTGGG ATCCAAGATC AGGTATGGCG TATCCATATA CGATCACAAT GGCCTCCCCT GTACAGGCCG TTGTCACGTT CCATAAAACT GTAACTATCG ACGGATACGG TGTCCCCATC GGAACAGTCG TGAATAATCC GCAATCAGTT TTCAAGTACT GGGGATGCAA TAATGACGGC CCCAAATCAA GTATCGACTC ATGCGCAAAG CCACAACCTC CCAAACGATT GGGTACACCT GCGCCTGGAA GCTGTACTGG TGATCCTTGT GACGCTGGCA CAGGTAATGA ATTCCAATAT GATACCGACT ATCAAGGCGC CTCGGACACT TTGTCCTTCA GCCGAGCCTA TAACAGCCTA GATATTCAAG ACCATGGTCT CGGTTACGGT TGGGTGTCCA ATATCGGCGG ACATCTGGCC ATCAGCGGCA GTTTGCTGAC GGCCTATCAG GCCATCAATG GCAGTTCGCT GACGGTCTAT CAGGCCGACG GCAAGGGCCT CCCGTTTACG CTGACCAATG GTGTCTGGCA GGGCGATGCT GATTCCAAGC TGCAATTGAC CCAGGACGCC ACCGGTTACA CCCTGCAACG TCAGGACGGC AGCAGCGATC GCTATGATTT GCACGGCCAT CTACTCAGTG AAACCGACCG GGCTGGCCGC ACCACGACCT ACACTTACGA CAGCGCCAAC CACATTGCCG CCGTCACCGG TCCGTTCGGA CATACGCTTA CTTTTACCTA CAATAGCTAT GGGCGCCTCG TCAGCCTGAC CGATCCAGTC GGACAGGTCA CCAGCTACAG CTATGACACC GCTGGCAACT TGGCTCAGGT CAACTACCCT GACGGTACTG CCAAACAATA TAGCTATGGC GACAGCAGTT TCTCGCATGC CCTGACCGGC GTTGCCTTCG TTGATGCCAA CGGCAATGTC ACGCCGTTCG ACAACTTCGT CTACGACAGC TACGGAAAGG TCACCACCAA TGAATTGTCG GGCGGGCAGC AGCGCTTCGA TCTGAGCTAC GACTCCGACA CCCAGACCAC TGTTACCAAT GCGGCCGGTC GTCAGGATGT GTTGACCTTC CAGGCCCAAC TGGGCGTCAA GAACCTGCTG TCCAATATCG TTCAAGGCGA CGGCAAGGGC CTGACTCAGC AGTTCGATGC CCGCAACAAT GTGATCTCCC GCACCAATGC AGATGGCCAG ACCACGCAAT ACACCTATGA CAGTCAAAAC CGCCTGGCCT CAGAAACTCA GGCTGCCGGT ACGTCGCAGG CCCGTACTGT CAGCTATCAG TACGGCACGG ATGGCTTGGC TCTGCCGGCC GAGATCGACC GACCCAGCGT CTGCGCCGGT TCCAGCCAAC AGACCGTCAT CACCTACGAT GCCCACCACA ACCCGATCCA GATCACCGAG AACGGCTACA CGCCCGCCTG CAGCGCCATC AGTCGCAGCC TGAGCTTGGG TTACAACAGC GCCGGTCAGG TGACCCGAAT CGATGGCCCG CGTACCGACG TTTCTGACGT CACCACGATG AGTTACAACA ACTGCACCAC AGGTGGCGCT TGTGGTCAGT TGCTATCTCT GACAGACGCC CTAGGCCACA TAACAACCTT TAACGCTTAC GACGCAGATG GTCGTCTGCT ACAGAAAACC GATCCCAATG GCCTCGTGAC GAGCTACGCC TATGATCCTC GTGGCCGCCT GAGCCGGATC ACCCAGCAGG CCAGCGGCAG TAGCGCACGG GTCACTACCT TTGCCTACAC CCCGTCCAGC AAACTGGCCA GCACTTCGTT GCCGGACGGT CGCACACTGA CCTACAGCTA CGACGATGCC CAGGAACTGA CCGCCATCAC CGACAACCTC GGCGACAAGG TCAGCTACGC TTATGACAGC CGCGGTAACC GCAGTCAGAC CAGCATTTAC GATCCCGACA GTTCGCTGGT GCGACAGATC AAGAGCGTCT ACGACCTGCG TAACCACCTG GCCAGCAGTA ACGACAGCGG TTCGATCACC CAACAGGTCA CCGATGCCTT GGGTAATCTG GTCCAGCAGA CTGATCCCAA CAACAACGCA CCGACCACCC ACAGCTACGA TCCGCTCAAC CGGCTGATCC AGACCGTCAA CGCCATCGGC GGTACCACCA GTTACGGCTA TGACGTCAAC GCTGAAATCA ATCAAGTCAT TACCCCCAAC GGTGCCACCA CCGGCTACCA GAATGACGAC TTCGGCGACC TGTTGCAGGA AGTCTCGCCA GATCGAGGCA CCACCACCTA CGCCTACGAT GCCGCCGGCA ACCTGATCCA GAAAACCGAT GCCCGTGGCG TCATCGCCAA CTACAGCTAC GATGCCCTCA ACCGGTTGAC GGCCACTCAC TTCAGTGGGA TGAGCCAAGC TAGCGATGCC GACATCACGC TGACCTACGA TCATGGGCAG AACTGCAGCA ACGGCATCGG CCACCTCTGC ACAGCTCAGG ATCAGTCCGG CACCACAATC TACGCCTACG ATGCCTTTGG CAATATCCTC AACCAAATCC ACAGTGCAGG CACAACCACC TCCACCATCA GCTACCGGTA CGACAACGGT AATCGCATTG CGATGATCAT CTATCCCGAT GGCCGAGAAG TCGGTTACAC ACGCGATGCC ATTGGACGAG TCCAGGGGAT CACCACTACT GCAGCCGGCA ACAGCCAAAC CCTGGTCAGT GCCCGCCAAT ACCATGCCGA CGGTAGTTTG ACCGGAGAGA CTTTTGGTAA TGGCTTGGCC GATCAACGCC AATACACCGC TCAGGGCAGG CTTTCCAGCT GGACACTGGG CGGCAGCAAC GTCAATCTCA ACTATGGCTA CAGCTACGAC GCCAATGGCA ATATGACCGG TCAGAGCGGT CCCGATGGCG CTGCCGCCTA CCAGTACGAC CCGCTTGACC GCCTGATTGA CGAATCCTGG GGCACGGGCA ACTACCACAA CGCCTTCAGC TACGACAACA ACGGCAACCG TCTGACCAGC CTGGACAGTG GTGGCAACAC CGTCGACTAC AGTTACGACC TGCAGAGCAA CCGTCTCAAC CAACACGGCA GTCAGAAAAT CACCTTGGAT GCGGCAGGCA ATACCACAAA CGACGGTACC TATCAGTACC TCTACGATGC CGCCGGTCGC CTGAGTGAAG TACTGTTGAG TGGCGTTACC GTGGCCAGCT ACCGCTATGA CTATCGCGGC TTACGGCGGG AGAAGATCAT CGCAGCAGGC ACCACCGAGT TCACCTACGG CCCCAGCGGC CACCTGTTAA GTGAGCAGAA TACGGCAAGT GGTGGCCGAG ACTACGTCTG GAGCGATACC AGCCCCATTG CCCAGATTGA TGTAAACGGC AGCAATCAGG CCAACGACGC GATCTACTAC CTCCATACCG ACGCCATGGG CACACCGAGA TTGGCGACCA ATGCCAATCA ACAGACTGTC TGGCGATGGA ACAGGGATGC GTTTGGGGAT CGGCAGGTCA ATGCGAGCAG TGCCAGTATC GAGATGAACC TGCGGTATCC GGGGCAGTAT TACGACACCG AGACTGGGTT ATTCTACAAC TGGAATCGAT ACTATGACCC GAGTACTGGG CGGTATGCCA CCAGTGATCC GATTGGGTTG AGTGGTGGAG TGAATACCTT TGGGTATGTT TCAGCTAACC CCTTGGCTCT TATTGATCCA TGGGGGTTGT TTGGCAGTGC GCAGGTCAGC CCGATACCGC CTGGTTATAA CTCGGGCGAT GTCAGAGGTG CGTATGACTC ATATGGCAAT AGCTCGGTTT ATGAACCAGG CTATTACGAC CCTAACAGAG CAGCTGCGAT CATTATGGCT CTGCTTGCTG CCGGGATGGG GCCTGCTGCC GATGAGTTCG CTGCAATACT CGCTCACGCG GAAGAGGCCA GCGCGATTGG AGGTGAGTGT GAAGCTGCCG CTTCTTCGAA GCTGCCGTCG TTAGACGATT TATCACGTGC TGCCAGTGCT TCAGATAGAA ATGGCTTTTC AAAAGCGGGG CGCTCTTTGC AGAAACACGG AAGCAGGCCG GGGTCCAAGT GGGGGCAAGA GGATGTGAAC GTGAATAATC CCTCCGAGGC AAATTCGAGA GCACAAGGCC TCGTTGATGA TATTTTGAAC TCACCAGGGA CTAATGTGGT TCAAAACTCT CGTGGCGGTG TTGATGTGAT CTCTCAAGAT GGTCGGGTTG TTCGCTATAA CCGGGATGGT TCAATGCAAG GTTTTCGGGA GTGA
|
Protein sequence | MRGILFLFLL ACSSPSFAGW YCNGMDYPST TIPAFAASPS GCADIEEHLD ENWDPRSGMA YPYTITMASP VQAVVTFHKT VTIDGYGVPI GTVVNNPQSV FKYWGCNNDG PKSSIDSCAK PQPPKRLGTP APGSCTGDPC DAGTGNEFQY DTDYQGASDT LSFSRAYNSL DIQDHGLGYG WVSNIGGHLA ISGSLLTAYQ AINGSSLTVY QADGKGLPFT LTNGVWQGDA DSKLQLTQDA TGYTLQRQDG SSDRYDLHGH LLSETDRAGR TTTYTYDSAN HIAAVTGPFG HTLTFTYNSY GRLVSLTDPV GQVTSYSYDT AGNLAQVNYP DGTAKQYSYG DSSFSHALTG VAFVDANGNV TPFDNFVYDS YGKVTTNELS GGQQRFDLSY DSDTQTTVTN AAGRQDVLTF QAQLGVKNLL SNIVQGDGKG LTQQFDARNN VISRTNADGQ TTQYTYDSQN RLASETQAAG TSQARTVSYQ YGTDGLALPA EIDRPSVCAG SSQQTVITYD AHHNPIQITE NGYTPACSAI SRSLSLGYNS AGQVTRIDGP RTDVSDVTTM SYNNCTTGGA CGQLLSLTDA LGHITTFNAY DADGRLLQKT DPNGLVTSYA YDPRGRLSRI TQQASGSSAR VTTFAYTPSS KLASTSLPDG RTLTYSYDDA QELTAITDNL GDKVSYAYDS RGNRSQTSIY DPDSSLVRQI KSVYDLRNHL ASSNDSGSIT QQVTDALGNL VQQTDPNNNA PTTHSYDPLN RLIQTVNAIG GTTSYGYDVN AEINQVITPN GATTGYQNDD FGDLLQEVSP DRGTTTYAYD AAGNLIQKTD ARGVIANYSY DALNRLTATH FSGMSQASDA DITLTYDHGQ NCSNGIGHLC TAQDQSGTTI YAYDAFGNIL NQIHSAGTTT STISYRYDNG NRIAMIIYPD GREVGYTRDA IGRVQGITTT AAGNSQTLVS ARQYHADGSL TGETFGNGLA DQRQYTAQGR LSSWTLGGSN VNLNYGYSYD ANGNMTGQSG PDGAAAYQYD PLDRLIDESW GTGNYHNAFS YDNNGNRLTS LDSGGNTVDY SYDLQSNRLN QHGSQKITLD AAGNTTNDGT YQYLYDAAGR LSEVLLSGVT VASYRYDYRG LRREKIIAAG TTEFTYGPSG HLLSEQNTAS GGRDYVWSDT SPIAQIDVNG SNQANDAIYY LHTDAMGTPR LATNANQQTV WRWNRDAFGD RQVNASSASI EMNLRYPGQY YDTETGLFYN WNRYYDPSTG RYATSDPIGL SGGVNTFGYV SANPLALIDP WGLFGSAQVS PIPPGYNSGD VRGAYDSYGN SSVYEPGYYD PNRAAAIIMA LLAAGMGPAA DEFAAILAHA EEASAIGGEC EAAASSKLPS LDDLSRAASA SDRNGFSKAG RSLQKHGSRP GSKWGQEDVN VNNPSEANSR AQGLVDDILN SPGTNVVQNS RGGVDVISQD GRVVRYNRDG SMQGFRE
|
| |