Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2107 |
Symbol | |
ID | 8535266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2254933 |
End bp | 2259978 |
Gene Length | 5046 bp |
Protein Length | 1681 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384484 |
Product | YD repeat protein |
Protein accession | YP_003263971 |
Protein GI | 261856688 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGAT TCAATCGCCG TTTCCTGGCT GCTTGTACCG CTGCATTGAT TGCAATGGGT TGGAGTGGAT TGAGTCACGC AACTCATGTT TCTGCACTCA CCGCACCTAT TGCGTGGACA AAAGGGCAGC AGGTGTTACC GACACGAGAT TTCGGTGTGT TTCAAACGCC GCTGGTGCCA TTCGGCGCGT CGAGCGCAGC AGAAACCCAT GACCTGCGGG TTGCTATCGC AAGTTACCGG GCCGCGGCTG ATGCTGCAAA TACGAAGGCT CTCGACCATT TCCTGCATCA ATATCCAAAT TCAGTCTGGC GCATCGCTTT ATTGACCAAT GAAGGCTTGG CTTATGAGCA GGCGGGCCTG TTCTCGCAGG CGATAACCCG GCTGGATGCG GCATGGCAAT TAAGAGCAGG GGCGAAAACC GAACCACAAC GGGCATTGAT TGAGCAAAAC TATGGCGCTT TGCTCCACTT GCATACGGTG TTCGGTCATG AGAAAGCTGT CCAGGTGTTG CTGCAAGAAG GGAAAGGGCT CGTTCTTTCC GGCGCGGCGC AGGCGAACAA AACACAAGCT GAGGAGGGCC TGTGGCGGAT GCAGCATGAG CCTGGCAAGG CTCGGCTTTG TGGTCTTGTC GCTCTGGATC AGTTGTTGGC GATTGAGGGT CATGATCAAT CCGTTGGACG ATTCAAGCGG GTACGCGCCG GTGAGGCAGG GCTTAGTCTG GCGCGACTGG ATACGCTGGC GAATCAGGCG GGTCTGCCCA GCCGCGTGGT ATACCGTCAT GGCGAAGAAC CGATTCCGGT TCCCGCCATA GCGCACTGGA AAGTGGGGCA TTACGCCACC ATCGTCGGCG AAGCTGGTGG GCGTTATCAC ATCAAGGATG CCGCTCAGGG GCGCGATTAC TGGATGACGC CCGAAGCAGT TCGGGCCGAA TCCAGCGGCT ACTTTCTGAT TCCGACCAAG GCATCGGCGC AACCGCAGGC TAATCAGCCG ATTCTGGCGC AAACCATGGG CAGTCCATGG CGTCGCGTTG CGCTGAGTGA AGCTGGCCGC ATCTTTGGTG CGGGTATTAC GCCGGGCAAC AACCCGGACG ACACCTCCAA CGATGACCCT GATGTGGCTG GTTGTGGGGC TTGCGGCTCA TCCGGCATGG CGCAATACAG CGTCAAGGCG ATGTTGGTGA GTTTGAGCTT GCATGATACG CCGGTGGGTT ATGCACCGCC CAAAGGTCCG GCTGTACCGT TCACCATTGT TTATAGCCAA CGCGAAGCGA ATCAACCGGC CAATTTTACC TTTGGCAACC TGGGGCAGAA ATGGATCAGT AACTGGTTTG CCTACGTGCA AGACGATCCG ACCTCACCAG GCAACAGCGT GACCATTGCC TTGCGCGGCG GCGGTACCCG ACACTATGCG GGCTTCAACG CCACGACAGG TGCTTTTTCA CCTGAAGAAC GCACGGCGGC GCAACTGGTG AAAGTATCGG ATTCGCCCGT CACCTATGAG CGCCGGATGC CCGATGGCAG CAAGGAAGTG TATGGTGCGT CGGACAACAG TACCTATTTC CCTCGGCGAA TCTTCTTGAC CCAGGTCGTC GATCCAGCCG GCAATGCCGT GACTCTGGAT TATGACAGCC AAATGCGGTT AACCACACTC ACCGATGCGC TGGGCCAGAA GACCACCCTG ACCTACAGCA ATGCCCAGTA TCCCCTGCAA GTGACAGAAA TTACCGATCC TTTTGGTCGC GCGGCCAGCA TTGCCTACGA CAGCAGTGGA CGGTTGATCG ACATCACCGA TGTGTTGGGT ATGCACTCGC AGTTCACCTA CGACGGCGGC ACCTTCATCA CCGCCATGAC CACCCCCTAT GGAACCACGC AATTTGCCTC AGGCGATAGC GGTACAACGC GTTGGCTGGA AATTACCGAT CCCCAAGGGC GCAAAGAGCG GGTGGAATTC CGGCACAACG CACCGGGCAT CCCGTTCAGC GACTCGCCCG TGCCCCAGGG CATCAACACA TTCAACGCGT ACATCAACTC CCGGGATACC TTCTTCTGGG ATAAAACTGC CATGGAGCAC GCCCCGGGGG ACTATACCCA AGCACATATC TACCATTGGT TGCATAATGC TGCCCAACCG TACTACGGCT TGACTGCCGG CGTATTGGAA AGTGTCAAAT CTCCGTTGGA ACACCGAATC TGGTTCAGCT ATCCAAATCA ATCACCCGGA GTGACGGGAG GTTTCGACAA ACCCTCAGCC ATTGCCCGTG TCCTGGCCGA TGGCAGCACC CAACTGACGC GCATCAGCTA TAACCCCAAG GGTAATGTCA CTCAAACGGT CGATCCGCTT GGCCGCACCG TGAATCTTAT CTATGCGACC AATGGCGTTG ATGTGGTCGA GGTTACGCGT AACACGTTAG CGGGTGCTGA CATTCTGGCG CGTCTCACCT ACAACGCACA GCACGAACCC TTGACCTACA CCGACGCCGC AGGGCAAACC ACTACCTACG CCTACAACGG AGCCGGTCAA CGAACCTCCA TGACCGACCC TCTGGGGCAA GTGACGACAT ATGTCTACGA TGCCAACGGA TACCTGCAAA AGGTTGTCAA CGCCGATGGC AAAACCCGAA ATAGCTATAC GTACGACGGT TTTGGCCGCG TAGCGAGCAG CACCGACTCG GAGGGCCACA CACTGCGTTA TAGCTACGAT GCGCTCAACC GTTTGACCAC CGTTACCTAT CCCGACGGCA CGAGTCGAAC CGTCACCTGG GGCAAGCTCG ATCCGGTCGC CACGACCGAT CGGGAAGGCC GCACGACGAC CTACGCCTAC GACAGCGTGC GTGACCTGAT CAGCAAAACC GACCCAATGA ATCAAGTGAC ACAGTATGGT TATTACGCCA ATGGCAAACT GGAAAGCCTG ACTGATCCAA ACGGCAATAC CAGCACCTGG GCCCGCGACA TCGAAGGGCG GGTAACCGGC AAAACCTATC CCGATGGCAG CCAGACCGGC TACACCTACG ACATCACCGG TCGCGTGATT GAACGCAGTG ATGCCCTGGG GCAAAACACG GCCTACAGCT ACGCACTGGA TGACCGACTC ATCGGCATCA GCTACTCCAA TGCCCTGCAA CCCACCGCCG CTGTTCAACT CGGCTACGAC GCCAGCTATC CGCGCCTGAC TACCCGAACC GACGGGCAGG GTACAACCAC CTATGGTTAT TATCCGGCTG GCGTCCTGGG TGCCGGGCAA TTGGCGAGCG AACAGGGGCA AAACAGCCAC GATAGCCTGC AATACACCTA CAACGCCCTG GGTTTGCTGG CGAGCCAGAC GGTAGATGGT GCAACCGAGC GCTACCAATA CGACGCCTTG TCACGACAGA CGGGCGACAG CAATGCGCTC GGTGATTTCA CCACAGCCTA TCTTGGTGAA ACCAGCCAGC CTGTGAGCCA GACCATCAGC CGCAATGGTC AACCCGTGCC CTATCAAATT CAGTATCAGT ATGAGAACAA CCAGAGCGAT CGCCGCCTGA AAGCGATACT GAACGACATC ATTAACCAGG GGCGTCTGCA ACCGGTGGCC GGTTTCACCT TTACCACCAG CCCTGAAAAT CTCATTCTGA GCCGGGCAGA AAACCAGAAT GAGGATACTG ACGCGCACCA CAAGCACGAC TTCGGTCGCC ACTGGGGGCT GCCGGACTGG ATGTTCGGGT GGGCAGACCG GCACGATACC GACTGCCGGG ATCACGGCCA TGGCTTCGGT TTTGGTCACG ACCGGCACGG CTGCACATCC GATCAGGGAG GTCAGCAGGC ACTCCAGTAT CAGTACGACG ACGCCCTGCG GCTGATCGCC GCCGAAGACG GTAGCCCAAG AGATAATCGC GGCCACAAAG GCGGCAAGAG CGGCAGCAGG AACGGTAACA CGGGCAGCAA TGCTGAAAGC TACCAATACG ATGCAGCCAG CAACCTGACC GACATTACCA TCGGCAAAAC CAGCATCGCC CTGACCATCA ATGCGCTGAA CCAGATTGTC ACCGCAGGCA GTACCGCCTA TCGCTACGAT GCCAATGGCA ACCTGCTTGA CGATGGCATA AACACCTACA CCTGGGATGC CGCCGACCGG CTGGTAACCA TCACCAACCA GCAAACCGGC CACACCAGCC AGTTTGCCTA CGATGGGCTA TCCCGCCGGA TCAGTGTTAC CGAGACCGAC AGCGGCGGTA CGCCGGAGAC TACCCACTAT CTGTGGTGCG GTACCCGCAT CTGCGAAGCG CGGGACAGCA GCGACACCGT ACTGGCCCGC TACTACGCCC AGGGTGAACG GCATGGCAGC ACGATTGCCT ACTATGCGCA GGATCAGGTC GGCAGCGTGG TCGCCACGGT TGATCCACAG GGGCAGATCA CCTCAAGGCT GAAATACGAC AGCTACGGCA ATATCATCCA GAGCAGCGGC ACCTTGCCGG ACTACCGGTA TGCCGAGCTG TACGCCCATC CGCAATCGAG CCTGTATCTG GCAACCTATC GGGCGTATGA TCCGAAGATT GGACGCTGGC TGTCGCGGGA TCCGATTCGG GAAACTGGGG GTATAAATTT ATATGCTTAT GTAACCAGCA ATCCGGTTAT CAACATTGAC CCTAAAGGCC TAGATATCTG GATTGAGGGC CCGTCCGGTC CTGAGCCAAG TTTTCATCAA AGCGTGAATG TAGGAAATAT GAACGGATAC TATGACTCCT ACAGCTTTGG TATGGACGGT CAGGGCATAG AGGGTAAGGT CTACCGCGAC CATGATCCTG GGGGGCAGAT AGAAAACTAC AAGAGAACAA CATCGGAACA GGATAGGATT TTCAAGTCAG AAATGGACAA GAAATTAGGG AACACCGGAA TCTATGGTTG GGACGATATT TGTCGCAGCT GGAGTCAAAG GCAATTTAAA AATGCACCTG GAATCCCAAG CCAGTCTCCA GTCAGAAAAG TTTCGCCACA CTGGAATGTG AGTCCATCTT CATCTAGGTC GACAACAGGG CCTAGTAGTT CAAGTGGCAC CTGGACTTCA AAATGA
|
Protein sequence | MNGFNRRFLA ACTAALIAMG WSGLSHATHV SALTAPIAWT KGQQVLPTRD FGVFQTPLVP FGASSAAETH DLRVAIASYR AAADAANTKA LDHFLHQYPN SVWRIALLTN EGLAYEQAGL FSQAITRLDA AWQLRAGAKT EPQRALIEQN YGALLHLHTV FGHEKAVQVL LQEGKGLVLS GAAQANKTQA EEGLWRMQHE PGKARLCGLV ALDQLLAIEG HDQSVGRFKR VRAGEAGLSL ARLDTLANQA GLPSRVVYRH GEEPIPVPAI AHWKVGHYAT IVGEAGGRYH IKDAAQGRDY WMTPEAVRAE SSGYFLIPTK ASAQPQANQP ILAQTMGSPW RRVALSEAGR IFGAGITPGN NPDDTSNDDP DVAGCGACGS SGMAQYSVKA MLVSLSLHDT PVGYAPPKGP AVPFTIVYSQ REANQPANFT FGNLGQKWIS NWFAYVQDDP TSPGNSVTIA LRGGGTRHYA GFNATTGAFS PEERTAAQLV KVSDSPVTYE RRMPDGSKEV YGASDNSTYF PRRIFLTQVV DPAGNAVTLD YDSQMRLTTL TDALGQKTTL TYSNAQYPLQ VTEITDPFGR AASIAYDSSG RLIDITDVLG MHSQFTYDGG TFITAMTTPY GTTQFASGDS GTTRWLEITD PQGRKERVEF RHNAPGIPFS DSPVPQGINT FNAYINSRDT FFWDKTAMEH APGDYTQAHI YHWLHNAAQP YYGLTAGVLE SVKSPLEHRI WFSYPNQSPG VTGGFDKPSA IARVLADGST QLTRISYNPK GNVTQTVDPL GRTVNLIYAT NGVDVVEVTR NTLAGADILA RLTYNAQHEP LTYTDAAGQT TTYAYNGAGQ RTSMTDPLGQ VTTYVYDANG YLQKVVNADG KTRNSYTYDG FGRVASSTDS EGHTLRYSYD ALNRLTTVTY PDGTSRTVTW GKLDPVATTD REGRTTTYAY DSVRDLISKT DPMNQVTQYG YYANGKLESL TDPNGNTSTW ARDIEGRVTG KTYPDGSQTG YTYDITGRVI ERSDALGQNT AYSYALDDRL IGISYSNALQ PTAAVQLGYD ASYPRLTTRT DGQGTTTYGY YPAGVLGAGQ LASEQGQNSH DSLQYTYNAL GLLASQTVDG ATERYQYDAL SRQTGDSNAL GDFTTAYLGE TSQPVSQTIS RNGQPVPYQI QYQYENNQSD RRLKAILNDI INQGRLQPVA GFTFTTSPEN LILSRAENQN EDTDAHHKHD FGRHWGLPDW MFGWADRHDT DCRDHGHGFG FGHDRHGCTS DQGGQQALQY QYDDALRLIA AEDGSPRDNR GHKGGKSGSR NGNTGSNAES YQYDAASNLT DITIGKTSIA LTINALNQIV TAGSTAYRYD ANGNLLDDGI NTYTWDAADR LVTITNQQTG HTSQFAYDGL SRRISVTETD SGGTPETTHY LWCGTRICEA RDSSDTVLAR YYAQGERHGS TIAYYAQDQV GSVVATVDPQ GQITSRLKYD SYGNIIQSSG TLPDYRYAEL YAHPQSSLYL ATYRAYDPKI GRWLSRDPIR ETGGINLYAY VTSNPVINID PKGLDIWIEG PSGPEPSFHQ SVNVGNMNGY YDSYSFGMDG QGIEGKVYRD HDPGGQIENY KRTTSEQDRI FKSEMDKKLG NTGIYGWDDI CRSWSQRQFK NAPGIPSQSP VRKVSPHWNV SPSSSRSTTG PSSSSGTWTS K
|
| |