Gene Hneap_0941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0941 
Symbol 
ID8534083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1007521 
End bp1011924 
Gene Length4404 bp 
Protein Length1467 aa 
Translation table11 
GC content56% 
IMG OID646383326 
ProductYD repeat protein 
Protein accessionYP_003262830 
Protein GI261855547 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.125138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGGGA TATTATTTCT ATTTCTTCTG GCATGCTCGT CACCAAGCTT TGCCGGTTGG 
TATTGTAATG GTATGGACTA CCCATCCACT ACTATCCCGG CTTTTGCTGC TAGCCCTTCT
GGTTGTGCTG ATATAGAAGA GCATTTAGAC GAAAATTGGG ATCCAAGATC AGGTATGGCG
TATCCATATA CGATCACAAT GGCCTCCCCT GTACAGGCCG TTGTCACGTT CCATAAAACT
GTAACTATCG ACGGATACGG TGTCCCCATC GGAACAGTCG TGAATAATCC GCAATCAGTT
TTCAAGTACT GGGGATGCAA TAATGACGGC CCCAAATCAA GTATCGACTC ATGCGCAAAG
CCACAACCTC CCAAACGATT GGGTACACCT GCGCCTGGAA GCTGTACTGG TGATCCTTGT
GACGCTGGCA CAGGTAATGA ATTCCAATAT GATACCGACT ATCAAGGCGC CTCGGACACT
TTGTCCTTCA GCCGAGCCTA TAACAGCCTA GATATTCAAG ACCATGGTCT CGGTTACGGT
TGGGTGTCCA ATATCGGCGG ACATCTGGCC ATCAGCGGCA GTTTGCTGAC GGCCTATCAG
GCCATCAATG GCAGTTCGCT GACGGTCTAT CAGGCCGACG GCAAGGGCCT CCCGTTTACG
CTGACCAATG GTGTCTGGCA GGGCGATGCT GATTCCAAGC TGCAATTGAC CCAGGACGCC
ACCGGTTACA CCCTGCAACG TCAGGACGGC AGCAGCGATC GCTATGATTT GCACGGCCAT
CTACTCAGTG AAACCGACCG GGCTGGCCGC ACCACGACCT ACACTTACGA CAGCGCCAAC
CACATTGCCG CCGTCACCGG TCCGTTCGGA CATACGCTTA CTTTTACCTA CAATAGCTAT
GGGCGCCTCG TCAGCCTGAC CGATCCAGTC GGACAGGTCA CCAGCTACAG CTATGACACC
GCTGGCAACT TGGCTCAGGT CAACTACCCT GACGGTACTG CCAAACAATA TAGCTATGGC
GACAGCAGTT TCTCGCATGC CCTGACCGGC GTTGCCTTCG TTGATGCCAA CGGCAATGTC
ACGCCGTTCG ACAACTTCGT CTACGACAGC TACGGAAAGG TCACCACCAA TGAATTGTCG
GGCGGGCAGC AGCGCTTCGA TCTGAGCTAC GACTCCGACA CCCAGACCAC TGTTACCAAT
GCGGCCGGTC GTCAGGATGT GTTGACCTTC CAGGCCCAAC TGGGCGTCAA GAACCTGCTG
TCCAATATCG TTCAAGGCGA CGGCAAGGGC CTGACTCAGC AGTTCGATGC CCGCAACAAT
GTGATCTCCC GCACCAATGC AGATGGCCAG ACCACGCAAT ACACCTATGA CAGTCAAAAC
CGCCTGGCCT CAGAAACTCA GGCTGCCGGT ACGTCGCAGG CCCGTACTGT CAGCTATCAG
TACGGCACGG ATGGCTTGGC TCTGCCGGCC GAGATCGACC GACCCAGCGT CTGCGCCGGT
TCCAGCCAAC AGACCGTCAT CACCTACGAT GCCCACCACA ACCCGATCCA GATCACCGAG
AACGGCTACA CGCCCGCCTG CAGCGCCATC AGTCGCAGCC TGAGCTTGGG TTACAACAGC
GCCGGTCAGG TGACCCGAAT CGATGGCCCG CGTACCGACG TTTCTGACGT CACCACGATG
AGTTACAACA ACTGCACCAC AGGTGGCGCT TGTGGTCAGT TGCTATCTCT GACAGACGCC
CTAGGCCACA TAACAACCTT TAACGCTTAC GACGCAGATG GTCGTCTGCT ACAGAAAACC
GATCCCAATG GCCTCGTGAC GAGCTACGCC TATGATCCTC GTGGCCGCCT GAGCCGGATC
ACCCAGCAGG CCAGCGGCAG TAGCGCACGG GTCACTACCT TTGCCTACAC CCCGTCCAGC
AAACTGGCCA GCACTTCGTT GCCGGACGGT CGCACACTGA CCTACAGCTA CGACGATGCC
CAGGAACTGA CCGCCATCAC CGACAACCTC GGCGACAAGG TCAGCTACGC TTATGACAGC
CGCGGTAACC GCAGTCAGAC CAGCATTTAC GATCCCGACA GTTCGCTGGT GCGACAGATC
AAGAGCGTCT ACGACCTGCG TAACCACCTG GCCAGCAGTA ACGACAGCGG TTCGATCACC
CAACAGGTCA CCGATGCCTT GGGTAATCTG GTCCAGCAGA CTGATCCCAA CAACAACGCA
CCGACCACCC ACAGCTACGA TCCGCTCAAC CGGCTGATCC AGACCGTCAA CGCCATCGGC
GGTACCACCA GTTACGGCTA TGACGTCAAC GCTGAAATCA ATCAAGTCAT TACCCCCAAC
GGTGCCACCA CCGGCTACCA GAATGACGAC TTCGGCGACC TGTTGCAGGA AGTCTCGCCA
GATCGAGGCA CCACCACCTA CGCCTACGAT GCCGCCGGCA ACCTGATCCA GAAAACCGAT
GCCCGTGGCG TCATCGCCAA CTACAGCTAC GATGCCCTCA ACCGGTTGAC GGCCACTCAC
TTCAGTGGGA TGAGCCAAGC TAGCGATGCC GACATCACGC TGACCTACGA TCATGGGCAG
AACTGCAGCA ACGGCATCGG CCACCTCTGC ACAGCTCAGG ATCAGTCCGG CACCACAATC
TACGCCTACG ATGCCTTTGG CAATATCCTC AACCAAATCC ACAGTGCAGG CACAACCACC
TCCACCATCA GCTACCGGTA CGACAACGGT AATCGCATTG CGATGATCAT CTATCCCGAT
GGCCGAGAAG TCGGTTACAC ACGCGATGCC ATTGGACGAG TCCAGGGGAT CACCACTACT
GCAGCCGGCA ACAGCCAAAC CCTGGTCAGT GCCCGCCAAT ACCATGCCGA CGGTAGTTTG
ACCGGAGAGA CTTTTGGTAA TGGCTTGGCC GATCAACGCC AATACACCGC TCAGGGCAGG
CTTTCCAGCT GGACACTGGG CGGCAGCAAC GTCAATCTCA ACTATGGCTA CAGCTACGAC
GCCAATGGCA ATATGACCGG TCAGAGCGGT CCCGATGGCG CTGCCGCCTA CCAGTACGAC
CCGCTTGACC GCCTGATTGA CGAATCCTGG GGCACGGGCA ACTACCACAA CGCCTTCAGC
TACGACAACA ACGGCAACCG TCTGACCAGC CTGGACAGTG GTGGCAACAC CGTCGACTAC
AGTTACGACC TGCAGAGCAA CCGTCTCAAC CAACACGGCA GTCAGAAAAT CACCTTGGAT
GCGGCAGGCA ATACCACAAA CGACGGTACC TATCAGTACC TCTACGATGC CGCCGGTCGC
CTGAGTGAAG TACTGTTGAG TGGCGTTACC GTGGCCAGCT ACCGCTATGA CTATCGCGGC
TTACGGCGGG AGAAGATCAT CGCAGCAGGC ACCACCGAGT TCACCTACGG CCCCAGCGGC
CACCTGTTAA GTGAGCAGAA TACGGCAAGT GGTGGCCGAG ACTACGTCTG GAGCGATACC
AGCCCCATTG CCCAGATTGA TGTAAACGGC AGCAATCAGG CCAACGACGC GATCTACTAC
CTCCATACCG ACGCCATGGG CACACCGAGA TTGGCGACCA ATGCCAATCA ACAGACTGTC
TGGCGATGGA ACAGGGATGC GTTTGGGGAT CGGCAGGTCA ATGCGAGCAG TGCCAGTATC
GAGATGAACC TGCGGTATCC GGGGCAGTAT TACGACACCG AGACTGGGTT ATTCTACAAC
TGGAATCGAT ACTATGACCC GAGTACTGGG CGGTATGCCA CCAGTGATCC GATTGGGTTG
AGTGGTGGAG TGAATACCTT TGGGTATGTT TCAGCTAACC CCTTGGCTCT TATTGATCCA
TGGGGGTTGT TTGGCAGTGC GCAGGTCAGC CCGATACCGC CTGGTTATAA CTCGGGCGAT
GTCAGAGGTG CGTATGACTC ATATGGCAAT AGCTCGGTTT ATGAACCAGG CTATTACGAC
CCTAACAGAG CAGCTGCGAT CATTATGGCT CTGCTTGCTG CCGGGATGGG GCCTGCTGCC
GATGAGTTCG CTGCAATACT CGCTCACGCG GAAGAGGCCA GCGCGATTGG AGGTGAGTGT
GAAGCTGCCG CTTCTTCGAA GCTGCCGTCG TTAGACGATT TATCACGTGC TGCCAGTGCT
TCAGATAGAA ATGGCTTTTC AAAAGCGGGG CGCTCTTTGC AGAAACACGG AAGCAGGCCG
GGGTCCAAGT GGGGGCAAGA GGATGTGAAC GTGAATAATC CCTCCGAGGC AAATTCGAGA
GCACAAGGCC TCGTTGATGA TATTTTGAAC TCACCAGGGA CTAATGTGGT TCAAAACTCT
CGTGGCGGTG TTGATGTGAT CTCTCAAGAT GGTCGGGTTG TTCGCTATAA CCGGGATGGT
TCAATGCAAG GTTTTCGGGA GTGA
 
Protein sequence
MRGILFLFLL ACSSPSFAGW YCNGMDYPST TIPAFAASPS GCADIEEHLD ENWDPRSGMA 
YPYTITMASP VQAVVTFHKT VTIDGYGVPI GTVVNNPQSV FKYWGCNNDG PKSSIDSCAK
PQPPKRLGTP APGSCTGDPC DAGTGNEFQY DTDYQGASDT LSFSRAYNSL DIQDHGLGYG
WVSNIGGHLA ISGSLLTAYQ AINGSSLTVY QADGKGLPFT LTNGVWQGDA DSKLQLTQDA
TGYTLQRQDG SSDRYDLHGH LLSETDRAGR TTTYTYDSAN HIAAVTGPFG HTLTFTYNSY
GRLVSLTDPV GQVTSYSYDT AGNLAQVNYP DGTAKQYSYG DSSFSHALTG VAFVDANGNV
TPFDNFVYDS YGKVTTNELS GGQQRFDLSY DSDTQTTVTN AAGRQDVLTF QAQLGVKNLL
SNIVQGDGKG LTQQFDARNN VISRTNADGQ TTQYTYDSQN RLASETQAAG TSQARTVSYQ
YGTDGLALPA EIDRPSVCAG SSQQTVITYD AHHNPIQITE NGYTPACSAI SRSLSLGYNS
AGQVTRIDGP RTDVSDVTTM SYNNCTTGGA CGQLLSLTDA LGHITTFNAY DADGRLLQKT
DPNGLVTSYA YDPRGRLSRI TQQASGSSAR VTTFAYTPSS KLASTSLPDG RTLTYSYDDA
QELTAITDNL GDKVSYAYDS RGNRSQTSIY DPDSSLVRQI KSVYDLRNHL ASSNDSGSIT
QQVTDALGNL VQQTDPNNNA PTTHSYDPLN RLIQTVNAIG GTTSYGYDVN AEINQVITPN
GATTGYQNDD FGDLLQEVSP DRGTTTYAYD AAGNLIQKTD ARGVIANYSY DALNRLTATH
FSGMSQASDA DITLTYDHGQ NCSNGIGHLC TAQDQSGTTI YAYDAFGNIL NQIHSAGTTT
STISYRYDNG NRIAMIIYPD GREVGYTRDA IGRVQGITTT AAGNSQTLVS ARQYHADGSL
TGETFGNGLA DQRQYTAQGR LSSWTLGGSN VNLNYGYSYD ANGNMTGQSG PDGAAAYQYD
PLDRLIDESW GTGNYHNAFS YDNNGNRLTS LDSGGNTVDY SYDLQSNRLN QHGSQKITLD
AAGNTTNDGT YQYLYDAAGR LSEVLLSGVT VASYRYDYRG LRREKIIAAG TTEFTYGPSG
HLLSEQNTAS GGRDYVWSDT SPIAQIDVNG SNQANDAIYY LHTDAMGTPR LATNANQQTV
WRWNRDAFGD RQVNASSASI EMNLRYPGQY YDTETGLFYN WNRYYDPSTG RYATSDPIGL
SGGVNTFGYV SANPLALIDP WGLFGSAQVS PIPPGYNSGD VRGAYDSYGN SSVYEPGYYD
PNRAAAIIMA LLAAGMGPAA DEFAAILAHA EEASAIGGEC EAAASSKLPS LDDLSRAASA
SDRNGFSKAG RSLQKHGSRP GSKWGQEDVN VNNPSEANSR AQGLVDDILN SPGTNVVQNS
RGGVDVISQD GRVVRYNRDG SMQGFRE