Gene Hneap_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2107 
Symbol 
ID8535266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2254933 
End bp2259978 
Gene Length5046 bp 
Protein Length1681 aa 
Translation table11 
GC content57% 
IMG OID646384484 
ProductYD repeat protein 
Protein accessionYP_003263971 
Protein GI261856688 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGAT TCAATCGCCG TTTCCTGGCT GCTTGTACCG CTGCATTGAT TGCAATGGGT 
TGGAGTGGAT TGAGTCACGC AACTCATGTT TCTGCACTCA CCGCACCTAT TGCGTGGACA
AAAGGGCAGC AGGTGTTACC GACACGAGAT TTCGGTGTGT TTCAAACGCC GCTGGTGCCA
TTCGGCGCGT CGAGCGCAGC AGAAACCCAT GACCTGCGGG TTGCTATCGC AAGTTACCGG
GCCGCGGCTG ATGCTGCAAA TACGAAGGCT CTCGACCATT TCCTGCATCA ATATCCAAAT
TCAGTCTGGC GCATCGCTTT ATTGACCAAT GAAGGCTTGG CTTATGAGCA GGCGGGCCTG
TTCTCGCAGG CGATAACCCG GCTGGATGCG GCATGGCAAT TAAGAGCAGG GGCGAAAACC
GAACCACAAC GGGCATTGAT TGAGCAAAAC TATGGCGCTT TGCTCCACTT GCATACGGTG
TTCGGTCATG AGAAAGCTGT CCAGGTGTTG CTGCAAGAAG GGAAAGGGCT CGTTCTTTCC
GGCGCGGCGC AGGCGAACAA AACACAAGCT GAGGAGGGCC TGTGGCGGAT GCAGCATGAG
CCTGGCAAGG CTCGGCTTTG TGGTCTTGTC GCTCTGGATC AGTTGTTGGC GATTGAGGGT
CATGATCAAT CCGTTGGACG ATTCAAGCGG GTACGCGCCG GTGAGGCAGG GCTTAGTCTG
GCGCGACTGG ATACGCTGGC GAATCAGGCG GGTCTGCCCA GCCGCGTGGT ATACCGTCAT
GGCGAAGAAC CGATTCCGGT TCCCGCCATA GCGCACTGGA AAGTGGGGCA TTACGCCACC
ATCGTCGGCG AAGCTGGTGG GCGTTATCAC ATCAAGGATG CCGCTCAGGG GCGCGATTAC
TGGATGACGC CCGAAGCAGT TCGGGCCGAA TCCAGCGGCT ACTTTCTGAT TCCGACCAAG
GCATCGGCGC AACCGCAGGC TAATCAGCCG ATTCTGGCGC AAACCATGGG CAGTCCATGG
CGTCGCGTTG CGCTGAGTGA AGCTGGCCGC ATCTTTGGTG CGGGTATTAC GCCGGGCAAC
AACCCGGACG ACACCTCCAA CGATGACCCT GATGTGGCTG GTTGTGGGGC TTGCGGCTCA
TCCGGCATGG CGCAATACAG CGTCAAGGCG ATGTTGGTGA GTTTGAGCTT GCATGATACG
CCGGTGGGTT ATGCACCGCC CAAAGGTCCG GCTGTACCGT TCACCATTGT TTATAGCCAA
CGCGAAGCGA ATCAACCGGC CAATTTTACC TTTGGCAACC TGGGGCAGAA ATGGATCAGT
AACTGGTTTG CCTACGTGCA AGACGATCCG ACCTCACCAG GCAACAGCGT GACCATTGCC
TTGCGCGGCG GCGGTACCCG ACACTATGCG GGCTTCAACG CCACGACAGG TGCTTTTTCA
CCTGAAGAAC GCACGGCGGC GCAACTGGTG AAAGTATCGG ATTCGCCCGT CACCTATGAG
CGCCGGATGC CCGATGGCAG CAAGGAAGTG TATGGTGCGT CGGACAACAG TACCTATTTC
CCTCGGCGAA TCTTCTTGAC CCAGGTCGTC GATCCAGCCG GCAATGCCGT GACTCTGGAT
TATGACAGCC AAATGCGGTT AACCACACTC ACCGATGCGC TGGGCCAGAA GACCACCCTG
ACCTACAGCA ATGCCCAGTA TCCCCTGCAA GTGACAGAAA TTACCGATCC TTTTGGTCGC
GCGGCCAGCA TTGCCTACGA CAGCAGTGGA CGGTTGATCG ACATCACCGA TGTGTTGGGT
ATGCACTCGC AGTTCACCTA CGACGGCGGC ACCTTCATCA CCGCCATGAC CACCCCCTAT
GGAACCACGC AATTTGCCTC AGGCGATAGC GGTACAACGC GTTGGCTGGA AATTACCGAT
CCCCAAGGGC GCAAAGAGCG GGTGGAATTC CGGCACAACG CACCGGGCAT CCCGTTCAGC
GACTCGCCCG TGCCCCAGGG CATCAACACA TTCAACGCGT ACATCAACTC CCGGGATACC
TTCTTCTGGG ATAAAACTGC CATGGAGCAC GCCCCGGGGG ACTATACCCA AGCACATATC
TACCATTGGT TGCATAATGC TGCCCAACCG TACTACGGCT TGACTGCCGG CGTATTGGAA
AGTGTCAAAT CTCCGTTGGA ACACCGAATC TGGTTCAGCT ATCCAAATCA ATCACCCGGA
GTGACGGGAG GTTTCGACAA ACCCTCAGCC ATTGCCCGTG TCCTGGCCGA TGGCAGCACC
CAACTGACGC GCATCAGCTA TAACCCCAAG GGTAATGTCA CTCAAACGGT CGATCCGCTT
GGCCGCACCG TGAATCTTAT CTATGCGACC AATGGCGTTG ATGTGGTCGA GGTTACGCGT
AACACGTTAG CGGGTGCTGA CATTCTGGCG CGTCTCACCT ACAACGCACA GCACGAACCC
TTGACCTACA CCGACGCCGC AGGGCAAACC ACTACCTACG CCTACAACGG AGCCGGTCAA
CGAACCTCCA TGACCGACCC TCTGGGGCAA GTGACGACAT ATGTCTACGA TGCCAACGGA
TACCTGCAAA AGGTTGTCAA CGCCGATGGC AAAACCCGAA ATAGCTATAC GTACGACGGT
TTTGGCCGCG TAGCGAGCAG CACCGACTCG GAGGGCCACA CACTGCGTTA TAGCTACGAT
GCGCTCAACC GTTTGACCAC CGTTACCTAT CCCGACGGCA CGAGTCGAAC CGTCACCTGG
GGCAAGCTCG ATCCGGTCGC CACGACCGAT CGGGAAGGCC GCACGACGAC CTACGCCTAC
GACAGCGTGC GTGACCTGAT CAGCAAAACC GACCCAATGA ATCAAGTGAC ACAGTATGGT
TATTACGCCA ATGGCAAACT GGAAAGCCTG ACTGATCCAA ACGGCAATAC CAGCACCTGG
GCCCGCGACA TCGAAGGGCG GGTAACCGGC AAAACCTATC CCGATGGCAG CCAGACCGGC
TACACCTACG ACATCACCGG TCGCGTGATT GAACGCAGTG ATGCCCTGGG GCAAAACACG
GCCTACAGCT ACGCACTGGA TGACCGACTC ATCGGCATCA GCTACTCCAA TGCCCTGCAA
CCCACCGCCG CTGTTCAACT CGGCTACGAC GCCAGCTATC CGCGCCTGAC TACCCGAACC
GACGGGCAGG GTACAACCAC CTATGGTTAT TATCCGGCTG GCGTCCTGGG TGCCGGGCAA
TTGGCGAGCG AACAGGGGCA AAACAGCCAC GATAGCCTGC AATACACCTA CAACGCCCTG
GGTTTGCTGG CGAGCCAGAC GGTAGATGGT GCAACCGAGC GCTACCAATA CGACGCCTTG
TCACGACAGA CGGGCGACAG CAATGCGCTC GGTGATTTCA CCACAGCCTA TCTTGGTGAA
ACCAGCCAGC CTGTGAGCCA GACCATCAGC CGCAATGGTC AACCCGTGCC CTATCAAATT
CAGTATCAGT ATGAGAACAA CCAGAGCGAT CGCCGCCTGA AAGCGATACT GAACGACATC
ATTAACCAGG GGCGTCTGCA ACCGGTGGCC GGTTTCACCT TTACCACCAG CCCTGAAAAT
CTCATTCTGA GCCGGGCAGA AAACCAGAAT GAGGATACTG ACGCGCACCA CAAGCACGAC
TTCGGTCGCC ACTGGGGGCT GCCGGACTGG ATGTTCGGGT GGGCAGACCG GCACGATACC
GACTGCCGGG ATCACGGCCA TGGCTTCGGT TTTGGTCACG ACCGGCACGG CTGCACATCC
GATCAGGGAG GTCAGCAGGC ACTCCAGTAT CAGTACGACG ACGCCCTGCG GCTGATCGCC
GCCGAAGACG GTAGCCCAAG AGATAATCGC GGCCACAAAG GCGGCAAGAG CGGCAGCAGG
AACGGTAACA CGGGCAGCAA TGCTGAAAGC TACCAATACG ATGCAGCCAG CAACCTGACC
GACATTACCA TCGGCAAAAC CAGCATCGCC CTGACCATCA ATGCGCTGAA CCAGATTGTC
ACCGCAGGCA GTACCGCCTA TCGCTACGAT GCCAATGGCA ACCTGCTTGA CGATGGCATA
AACACCTACA CCTGGGATGC CGCCGACCGG CTGGTAACCA TCACCAACCA GCAAACCGGC
CACACCAGCC AGTTTGCCTA CGATGGGCTA TCCCGCCGGA TCAGTGTTAC CGAGACCGAC
AGCGGCGGTA CGCCGGAGAC TACCCACTAT CTGTGGTGCG GTACCCGCAT CTGCGAAGCG
CGGGACAGCA GCGACACCGT ACTGGCCCGC TACTACGCCC AGGGTGAACG GCATGGCAGC
ACGATTGCCT ACTATGCGCA GGATCAGGTC GGCAGCGTGG TCGCCACGGT TGATCCACAG
GGGCAGATCA CCTCAAGGCT GAAATACGAC AGCTACGGCA ATATCATCCA GAGCAGCGGC
ACCTTGCCGG ACTACCGGTA TGCCGAGCTG TACGCCCATC CGCAATCGAG CCTGTATCTG
GCAACCTATC GGGCGTATGA TCCGAAGATT GGACGCTGGC TGTCGCGGGA TCCGATTCGG
GAAACTGGGG GTATAAATTT ATATGCTTAT GTAACCAGCA ATCCGGTTAT CAACATTGAC
CCTAAAGGCC TAGATATCTG GATTGAGGGC CCGTCCGGTC CTGAGCCAAG TTTTCATCAA
AGCGTGAATG TAGGAAATAT GAACGGATAC TATGACTCCT ACAGCTTTGG TATGGACGGT
CAGGGCATAG AGGGTAAGGT CTACCGCGAC CATGATCCTG GGGGGCAGAT AGAAAACTAC
AAGAGAACAA CATCGGAACA GGATAGGATT TTCAAGTCAG AAATGGACAA GAAATTAGGG
AACACCGGAA TCTATGGTTG GGACGATATT TGTCGCAGCT GGAGTCAAAG GCAATTTAAA
AATGCACCTG GAATCCCAAG CCAGTCTCCA GTCAGAAAAG TTTCGCCACA CTGGAATGTG
AGTCCATCTT CATCTAGGTC GACAACAGGG CCTAGTAGTT CAAGTGGCAC CTGGACTTCA
AAATGA
 
Protein sequence
MNGFNRRFLA ACTAALIAMG WSGLSHATHV SALTAPIAWT KGQQVLPTRD FGVFQTPLVP 
FGASSAAETH DLRVAIASYR AAADAANTKA LDHFLHQYPN SVWRIALLTN EGLAYEQAGL
FSQAITRLDA AWQLRAGAKT EPQRALIEQN YGALLHLHTV FGHEKAVQVL LQEGKGLVLS
GAAQANKTQA EEGLWRMQHE PGKARLCGLV ALDQLLAIEG HDQSVGRFKR VRAGEAGLSL
ARLDTLANQA GLPSRVVYRH GEEPIPVPAI AHWKVGHYAT IVGEAGGRYH IKDAAQGRDY
WMTPEAVRAE SSGYFLIPTK ASAQPQANQP ILAQTMGSPW RRVALSEAGR IFGAGITPGN
NPDDTSNDDP DVAGCGACGS SGMAQYSVKA MLVSLSLHDT PVGYAPPKGP AVPFTIVYSQ
REANQPANFT FGNLGQKWIS NWFAYVQDDP TSPGNSVTIA LRGGGTRHYA GFNATTGAFS
PEERTAAQLV KVSDSPVTYE RRMPDGSKEV YGASDNSTYF PRRIFLTQVV DPAGNAVTLD
YDSQMRLTTL TDALGQKTTL TYSNAQYPLQ VTEITDPFGR AASIAYDSSG RLIDITDVLG
MHSQFTYDGG TFITAMTTPY GTTQFASGDS GTTRWLEITD PQGRKERVEF RHNAPGIPFS
DSPVPQGINT FNAYINSRDT FFWDKTAMEH APGDYTQAHI YHWLHNAAQP YYGLTAGVLE
SVKSPLEHRI WFSYPNQSPG VTGGFDKPSA IARVLADGST QLTRISYNPK GNVTQTVDPL
GRTVNLIYAT NGVDVVEVTR NTLAGADILA RLTYNAQHEP LTYTDAAGQT TTYAYNGAGQ
RTSMTDPLGQ VTTYVYDANG YLQKVVNADG KTRNSYTYDG FGRVASSTDS EGHTLRYSYD
ALNRLTTVTY PDGTSRTVTW GKLDPVATTD REGRTTTYAY DSVRDLISKT DPMNQVTQYG
YYANGKLESL TDPNGNTSTW ARDIEGRVTG KTYPDGSQTG YTYDITGRVI ERSDALGQNT
AYSYALDDRL IGISYSNALQ PTAAVQLGYD ASYPRLTTRT DGQGTTTYGY YPAGVLGAGQ
LASEQGQNSH DSLQYTYNAL GLLASQTVDG ATERYQYDAL SRQTGDSNAL GDFTTAYLGE
TSQPVSQTIS RNGQPVPYQI QYQYENNQSD RRLKAILNDI INQGRLQPVA GFTFTTSPEN
LILSRAENQN EDTDAHHKHD FGRHWGLPDW MFGWADRHDT DCRDHGHGFG FGHDRHGCTS
DQGGQQALQY QYDDALRLIA AEDGSPRDNR GHKGGKSGSR NGNTGSNAES YQYDAASNLT
DITIGKTSIA LTINALNQIV TAGSTAYRYD ANGNLLDDGI NTYTWDAADR LVTITNQQTG
HTSQFAYDGL SRRISVTETD SGGTPETTHY LWCGTRICEA RDSSDTVLAR YYAQGERHGS
TIAYYAQDQV GSVVATVDPQ GQITSRLKYD SYGNIIQSSG TLPDYRYAEL YAHPQSSLYL
ATYRAYDPKI GRWLSRDPIR ETGGINLYAY VTSNPVINID PKGLDIWIEG PSGPEPSFHQ
SVNVGNMNGY YDSYSFGMDG QGIEGKVYRD HDPGGQIENY KRTTSEQDRI FKSEMDKKLG
NTGIYGWDDI CRSWSQRQFK NAPGIPSQSP VRKVSPHWNV SPSSSRSTTG PSSSSGTWTS
K