Gene Hneap_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0653 
Symbol 
ID8533789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp707090 
End bp709735 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content60% 
IMG OID646383042 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_003262553 
Protein GI261855270 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGAAT CAAAAGATCT AAGCCAACAC ACGCCGATGA TGCAGCAATT CTGGACGATG 
AAACAGGCGC ACCCGGATGT GTTGCTGTTT TATCGTATGG GGGATTTTTA CGAGCTGTTT
TACGCCGATG CCGAGCGGGC GGCGCGCATT CTCGATTTGA CACTGACGAC GCGCGGGCAG
TCGGCAGGCG AGCCGATTCC GATGGCGGGT GTTCCGGTTC ATGCCTACGA GAGCTATCTG
GCGCGGTTGA TTCGCGCGGG CGAATCGGTG GCCATTTGCG AGCAGATCGG TGAAACCAAA
ACCAAAGGCC CGATGGAGCG TGCGGTGGTG CGGGTCGTCA CACCCGGAAC GGTCACGGAT
GAGGCCTTGC TCGATCAGCG CGAAGGCAAC CGCTTGGCGG CATTGGTGCC GCTGGCAACC
ACGCCACCGG AATACGGGTT GGCGCATCTG GATCTGGCGG CAGGCGATTT CGTGCTCATG
CGGCTCGATG ATGCGGCGCT GACGGCCGAG CTGGCGCGAA TCGATCCGCG TGAATTGCTG
TTGCCGGAAT CGCTGGCCGA GGCCGCCGAC ACGGCGGCGA AGATAGGCGT GGACCCCAAA
CGTTGGCGTA CGCGCGCCGA TTGGCAGTTC GATGCCAAAC GCGGGCAGGC GGCCTTGCTC
AAACACTGGC AGATTCACGA TCTTAAAAGT TTTGGCGTGA CGGAAATCCA TCAACCGGCG
CTGGGTGCGG CCGCCATTTT GCTGACCTAT GTAGCCGAGA CCCAGCGTAG TGCCGTGCCG
CATATCGAGC GCCTGCGGGT GGAGCACCTG GGCGATGCCC TGCTGATTGA CCGCAACACC
CGTCGCCATC TGGAGCTCTT CACTTCAAAT CAAGAAGGAA GTCACGATGA CGGCCGTTCG
GCAGCCACGC TGATCAACCT GCTGGATGAG ACGGTGACCG CGCACGGCTC GCGGCTGCTC
AAGCATTGGC TAGGTCGCCC GCTGCGTGAT CAGGCCGTGT TGCGGCATCG GCAGCAGGCG
ATTGGCGAAC TGATCGAGCG CGGCAAGATC AATGCGCTGC GCGAATCGTT GCGCGGTATC
AACGATATTG AACGCATCAC CACCCGCATC GTGATGGGCA GCGCCCGCCC CCGTGATTTG
TCCGGGCTGC GCGATGCCCT TGGTGTATTG CCCGCGCTGA GTGCGCAACT CAACCAACTC
GACCTGCCCT TATGGCGCGA TCTGGCCGTT CGGCTGACCG ATCAACCCGC CCCGCGTGAA
TTGCTGAACC GCGCACTGGT GACCCAACCA CCCGTGTGGC TGCGCGATGG CGGCGTGATT
GCCGCCGGAT TCGATGCCGA ACTCGACGAA TTGCGCCACC TTTCTGAACA CGCGGACGAC
GCCCTGAATG CGCTCGAAGC CCAAGCGCGA CTGCAAAGCG GTATTCAGTC CTTGAAGATC
GCCTACAACC GTGTGCAGGG GTTCTATTTT GAAGTCAGCC GGTTGCAGGC CGAAAAAATG
CCACCGCAGT TTATTCGCCG CCAGACGCTC AAATCGGTGG AGCGCTATAC GACCGAAGAG
CTGAAAACCT TCGAAGATCG CGTGTTGTCC GCCCGCGACC GCGCCTTGGC ACGCGAACAA
GGGCTCTTCA CCGAATTGTT GCAAACCCTC GCGACGCACC AGAGCGCCCT GCGCCGCATG
GCCGAAGCCA TTGCCGAGGT CGATGTGCTG CACAGTTTGG CGCGGGTGGC CGAGTGCCAG
CGCTGGGTGG CACCGGAACT CGGCAGTGAA CCGGGCATCC ACATCGAAGC GGGACGACAT
CCGGTGATTG AAGCCCTGAC CAAACAAACC TTAGGGAATC AGCCCTTCAC ACCGAATGAT
TGCGAACTCA CGCCAAACCG GCAACTGTTG ATGATTACCG GCCCGAACAT GGGCGGTAAA
TCGACCTATA TGCGGCAAAC GGCGTTGATC GTGCTGCTGG CGCACATTGG CGCGTTCGTC
CCTGCTACCC GCGCGCGTAT CGGTCCGATC GATCGCATTT TCACCCGCAT CGGCGCGGGC
GATGATCTGG CCTCCGGCCG TTCGACTTTT ATGGTCGAGA TGACCGAAAC GGCAGAAATC
CTGCACACGG CGACCGAAAA TTCACTGGTA TTGATCGATG AAATCGGTCG GGGCACGTCG
ACCTTCGATG GCCTGGCACT GGCCTGGGCC GTGGCGGAGC ACCTGATTCG CCGCAACCGC
GCGCTCACGC TGTTCGCCAC CCATTACTTC GAGCTGACTC AACTGACCGA GCGCTTCGAT
ACGGTCCGAA ACGTACACCT CGATGCCGTC ACACACAAGG ACGATTTGAT TTTTCTGCAC
AGCGTGAAAG ATGGCCCGGC CAGCCAGAGT TACGGCATCA AGGTCGCTGC GCTGGCCGGT
TTGCCCCGGG AGGCTATTCG GCGAGCACAA GCGTTACTAA AACAACTAGA GCAGCAACAC
CCCGTGGGAG CGGCCACGCC GCAGCTCGAT TTGTTTGCCG CGCCCGAAGT AACCGATGCA
ATTGAGGAAC CTGAGATTGA GCCGCACCCG TTGATTACCG CGCTCGAAAA ACTCGACCCG
GACATACTCA CGCCGAAGCA GGCGCTGGAT TTGATTTATG CCTGGCGCAA TGAACTTAAG
AAGTAA
 
Protein sequence
MTESKDLSQH TPMMQQFWTM KQAHPDVLLF YRMGDFYELF YADAERAARI LDLTLTTRGQ 
SAGEPIPMAG VPVHAYESYL ARLIRAGESV AICEQIGETK TKGPMERAVV RVVTPGTVTD
EALLDQREGN RLAALVPLAT TPPEYGLAHL DLAAGDFVLM RLDDAALTAE LARIDPRELL
LPESLAEAAD TAAKIGVDPK RWRTRADWQF DAKRGQAALL KHWQIHDLKS FGVTEIHQPA
LGAAAILLTY VAETQRSAVP HIERLRVEHL GDALLIDRNT RRHLELFTSN QEGSHDDGRS
AATLINLLDE TVTAHGSRLL KHWLGRPLRD QAVLRHRQQA IGELIERGKI NALRESLRGI
NDIERITTRI VMGSARPRDL SGLRDALGVL PALSAQLNQL DLPLWRDLAV RLTDQPAPRE
LLNRALVTQP PVWLRDGGVI AAGFDAELDE LRHLSEHADD ALNALEAQAR LQSGIQSLKI
AYNRVQGFYF EVSRLQAEKM PPQFIRRQTL KSVERYTTEE LKTFEDRVLS ARDRALAREQ
GLFTELLQTL ATHQSALRRM AEAIAEVDVL HSLARVAECQ RWVAPELGSE PGIHIEAGRH
PVIEALTKQT LGNQPFTPND CELTPNRQLL MITGPNMGGK STYMRQTALI VLLAHIGAFV
PATRARIGPI DRIFTRIGAG DDLASGRSTF MVEMTETAEI LHTATENSLV LIDEIGRGTS
TFDGLALAWA VAEHLIRRNR ALTLFATHYF ELTQLTERFD TVRNVHLDAV THKDDLIFLH
SVKDGPASQS YGIKVAALAG LPREAIRRAQ ALLKQLEQQH PVGAATPQLD LFAAPEVTDA
IEEPEIEPHP LITALEKLDP DILTPKQALD LIYAWRNELK K