Gene Hhal_1657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1657 
Symbol 
ID4709847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1810359 
End bp1812965 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content69% 
IMG OID639856124 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001003223 
Protein GI121998436 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCGA CCCAAGAGGC TGAAACCGCC ACCAATGGCG CCGGGCATAC CCCGATGATG 
CGGCAGTTCC TGCGCATCAA GGCCGAGTAC CCCGAGACCC TGCTGTTCTA CCGGATGGGT
GATTTCTACG AGCTATTCTA TGAGGACGCC GAGCGTGCCG CGAAACTCCT CGACATCACC
CTGACCACTC GAGGGGAGTC GGCGGGCGCG CCCATCCCCA TGGCCGGTGT ACCCGTCCAG
TCGGTGGAAA GCTATCTGGC GCGGCTGGTC CGCCTCGGCG AATCGGTGGC CATCTGCGAG
CAGATCGGTG ACCCCAACAC CACCAAGGGG CCCGTGGAGC GCCAGGTGGT CCGGGTGGTT
ACCCCCGGCA CCCTGACCGA GGATGCGCTG CTCGAGGAGC GCAGCGCCAA CCTGCTCACC
GCTGTGGCTC CGGGGCCGAA AGGCCGTTTT GGTGTCGCCT CGCTGGAGCT CTCCTCGGGG
CGTTTCTCCG TGCTCGAGGC CCCCGATCAG GAGGCCCTCA CCGCAGAGCT CGAACGCCTG
CGCCCCGCCG AACTGATCCT CCCGGATGAC GACCAGACGC CCGCACCGGA AGGGGGGTGT
GTGGCGCAGC GCCGCCCTTC GTGGCACTTC GAATACGATT CGGCCCGGCG CCTGCTCCTG
CGCCAACTCG GCACCCACGA CCTCTCCGGC TTCGGCGCCG AAGAGTTGCA CGCCCCGGTC
ACCGCAGCCG GGGCGCTCCT GCAATATCTC AACGAGACCC AGCGGGCCGC GCTGCCGCAT
GTTGGCGCAC TGACCGTCGA GTCGCGGGAT GAGGCCATCA CCATCGATGC CGCCAGCCGA
CGCAATCTGG AGATTGAACA CAATCTCTCC GGCGGCACCG AACACACCCT GGCCTCGGTG
ATCGATACCA GCGTCACCGC CATGGGCGGC CGACTGCTGC GCCGCTGGCT ACAGCGCCCC
TTGCGCCGGC GCGAGACCAT CGCCGCCCGC CATGCCGCGG TGGCCGCGCT GGCCGACGGC
GCCTTCGCCG ACGTCCGCAG CACCCTGGAA GGCTGCGCCG ATGTCGAACG CATCCTCGCC
CGGGTCGCCC TCGGCACCGC CCGCCCGCGG GATCTGACCG GCCTGCGCGA CGCCCTCGAG
CGCCTGCCGC AGCTGCAGAT CCTGCTCGGA CAACTCAACA GCCACCGCCT GCAGGCACTG
GGCGTCGAAC TCGATGAGCA CCCGCAAACC GTCGACCTGC TGCAGCGGGC GATCATCGAT
ACCCCGCCCG CGACCGTGCG CGACGGTGGG GTCATCGCCG ACGGCTTCGA CGGCGAGCTC
GACGAACTGC GCTCAATGTC GCGCAACGCC GACGACTACC TGGCCGCCCT GGAGGCCGAG
GAGCGCGCAG CGACAGGGAT CCCGACGCTC AAGGTGGGCT TCAACCGGGT CCACGGCTAC
TACATTGAAG TCAGCCGCAG CCAGAGCGGC CATATGCCGG AACGCTACAC GCGCCGCCAG
ACCCTGAAGG CCGCGGAGCG ATTCATCACC CCGGAGCTCA AACGCTTCGA GGAGCAGGTC
CTCTCCGCGC GCGAGCGCGC CCTGGCCCGC GAGAAGGCCC TCTACGAGCA GCTCGTCGCC
GACCTCGCCT CCGAACTGAC CCCACTGCAG CGCAGCGCCT CAGCCCTGGC CGAGCTAGAC
GCCCTGGCCG CCTTTGCCGA ACGGGCGCGC AGCCTCGACT ACGTCCAGCC CGAGCTGGCC
GACACCCCTG GTGTGCGGAT CGAGGGCGGT CGCCACCCCG TGGTCGAGCA GGCCCTGGAC
GCCCCCTTTG TGCCCAACGA TGTGCGCCTG GACAACCGCC GGCGGATGCT CCTGATCACC
GGGCCGAACA TGGGGGGCAA ATCGACGTAC ATGCGCCAGA CCGCTCTGAT TGCCCTCCTC
GCCTACGCCG GCGCTTTCGT GCCGGCCCAG CGCGCAGTGC TCGGCCCCAT CGACCGCATC
TTCACGCGCA TCGGGGCGGC CGACGACCTC GCCTCCGGCC GCTCCACCTT CATGGTGGAG
ATGACCGAAA CGGCCAACAT CCTCCACAAC GCCACCGCCG AGAGCCTGGT ACTGATGGAT
GAAATCGGCC GGGGCACCAG CACCTTCGAC GGCCTGGCCC TGGCCTGGGC CACCGCCGAG
CGGCTGGCCA CGCGCATCCG GGCCTTCACG CTGTTTGCCA CCCACTACTT TGAGATGACC
GCCCTCGAAC AGATCCACCC CGGCGTGGTC AACGTCCACC TCGAGGCGGC GGAACATGGC
GAACGCATCG TCTTCCTTCA CGCCGTGCGC GATGGCCCCG CCAACCAGAG CTACGGCCTC
CAGGTCGCCG CGCTGGCCGG TGTGCCGCAA GAGGTCCTCA AGGCCGCCCG GGAGAAGCTG
CGCAGCCTGG AGTCCGGGGA TGGCGGCGAC ACCGGCTCGG CACAGCTGCC GCTGTTCGGC
CCCGAGCCGG TGTTCCCGCC ACCGGCGCAG CCGGAGCCCG AGCCCGACCC GATCCGTGAA
GCGGTCGAAA ACCTCGATCC CGATGGGCTG ACCCCGCGGG ACGCCCTCGA GACCATCTAT
TGGCTCAAGG CACAGTGCAA GGAGTAG
 
Protein sequence
MAATQEAETA TNGAGHTPMM RQFLRIKAEY PETLLFYRMG DFYELFYEDA ERAAKLLDIT 
LTTRGESAGA PIPMAGVPVQ SVESYLARLV RLGESVAICE QIGDPNTTKG PVERQVVRVV
TPGTLTEDAL LEERSANLLT AVAPGPKGRF GVASLELSSG RFSVLEAPDQ EALTAELERL
RPAELILPDD DQTPAPEGGC VAQRRPSWHF EYDSARRLLL RQLGTHDLSG FGAEELHAPV
TAAGALLQYL NETQRAALPH VGALTVESRD EAITIDAASR RNLEIEHNLS GGTEHTLASV
IDTSVTAMGG RLLRRWLQRP LRRRETIAAR HAAVAALADG AFADVRSTLE GCADVERILA
RVALGTARPR DLTGLRDALE RLPQLQILLG QLNSHRLQAL GVELDEHPQT VDLLQRAIID
TPPATVRDGG VIADGFDGEL DELRSMSRNA DDYLAALEAE ERAATGIPTL KVGFNRVHGY
YIEVSRSQSG HMPERYTRRQ TLKAAERFIT PELKRFEEQV LSARERALAR EKALYEQLVA
DLASELTPLQ RSASALAELD ALAAFAERAR SLDYVQPELA DTPGVRIEGG RHPVVEQALD
APFVPNDVRL DNRRRMLLIT GPNMGGKSTY MRQTALIALL AYAGAFVPAQ RAVLGPIDRI
FTRIGAADDL ASGRSTFMVE MTETANILHN ATAESLVLMD EIGRGTSTFD GLALAWATAE
RLATRIRAFT LFATHYFEMT ALEQIHPGVV NVHLEAAEHG ERIVFLHAVR DGPANQSYGL
QVAALAGVPQ EVLKAAREKL RSLESGDGGD TGSAQLPLFG PEPVFPPPAQ PEPEPDPIRE
AVENLDPDGL TPRDALETIY WLKAQCKE