Gene Phep_2920 details

Gene Information

Locus tag: Phep_2920
Symbol:
ID: 8254031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3482390
End bp: 3484768
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 46%
IMG OID: 644936568
Product: Smr protein/MutS2
Protein accession: YP_003093180
Protein GI: 255532808
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
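
The start, end, and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3482390, 3484768)
print(length, protein_length(length))  # 2379 792
```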


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000870196
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTATATC CGGAGAATTG TTTGGAGCGT TTGGGTTTTA ATGAGGTAAA ACAGCTTATC 
CACAAACATT GTTTAAGCCC GATGGGGCAG CAAATGGTGG CAAAAATGCA GGTGATGGCC
AAGTTTGACC AGATCAACAA ATTTTTGCGG CAAACGCAGG AGTTCAAAAG TATTCTTGAA
AACCAGGAAC CTTTGCAGAT CAGTACCTTT TTTGACATTA AAAGTCTGGC CGATAAGATC
AGGGTAGAAG GCACTTACCT GGTAGAAGAA GAGCTGCACC AGATGTACGC CTCTTTGCAA
ACGGTGTTTT CGGTATTGCG CTTCTTTGAA GAACGTGCCG CTGTTTATCC CAATCTGGAA
GCTTTGTTTG AACACCTTCC GGTAGAAAAA AATATCCTTA AAAAGATTGA AACCGTACTT
GACCCAAAGG GTAAAATAAA ACCAAATGCT TCGCCGGCAC TGCAAAACAT TATTGGCGAT
ATTGCCAAAG CAGAACAGGA TGTGCGTAAG CGGATGGACT CGATCTATAA GCAGGCGGTA
AGCAACAACT GGGTGGCCGA TGGCAGTCTG ACCATCCGCG ATGGCAGGAT GTGTATCCCT
GTGCTGGCCG AAAACAAGCG TAAGCTAAAA GGCTTTGTAC ACGACGAATC GGCAAGCGGA
CAAACGGTTT ACATTGAACC GGAAGAGGTT TTTACCTTAA ACAATAAGCT CAGGGACCTG
GAGTTTGACA AGCGCAGGGA GATCATCAGG ATACTGATTG CGCTGACCAC TGAACTGAGG
CCTTATACAC CTTTGCTGCT GTCGTACCAT GGTTTTTTAA CGAAACTTGA TTTTGTAAGG
GCAAAAGCCT TGTTTGCCAT TGATGTAGAG GCCGATATGC CGGTACTGAT CAATGCGGCA
AAAACCAGGC TGGTCAATGC CAGGCACCCT TTATTGTATC TTTCCTTTAA GGAAGACAAA
AAAACGGTGG TGCCTTTGAA CATCCACATC AACGAGGAAC TGAGAATTGT ACTGGTATCC
GGCCCCAATG CCGGAGGTAA ATCGGTATGC ATGAAAACGG TGGGCCTGTT GCAGTTAATG
GTACAGTCGG GCTTACTGAT CCCCGTTCAT GAATCCAGTG AGGTAGGAAT ATTCGACAAT
ATATTTGCAG ATATTGGTGA CGACCAGTCG ATAGAAAGTG ATTTGAGTAC CTACAGTGCT
CATTTAACCA AAATGCGCTA TTTTGTGGCC CATGCCACAC CAAAATCGCT GGTACTGATC
GATGAGTTTG GTACAGGTAC TGATCCGCAG TTTGGCGGAC CAATGGCCGA GGCGGTGCTG
GAAGTGCTGA ACAATAAAAA GGCAAGGGGG GTAATCACTA CCCACTATTC CAATTTAAAG
TTGTTTGCTG GCAATACACC CGGACTGGAA AATGCCTCTA TGTTGTTTGA CAACGACCGG
ATGAAGCCCC TGTATATATT GGAGATTGGC AAGCCCGGTA GTTCTTATGC TTTTGAGATA
GCCCAGAATA TAGGCCTGCA AAAGGAAGTG CTGGATTTGG CAAGGGCCAA AACAGGCACC
AACCAGAACA GGATAGACAG TTTGCTGGTA GACCTGGAAC GCGAGAAAAA ACAGATCTAC
GATACCAAAC TGAATTTATC TAACCAGCAG AACAAGGTAA AAAACCTGGT GGCCGAGAAT
GAAAAGCTGA AGGCTTTTCT GGACGACAAT AAAAAGATAC TGATCAAAGA GGCCAAGCTG
GAAGCACAGA ACATCATTAA AAATGCCAAT AAGCTGGTTG AAAATACCAT TGCCGAAATT
AAGGAAAAGC AGGCCGATAA AGCGGTAACC AAACAGCTGC GGCAAAACCT GCAACAGGTG
CTGGTGCAAA ACCAGGTACG GGAAGACAAA AAGCCCGAGC CGGTTAGCCC TTTAAACTTA
AATACACCAA TAGAAGTAGG CGATTGGGTG CAGCTGAAGG ACAGTGAAAC CACAGGCCAG
GTACTGGAGA TCAACAGGGA CAACCTGGTG CTTGCGCTGG GCGACCTGCG TTCGGTGCTC
AAAAAGAACA GGGTATTTAA GATCAGCAAC AGGGAGGCTA AAAAAGCCGC ACAGCGGAAT
TCTTATACCG GCAGCGTTGC TGAGGCCATC AGTAATTTTA ATGCCGAACT GGACCTTAGG
GGCATGCGGG GTGAAAATGC TTTGCACGAG GTAGAAAAGT ACCTGGACAA ATCCATTATG
CTTGGTTTTC CTTTTGTAAA GCTCATCCAT GGTAAGGGGG ATGGTATTTT GAGAAAGCTG
ATCAGGGATT ACCTGAAAAA GTACAGCCAG GTGAACAGGG TAGAGGATGA GCATGCCGAC
AGGGGTGGCG ATGGGATTAC TTATGTTTAT TTTAATTAA
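
A GC content figure like the one listed under Gene Information can be recomputed from a sequence in this layout. The following is a minimal sketch that assumes the sequence is pasted as a string with the 10-base grouping and line breaks left in place; `gc_fraction` is a hypothetical helper name, and the short input shown is a toy example rather than the gene itself.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence above to the same function and rounding to a whole percent is one way to compare against the listed value.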
 
Protein sequence
MLYPENCLER LGFNEVKQLI HKHCLSPMGQ QMVAKMQVMA KFDQINKFLR QTQEFKSILE 
NQEPLQISTF FDIKSLADKI RVEGTYLVEE ELHQMYASLQ TVFSVLRFFE ERAAVYPNLE
ALFEHLPVEK NILKKIETVL DPKGKIKPNA SPALQNIIGD IAKAEQDVRK RMDSIYKQAV
SNNWVADGSL TIRDGRMCIP VLAENKRKLK GFVHDESASG QTVYIEPEEV FTLNNKLRDL
EFDKRREIIR ILIALTTELR PYTPLLLSYH GFLTKLDFVR AKALFAIDVE ADMPVLINAA
KTRLVNARHP LLYLSFKEDK KTVVPLNIHI NEELRIVLVS GPNAGGKSVC MKTVGLLQLM
VQSGLLIPVH ESSEVGIFDN IFADIGDDQS IESDLSTYSA HLTKMRYFVA HATPKSLVLI
DEFGTGTDPQ FGGPMAEAVL EVLNNKKARG VITTHYSNLK LFAGNTPGLE NASMLFDNDR
MKPLYILEIG KPGSSYAFEI AQNIGLQKEV LDLARAKTGT NQNRIDSLLV DLEREKKQIY
DTKLNLSNQQ NKVKNLVAEN EKLKAFLDDN KKILIKEAKL EAQNIIKNAN KLVENTIAEI
KEKQADKAVT KQLRQNLQQV LVQNQVREDK KPEPVSPLNL NTPIEVGDWV QLKDSETTGQ
VLEINRDNLV LALGDLRSVL KKNRVFKISN REAKKAAQRN SYTGSVAEAI SNFNAELDLR
GMRGENALHE VEKYLDKSIM LGFPFVKLIH GKGDGILRKL IRDYLKKYSQ VNRVEDEHAD
RGGDGITYVY FN
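
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons). The `translate` helper is illustrative, not the database's own tooling, and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTATATC CGGAGAATTG TTTGGAGCGT"))  # MLYPENCLER
```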