Gene Phep_0481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0481 
Symbol 
ID8251568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp575701 
End bp577572 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content45% 
IMG OID644934131 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003090767 
Protein GI255530395 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATA TTATACAGCT TTTACCCGAT AGTGTGGCCA ACCAGATCGC AGCAGGCGAG 
GTGGTACAGC GGCCTGCGTC GGCTGTAAAA GAGTTATTAG AGAACGCGAT TGATGCGGGG
GCAAATAAAA TACAGCTCCT AGTTAAGGAT GCCGGAAAAG CCCTGATCCA GGTGATAGAC
AATGGTTGCG GAATGAGTGT AACCGATGCC CGGATGTGTT TTGAACGCCA TGCAACCTCT
AAAGTACGTA AGGCTGAAGA CCTGTTTGCG ATCCGCACGA TGGGTTTCAG GGGCGAGGCC
ATGGCTTCTA TTGCAGCCAT TGCCCAGGTA GAGATGAAGA CCCGCAAGCA TGATGAAGAG
CTGGGGACCG TAATAGAGAT TGAAGGCTCT GTTTTCGTAA AACAGGAGCC TGTTGCCTGT
TCTGAAGGTA CCAGCATCAG CATCAAAAAC CTTTTTTACA ATACACCCGC CAGGCGCAAT
TTTCTGAAAA GCAATCCGGC CGAGATGCGC CATATTATTG ATGAATTTCA GCGGATATCA
CTGGCACATC CTTCAATCGC CTTTAGTTTG CACCATGATG GCGTAGAGAT CTATCGCTTG
CCTGCTTCGG TATTGAAACA GCGGATTGTG CATTTGTTTG GTAATAATTA CAACGAACGG
CTGATCCCTG TTGAAGAAGA AACCAGCATC ATTAACCTAA AAGGATATAT TGGCAAGCCA
GAATTTGCCA AAAAGACTAG GGGTGAACAG TTCTTTTTTG TAAACAACCG CTTCATTAAA
GACGCTTATT TAAACCATGC GGTTAACAAG GCTTATGAAG AGCTGCTTGC AGATGATCAT
TTTCCGCTGT ATGTGCTCTT TATAGATATA GATCCGGCAA ATATTGATGT AAACGTACAC
CCAACAAAAA CAGAGATCAA ATATTTAGAT GAAAAATCTA TCTATGCCAT TCTGCATTCG
GCCATAAAAA GATCTTTGGG CAGGTTTAAC ATTAGTCCGA CTATAGATTT CGATCAGGAA
ACCGGCTTCA GCAATATGAT CACGCATAAA GCCCCTGAAG AAATTGTGCC GCCAAGCATT
AGTTTCAATC CTGATTTTAA CCCCTTTGCT GAAGATAAAC CTAGCCCATC CAGGGATGCG
GCTTATGCAA GTTTCCCTAA AAGTTATGGT GGGGGCGGGG GTAATATCAA GCCCAGCACA
AAAAACTGGG GCTCATTGTA TGAGATTGCA AACCATAATC CTGAAACCCA ATCGGCACTT
GACCTCCCTG CCGATCCTGC CGGTCATCAG TTTAGTCCTG TGCAAAAGCA GCTGATGCAA
CTGCACAACC GTTATATCAT TTCACAGATC AAGTCTGGAT TGATGCTGAT CGACCAGCAG
GCTGCGCATG AAAGGATACT TTATGAGCGT TTTACCCTGC ACCTGGAAGA CAGAAAGGGT
GCTTCACAGC AAAGCCTGTT CCCACAAACG GTAACTTTAA GCCCCAACGA TTACGAACTG
GCCAAAAGTT TGCTGGAAGA CATTAAAAGC TTAGGTTTTG AAGTAAGAGA GTTTGGGAAA
AATACCCTGG TGATTGAAGG GATACCAGTT GATCTGGGCG GAGGAAATAT CAACGAAACG
CAATTGTTTG AACACCTGAT AGAAGGATTT AAAAACTCGC AGCAGGAACT AAAACTGGAC
AAAAGAGATG CTCTTGCCAG AAGTATGGCC CGGAACAGCG CCATAAAAAA TGGCACCGTA
CTGGGACAGG AAGAGATGAA TACGCTGATA GAGCAACTTT TTGCCTGTAA AACCCCTAAC
TTTAGCATCA GTGGCAAGCC GGTTATCCAA ACCATCGGCC TTGCAGAACT GGATAAAAAA
TTTGATAAAT AG
 
Protein sequence
MSDIIQLLPD SVANQIAAGE VVQRPASAVK ELLENAIDAG ANKIQLLVKD AGKALIQVID 
NGCGMSVTDA RMCFERHATS KVRKAEDLFA IRTMGFRGEA MASIAAIAQV EMKTRKHDEE
LGTVIEIEGS VFVKQEPVAC SEGTSISIKN LFYNTPARRN FLKSNPAEMR HIIDEFQRIS
LAHPSIAFSL HHDGVEIYRL PASVLKQRIV HLFGNNYNER LIPVEEETSI INLKGYIGKP
EFAKKTRGEQ FFFVNNRFIK DAYLNHAVNK AYEELLADDH FPLYVLFIDI DPANIDVNVH
PTKTEIKYLD EKSIYAILHS AIKRSLGRFN ISPTIDFDQE TGFSNMITHK APEEIVPPSI
SFNPDFNPFA EDKPSPSRDA AYASFPKSYG GGGGNIKPST KNWGSLYEIA NHNPETQSAL
DLPADPAGHQ FSPVQKQLMQ LHNRYIISQI KSGLMLIDQQ AAHERILYER FTLHLEDRKG
ASQQSLFPQT VTLSPNDYEL AKSLLEDIKS LGFEVREFGK NTLVIEGIPV DLGGGNINET
QLFEHLIEGF KNSQQELKLD KRDALARSMA RNSAIKNGTV LGQEEMNTLI EQLFACKTPN
FSISGKPVIQ TIGLAELDKK FDK