Gene Jann_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4047 
SymbolmutL 
ID3936535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4150797 
End bp4152629 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content66% 
IMG OID637906432 
ProductDNA mismatch repair protein 
Protein accessionYP_511989 
Protein GI89056538 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAAG CGCACCCCAA CATATCCACA AACCCACGCC CGGTGATCCG TCAGCTGGAC 
GAGGCTGCGA TCAACCGCAT CGCGGCGGGG GAAGTGGTGG AACGCCCCGC CTCTGCTGTC
AAAGAACTGG TGGAAAACGC GATAGATGCC GATGCGCGGC GCATCGTGAT TGAGGTGGCC
CACGGCGGCA AAACCCTGAT CCGCGTGACA GATGACGGCT GCGGGATGGA GGCGGCGGAC
CTGCCGCTGG CCCTGTCGCG CCACGCGACC TCCAAGATCG ACGGCACAGA TCTGCTCAAT
ATCCATTCCT TCGGGTTTCG GGGGGAGGCG TTGCCGTCCC TGGGGTCTGT CGGGCGGTTG
TCGATCACCT CGCGGGCATC GGATGCGGGC CACATGATCC GCGTGACCGG CGGTGCCCAT
GATGCCGTGA AGCCCGCCGC GCTGAACCGG GGCACTCTTG TGGAGCTGCG CGATCTCTTC
TTCGCCACGC CCGCACGCCT GAAGTTCCTG CGCACGGATC GGGCCGAGAT GCAGGCGATC
ACCGATGTGG TGAAGCGTCT GGCGATGGCG GAGCCTGCGG TAGGTTTCAC GTTGAAAGAC
GTGAGTGACG GCGAACGGGT GACGTTCCGC GTCGACCCGG AACAGGGCGA TCTGTTCGAC
GCGCTGCGGG GGCGGCTGAC GGCGATCATG GGGCGTGACT TCACCGACAA TGCGCTGGCG
ATTGATGCGG AGCGCGAGGG GCTGCGGCTG ACGGGCTACG CCGCCCTGCC CACCTATTCG
CGCGGGTCAG CGGTGGCCCA ATTCCTGTTC GTGAATGGCA GGCCTGTACG GGACAAACTG
CTGATCGGGG CGTTGCGCGC GGCATATATG GATGTATTGA GCCGCGACCG GCATCCGGCG
GCGGTGTTGT TCATCGGCTG CCCGTCGGAG CGGGTGGATG TGAACGTGCA TCCGGCGAAA
TCTGAAGTGC GCTTCCGGGA ACCTGGCGTT GCCCGATCCC TGATCGTGAC GGGGCTGCGC
CATGCACTGG CCGAAGCCGG ACACCGGGCC TCCTCCACCG TCGCGGACGC GACGCTGTCT
GCCATGCAAC CGGCTTATTC AGAGCCTCCT TCGGAACCTC TGGCTGCGCC GCGCATCTAT
CAGATGGACC GGTTGGGGCA GCAGCGGGGC TTTGCCCCTG TCCGGGGCCT GGAAGAAGCT
GCACCGCAAT GGTCCGCGCC ATCGGCGCGC GTGGATGAGG CGGTGATCGA CAGCGACGGC
CCATTGGGGG CTGCGCGGGC GCAGGTCCAT GAAAATTACA TCATCGCGCA GACGCCAGGC
GGGATTGTGT TGGTGGATCA GCACGCCGCC CATGAACGGT TGGTTTATGA GCGGTTGAAG
ACACAGATGG CAGAGCGGGG CGTGGCGCGG CAGGCGTTGT TAATCCCGGA GATCCTGACG
CTGGGAGCTG ATGCGGACCG CTTGCTGGAC CACGCGGCGG AGTTGGAGCG GTTCGGTCTG
GTGATTGAGG CGTTTGGCCC CGGCACCGTC GCCGTGCGCG AGACCCCCGC AATCCTGGGA
GAGATCAACG CCGAGGCGCT GTTGCGCGAC ATCCTTGATG AGCTGTCCGA TCTGGGCGAC
AGCCAGACGC TTCAGGCAAG GGTGGAGGCG GTGTTGTCCC GTGTCGCCTG CCACGGCTCC
ATCCGGTCGG GCCGTCAGAT GCGCGCCGAT GAGATGAACG CGCTGCTGCG CGAGATGGAG
GCCACGCCGC TGTCGGGCCA ATGCAACCAC GGGCGGCCCA CCTATGTGGA GCTGAAACTG
GCCGATATCG AACGGCTGTT CGGGCGCACG TGA
 
Protein sequence
MAQAHPNIST NPRPVIRQLD EAAINRIAAG EVVERPASAV KELVENAIDA DARRIVIEVA 
HGGKTLIRVT DDGCGMEAAD LPLALSRHAT SKIDGTDLLN IHSFGFRGEA LPSLGSVGRL
SITSRASDAG HMIRVTGGAH DAVKPAALNR GTLVELRDLF FATPARLKFL RTDRAEMQAI
TDVVKRLAMA EPAVGFTLKD VSDGERVTFR VDPEQGDLFD ALRGRLTAIM GRDFTDNALA
IDAEREGLRL TGYAALPTYS RGSAVAQFLF VNGRPVRDKL LIGALRAAYM DVLSRDRHPA
AVLFIGCPSE RVDVNVHPAK SEVRFREPGV ARSLIVTGLR HALAEAGHRA SSTVADATLS
AMQPAYSEPP SEPLAAPRIY QMDRLGQQRG FAPVRGLEEA APQWSAPSAR VDEAVIDSDG
PLGAARAQVH ENYIIAQTPG GIVLVDQHAA HERLVYERLK TQMAERGVAR QALLIPEILT
LGADADRLLD HAAELERFGL VIEAFGPGTV AVRETPAILG EINAEALLRD ILDELSDLGD
SQTLQARVEA VLSRVACHGS IRSGRQMRAD EMNALLREME ATPLSGQCNH GRPTYVELKL
ADIERLFGRT