Gene Rcas_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0100 
Symbol 
ID5537559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp120193 
End bp122010 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content64% 
IMG OID640892264 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_001430254 
Protein GI156740125 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATAC GGGTGCTCGA TGCAACCGTT GCTGCGCAGA TCGCCGCTGG CGAAGTAATC 
GAACGTCCGG CGTCAGTCGT GCGCGAACTG GTCGAAAATG CGCTCGATGC CGGGGCGCGC
CGCATTGTCG TGGAAGCGCG CGGCGGCGGG TTGCGCGAGA TCCGGGTGCA GGACGACGGG
TGCGGCATTC CTGCCGATGA AGTCGAACTG GCGTTTGCGC GCCACGCAAC CTCGAAACTA
TCCACAGCCG ATGATCTCTG GTCGATTGCA ACGCTTGGCT TTCGCGGTGA AGCGCTCCCG
TCAATCGCGG CAGTGGCGCA GGTAATCTGC GTTACTCGCG CTGCCGGCGC CGACGTGGGC
GTCGAACTGC GGATTGCCGG CGGCGAGGTG CAGGCGATTA TGCCACGCGG ATGCTCGCCA
GGCACGACAA TCAGTGTACG CAACCTGTTC TACAATACGC CGGTGCGGCG CGATTTCCTC
CGTTCCGACG CCGCCGAGTC CGCCGCGATT ACGTCCGTCG TCACGCAGTA TGCGCTTGCC
TACCCCGAAG TGCGCTTCAG CCTCGTCATC GACGGGCGTG CCACGCTTCA GACCAGCGGC
AACGGCGATC TGCGCGCCGC CACAATCGAG ATTTACGGGC TTGATGTTGC GCGCCAGTTG
CTGGCAATCG ATGCTGCCGT CGGCGAGGGT GTCGATCTGG TCCAGGTGCG TGGGTTGGTG
TCGCCGCCGG GGCTGACCCG CAGTTCACGC GCGGCTATCC ACCTGTTCAT CAATCGTCGC
GCTATTCAAC CGCGCGGGCA GATTGCAATT GTGCTCGAAG AGGCATATCA CACGCTGCTG
ATGAAAGGTC GCCATCCTAT GGCGATTTTG AACATCACGG TGCATCCCGC AGCGGTCGAT
GTCAATGTCC ATCCAACCAA GAGCGAGGTC AAATTCCGCA ATACGACGCA GGTGATGAGC
GTACTGGGGC GAGCGGTGCG CACCGCGCTC CTGGAAAGCG GCGTGCGTCC CTGGGAAGAA
CCAGGCGTCC CTGCATCGCT CGACACGGCG CAGCGCCGCT TTGAACTGCG GCGCCTGGGA
ACATCCCCCG AAAGCGCCTG GGATGCGCCA TCCTGGATGA CGGCGGGCGA TAAGCACGGG
GAAATACCGG CGATGGACGA CCGGCGCGCT GTTGGGCAGA GCAGCGCGTG GGAGCGTGAG
CCGGCGCCAC CCGACGCACA ACTCATCACG CACGCCTCGA AATTGCCGCC GTTGCGGATC
GTCGGACAGA TCGCCCAATC CTACATTGTC GCCGAGTCGC CGGATGGGAT GTATCTCATC
GATCAGCACG CTGCGCACGA ACGCATCACC TACGAGCGGT TGATGGCGCA ACGGGGCGCC
GGCGCGATTG AACGCCAGGA ACTGCTGATC CCGCAGGTGA TCGATCTGCC GCCGACGGCG
CAGGACGTGC TGCTGGATGC CGCCGACCGG TTGGCGGAGT GGGGGTTTGC TGTCGAACCA
TTCGGGCGCA GCCTGCGCAT TCGCGCTATT CCGGCAGTGC TCTATCCCGG CGATCTGGCG
ACAGCACTGC TCGAAATCGC CGATCACCTG AGCGGTCGTG GCGGAACAAC GCCACACGAC
TGGCGTGAGG CGCTGCTGAT CACCCTCTCG TGCCATACCT CGGTGCGCAG CGGGCAAACC
CTCTCGTTCG ACGAAATGCG CGGATTGGTG CTGCAACTGG AGCGCTGCTC ATCGCCACGC
ACCTGTCCCC ACGGTCGTCC GACGATGATT CTGCTGACGA CGACCCAGAT CGAGCGCCAG
TTCGGCAGGA TTAAGTAG
 
Protein sequence
MPIRVLDATV AAQIAAGEVI ERPASVVREL VENALDAGAR RIVVEARGGG LREIRVQDDG 
CGIPADEVEL AFARHATSKL STADDLWSIA TLGFRGEALP SIAAVAQVIC VTRAAGADVG
VELRIAGGEV QAIMPRGCSP GTTISVRNLF YNTPVRRDFL RSDAAESAAI TSVVTQYALA
YPEVRFSLVI DGRATLQTSG NGDLRAATIE IYGLDVARQL LAIDAAVGEG VDLVQVRGLV
SPPGLTRSSR AAIHLFINRR AIQPRGQIAI VLEEAYHTLL MKGRHPMAIL NITVHPAAVD
VNVHPTKSEV KFRNTTQVMS VLGRAVRTAL LESGVRPWEE PGVPASLDTA QRRFELRRLG
TSPESAWDAP SWMTAGDKHG EIPAMDDRRA VGQSSAWERE PAPPDAQLIT HASKLPPLRI
VGQIAQSYIV AESPDGMYLI DQHAAHERIT YERLMAQRGA GAIERQELLI PQVIDLPPTA
QDVLLDAADR LAEWGFAVEP FGRSLRIRAI PAVLYPGDLA TALLEIADHL SGRGGTTPHD
WREALLITLS CHTSVRSGQT LSFDEMRGLV LQLERCSSPR TCPHGRPTMI LLTTTQIERQ
FGRIK