Gene Cagg_1936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1936 
Symbol 
ID7268852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2370057 
End bp2371895 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content59% 
IMG OID643566774 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_002463267 
Protein GI219848834 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000687656 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGATCC GTCTGCTCGA TGAAACGATT GCTGCCCAGA TTGCTGCCGG CGAAGTGGTC 
GAACGTCCGG CTTCCGTTGC CAAGGAGCTG GTCGAGAATG CACTCGATGC CGGTGCGCGC
CGGATTGTGG TCGAGGTGCG GGGTGGTGGG CTGCGTGAAA TTCGTGTGCA AGATGATGGC
TGTGGTATTT CTGCCGAGGA GGTTGAGTTG GCATTTGCTC GTCATGCGAC GAGTAAGTTG
CACTCTGCCG ACGATTTGTG GGCCATTCGC ACGCTTGGTT TTCGTGGTGA GGCGTTGCCG
AGCATTGCGA GCGTTGCCCA GGTGATCTGT GTGACCCGCG TCGCCGATGC CGAGTTGGGG
GTTGAACTGC GGATCGCCGG TGGTGAGGTG CAGTCACGTT TGCCGTGTGG TTGCCCTGCC
GGCACGACGA TTACGGTGCG CAATCTGTTC TACAATACGC CGGTGCGGCG CGAGTATCTC
CGTTCGGAAG CTACCGAAAC CGGTGCCATC AGTGCAATCG TTCAGCAGTA CGCTCTTGCC
TACCCAGAAG TGAGTTTTAG CCTCGTGATC GATGGGCGGT TGGCATTTCA AACGACCGGT
GATGGCGATT TGCGCGCGGT GGTAGTTGAG CTGTACGGTC TTGCGGTCGG GCGAGCCTTG
TTGCCGGTGC AGGCCGAAGT CGGTGATGGT GAATTGTGGG TGGCAGTGAA CGGTCTGATC
TCGCCGCCCG ACCTGACCCG TAGTTCGCGA AGTTATCTCT CGTTCTTTGC TAATCGGCGC
GCATTGCAAC CGCGCGGGGC CTTGGCGGCT GTGGTAGAGA ATGCGTACCA TACGATGCTG
ATGAAGGGTC GTTTCCCCAT CGCGATTATC GATCTGCGCG TGCATCCTGC TGCCATCGAT
GTGAATGTTC ATCCAACCAA GAGCGAAGTC AAGTTTCGCT ATCCGGCCCA TGTCCATAGT
GTGTTGGGGC AGGCAATTCG CGATGCGCTG ATCAAGGGGA GCGATATTCC GGTGTGGGAA
GCTCCCGATC CGGCGACAGC TCAGCGTCGA TTTGAGCTGC GTCGTCTTGG TCAGGAACAG
GCAAGCTCAC CGGCAACCTG GGGAGTAGGT GCTCAAGCAT GGGATCGGGA ACGCGCGCAC
TGGGATGTCG GTACACCGGT TTCGCGATCT GAGCCGTCGT TGCTGGTGTC ACCGGTTGCA
GTACCCACGT CGAATACTGC TGTACCTACA CCAGATGCTT CATTTGCAAC CTCATCATCG
GCCTTACCAC CGTTGCGCGT GGTGGGGCAA GTTGGGTTGA CGTATATCGT GGCGGAAGCA
CCAGAGGGGA TGTATCTGAT CGATCAGCAC GCTGCCCACG AGCGAATTAC CTATGAGAAG
TTGATGAATC AGTACGCTCA ACGTGCAGTT GAGTCGCAGC AGTTGCTGAT TCCGCAAGCG
GTTGAGTTAA GCCCGGAAGC GAGCACACTT TTGGTAGGTA ATGCCGAGAA GTTGGCCGAA
TGGGGGTTTG CGCTGGAGCC TTGGGGAACC GGAGTGTTGG TGCGGGCGAT ACCGGCTACT
CTGCCACCCG ACGAGCTAAC ACAGGCGTTG CACGAGGTAG CCGAGCGGCT GGCCGGGCGC
GGTGGGAGTA GTCCGCTCGA GTGGCGTGAA GCGATGTTGA TTACGTTGGC CTGCCACACT
TCGGTGCGCG CCGGCCAACC GCTCTCGCAC GACGAGATGC GTCATCTGCT CCGCCAACTT
GAACAGTGTG TAAGCCCGCG CACGTGTCCG CATGGTCGCC CGACTATGAT CCTGATGACA
CCGGCTCAGC TCGAACGGCA GTTTGGTCGG CGCGTGTAA
 
Protein sequence
MPIRLLDETI AAQIAAGEVV ERPASVAKEL VENALDAGAR RIVVEVRGGG LREIRVQDDG 
CGISAEEVEL AFARHATSKL HSADDLWAIR TLGFRGEALP SIASVAQVIC VTRVADAELG
VELRIAGGEV QSRLPCGCPA GTTITVRNLF YNTPVRREYL RSEATETGAI SAIVQQYALA
YPEVSFSLVI DGRLAFQTTG DGDLRAVVVE LYGLAVGRAL LPVQAEVGDG ELWVAVNGLI
SPPDLTRSSR SYLSFFANRR ALQPRGALAA VVENAYHTML MKGRFPIAII DLRVHPAAID
VNVHPTKSEV KFRYPAHVHS VLGQAIRDAL IKGSDIPVWE APDPATAQRR FELRRLGQEQ
ASSPATWGVG AQAWDRERAH WDVGTPVSRS EPSLLVSPVA VPTSNTAVPT PDASFATSSS
ALPPLRVVGQ VGLTYIVAEA PEGMYLIDQH AAHERITYEK LMNQYAQRAV ESQQLLIPQA
VELSPEASTL LVGNAEKLAE WGFALEPWGT GVLVRAIPAT LPPDELTQAL HEVAERLAGR
GGSSPLEWRE AMLITLACHT SVRAGQPLSH DEMRHLLRQL EQCVSPRTCP HGRPTMILMT
PAQLERQFGR RV