Gene Plut_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_1981 
Symbol 
ID3744950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2205363 
End bp2207243 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content61% 
IMG OID637770012 
ProductDNA mismatch repair protein 
Protein accessionYP_375866 
Protein GI78187823 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.201794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCCA TTGCCAGGCT TCCGGACAAT GTAGCCAACA AGATTTCAGC GGGGGAGGTC 
GTCCAGCGGC CGGCTTCCGT CGTCAAGGAA CTCCTTGAAA ACGCCATAGA CTCCGGCGCC
GACCGGATTT CAGTCGTTAT AAAAGACGCC GGCAGGGAAC TGGTGCGCAT CATCGACAAC
GGACGGGGAA TGAGCAGGGC GGATGCACTT CTTTCGGTCG AGCGGTTCGC CACAAGCAAG
CTCAGGGACG TCGATGACCT CGATACCCTC GGGACCCTCG GATTCCGTGG CGAGGCACTC
GCCAGCATCT CCTCCGTCTC CCATTTCGAG CTCCGCACCC GCATGACCGA TGCGCCTGTC
GCGCTCCGTT TCCGTTACGA GGGCGGCATT GCAGTGGAAG AGTCTGAGGT GCAGGGTGAG
GCGGGCACCT CGGTGAGCGT CCGCAATCTC TTTTACAACG TCCCCGCGCG CCGGAAGTTC
CTGAAGTCCA ACGCCACCGA GTACGGCCAT ATTTTCGAAC TCGTCCGTTC GTTTTCCCTC
GCCTACCCTG AAATACAGTG GCAGCTCCTG AACGACGACC AGGAGCTGTT CAACTTCCGC
ACTTCCGATA TGCTGGAGCG CCTCGATACC TTTTACGGAA AAGGGTTTGC CGACAGCCTC
ATCGAGGTCG GCGAAGAAAA CGACTACCTC TCCATCAGGG GATACATCGG CCGCCCGGCG
CTCCAGAAGC GAAAGAAGCT CGACCAGTAC TTCTTCATCA ACCGTCGCCC GATCCAGAAC
CGCATGCTCA CCCAGGCTCT CCAGCAGGCA TATGCCGAGC TGCTTGTAGA GCGCCAGGCA
CCCTTCGCCC TCCTCTTTCT CGGTATCGAT CCCTCACGGG TGGATGTCAA CGTGCACCCT
GCGAAGCTCG AGGTCCGGTT CGACGATGAG CGAAGCGTGC GCAACATGTT CTACCCCGTC
ATCAAGCGGG CCGTGACACT GCATGACTTT TCCCCCGATC TTGCCGCAGG AGGACGGACC
TCGCAGGCAG GGGATGATTC CGCTTCCCGG GGGTTCACTC ATGCCGGCGG GGGTGGATTC
AGGACCCTTG CTTTTCAGGA GGTCCCGGAA CGGGCCATTA CGACCGGAGA GCTCTACGGC
AGCTATCGCG AAGGGGCATT CGGCAGTTCC CGCCCGGCAG TTCCGCAGCC TTCACACCAG
GAGGTGATGT TCCCTGTTCC TGAAGTCCCG GCGGCCCGTG AGGATATCTC ACAGCTGCTC
CGCTCGAGCA TGCACGAGGG CCCGGAAGGC GCCGGAGTGG AGCCGAAAGG GGAGGAACCG
AAGATCTGGC AGCTCCACAA CAAGTACCTC ATCTGCCAGA TCAAGACCGG GCTCATGATC
ATCGACCAGC ACGTGGCTCA TGAGCGGGTG CTCTACGAAC GCGCGGTGGA GGTGATGGAG
AGCCGCGTGC CGAACTCCCA GCAGCTGCTC TTTCCGCAGA AGGTCGAGTT CCGGCCGTGG
GAGTATGAAG TGTTCGAGGA GATCAAAGAC GATCTGTACC GGCTGGGCTT CAACCTTCGT
TCGTTCGGGA CCCGGGCGGT GATGATCGAG GGCGTCCCGC AGGATGTGCG GCCCGGAAGC
GAGGCCACCA TCATGCAGGA CATGATTGCC GAGTACAGGG AGAACGCCAC CCGGCTGCGG
CTGGAGAGGC GCGACAATCT GGCGAAATCA TACTCCTGCC GCAACGCCAT CATGGCGGGC
CAGAAACTCT CGATGGGGGA GATGCGCACC CTCATCGACA ATCTTTTCGC CACCAGGGAA
CCTTACTCAT GCCCGCATGG CAGGCCCGTC ATCATCAAGA TGACGCTGAC CGAGCTCGAC
CATATGTTCG GCAGGTCCTG A
 
Protein sequence
MPSIARLPDN VANKISAGEV VQRPASVVKE LLENAIDSGA DRISVVIKDA GRELVRIIDN 
GRGMSRADAL LSVERFATSK LRDVDDLDTL GTLGFRGEAL ASISSVSHFE LRTRMTDAPV
ALRFRYEGGI AVEESEVQGE AGTSVSVRNL FYNVPARRKF LKSNATEYGH IFELVRSFSL
AYPEIQWQLL NDDQELFNFR TSDMLERLDT FYGKGFADSL IEVGEENDYL SIRGYIGRPA
LQKRKKLDQY FFINRRPIQN RMLTQALQQA YAELLVERQA PFALLFLGID PSRVDVNVHP
AKLEVRFDDE RSVRNMFYPV IKRAVTLHDF SPDLAAGGRT SQAGDDSASR GFTHAGGGGF
RTLAFQEVPE RAITTGELYG SYREGAFGSS RPAVPQPSHQ EVMFPVPEVP AAREDISQLL
RSSMHEGPEG AGVEPKGEEP KIWQLHNKYL ICQIKTGLMI IDQHVAHERV LYERAVEVME
SRVPNSQQLL FPQKVEFRPW EYEVFEEIKD DLYRLGFNLR SFGTRAVMIE GVPQDVRPGS
EATIMQDMIA EYRENATRLR LERRDNLAKS YSCRNAIMAG QKLSMGEMRT LIDNLFATRE
PYSCPHGRPV IIKMTLTELD HMFGRS