Gene Plut_0447 details

Gene Information

Locus tag: Plut_0447
Symbol:
ID: 3745964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium luteolum DSM 273
Kingdom: Bacteria
Replicon accession: NC_007512
Strand:
Start bp: 523612
End bp: 525981
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 62%
IMG OID: 637768487
Product: DNA mismatch repair protein MutS-like
Protein accession: YP_374378
Protein GI: 78186335
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
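
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon of the unspliced CDS is a stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Plut_0447.
length = gene_length(523612, 525981)
print(length, protein_length(length))  # 2370 789
```

Both results agree with the Gene Length and Protein Length fields of the record.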


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAGG GTACCTCCAG AAAACTTGAA TTCGACAGGG TCGTCGCCCA TACGGCGGGC 
TACGCCCTCT CAGTCTACGG ACGCGACACC CTGTCTTCCT CTCTGCCGTT TTCCGGGCAC
CGGGAACTGC AGGAGGAGCT CGAGCGGGTA CTTGAGCTGA AAGGTTTTCT CGAAGAGGGT
TCCACGCTCC CTTTCGATAC CCTTGCGGAC ATGCGCCCCC TGCTTGCCAC GCTCGAGATG
CATGGGGGTG TGCTTGAGCC CGGCGAGCTT CAGGATCTTC ACCTCTTTCT TGATGCAGCA
GCAGGGCTCC GGCGCCGAAT GGTCCGGGAG GCCGAGCTCC GGCCCCGCCT TGCCGTGTTC
GGGGCCGGGA TTCCGGAGGA TGCTTTTGTA CGGCCGGTCA TCAGGGAGGT CATCGACGAA
CAGGGGGTGG TGAAGAGTTC GGCCAGCGGA GAACTTGCCC GGATCCGCCG TGCCCTTGTC
GACAGGCGAC GGGCGCTCGG GCGCACGCTG GAGCGGATCC TTGGCGGATG CCGCTCGAGC
GGCTGGCTGA TGGAGGATAC CCTTACCATC CGCAATGGCC GCCAATGCCT CGCGATGCGC
CTCGAGTACC GCCACCGTGT GCCGGGCTTT GTGCAGGATT ACTCTGGCAG CGGCCAGACC
GTTTTTCTTG AGCCTGCCGA GTGCATGGAG ATCACCAATG CCATTCTCGA GGGAGAGATC
GAGGAGCGGC GGGAGATCGA GCGGATACTC AGGGAAACGA CGGCCAGGAT CCGCCCTGAA
CTTCCCGATC TTCTTTCGGG CGCATCCCTC ATGGCGGCCT TCGATTCACT CTATGCCCGT
GCCCGTTTCG CACTCGAAAC CCGTTCCGTG CTGCCAGCTC TTTCAGAGGG AACGGAGTTC
CGCATCAAGC GGGGTTACCA TCCCTGGCTG CTTATTTCCC ACCGCGACCG CGAAGTGCTG
CCTCTTGATC TGGAGCTCGG TTTAAATGAA CAGGTGCTCA TCATTTCGGG CCCGAACGCC
GGAGGAAAGT CCGTTGCCAT GAAGACAGCC GGCCTGCTCT CCTTCATGCT CCTGCACGGT
TATCTTCTGC CCTGCAGCGA AAGCTCCGTT TTCCCGCTTT TCACAAGCAT CGGCATCGAA
ATCGGCGACG AGCAGTCCAT CGAGAACGAT CTGTCGACGT TCAGTTCGCA CCTCCGCGAG
GTCCGCCGCA TCCTCGATGC GGCAGGGCGT GGATCGCTGG TGCTGATCGA CGAACTGTGT
TCGGGGACCG ATGTCGAAGA GGGGAGCGCC ATTGCCCGCA CCATCATTGA AGAGCTCCTC
CGGCGCGGTA CGAAGGCTAT CGTCACCACC CACCTTGGAG ACCTCAAAGC ATACGCCCAC
AGCCGCGAAG GCGTGGTGAA CGGCGCCATG GAGTTCGATC GCCGGGCTCT CCAGCCGACC
TTCCGCTTCA TCAAGGGCGT ACCGGGCAGC AGTTTCGCTT TTGCCATGAT GCAGCGGATG
GGATTTCCTC CCGCCATGGT CCGGGAAGCC GAATCCGCCA TCGGCGAAGG CCACCGGGGC
CTGGAGGAAC TGCTCGAGGA CCTGCAGGAG CTCCTGGCCT CCAACCGGTT GCTCCATCTG
GAGCTTGAAG CGCAGTCAGC CAGTATTCTA TTGAGGGAAC AGGCCGTCAC GGAGGCCGAA
AGCATGCTCC GGCGACGGGA TCGGGAGCAG AGACAGAAGG CCTCGAAGGA ACTGCAGCGG
GAGCTGCATC GGGCGCGTCT CGAGATCCGC GACATTCTTG CCGAAGCCAA TGCGGCGGCC
GGAGACCCGC GTGCCGTTCA GGAGGCCCGT CGGAAGCTGG CATCAAGAGC CGGTGATGCT
GAAAAAAAAG AGGCTCTGCT CCTGGATGCC CCGGCCCCGA CCCTCGATCG GAGCATCCGT
CCCGGGGATC TTGTCCAGCT CCTCGATACC TCGGCATCCG GAGAGATCGA AAGCCTCAGG
GGGGACATGG CCGTCGTACT CTGCGGTACC TTCCGTCTTA CCACATCGCT TTCGAACCTC
GAGAAGACCT CGAAACGCCA GGTCCGCAAG GCGGCCGGCG CCCCGGCACC CCGTTTTGTG
GGATGGAACG CAACTACATC CCCTGTAGAA TCAACAACGC TTGACCTGCG CGGCCTGACC
GGCGACGAAG CTGCCGTGAA AATCGAGCGC TTCCTCGACG CTCTCCGGCT CAACCGCATC
GAGCGTGCCA CCATCATCCA CGGCATGGGC ACCGGTGCCC TCCGCCGACG CACTGAAGAG
GTGCTTCGAA ACCACCCCCA TGTCCACTCC TGGCGATTGG GAGGGCAGGG CGAAGGGAGC
TCGGGAGTGA CTATCGTCAC GCTCGCCTGA
 
Protein sequence
MAEGTSRKLE FDRVVAHTAG YALSVYGRDT LSSSLPFSGH RELQEELERV LELKGFLEEG 
STLPFDTLAD MRPLLATLEM HGGVLEPGEL QDLHLFLDAA AGLRRRMVRE AELRPRLAVF
GAGIPEDAFV RPVIREVIDE QGVVKSSASG ELARIRRALV DRRRALGRTL ERILGGCRSS
GWLMEDTLTI RNGRQCLAMR LEYRHRVPGF VQDYSGSGQT VFLEPAECME ITNAILEGEI
EERREIERIL RETTARIRPE LPDLLSGASL MAAFDSLYAR ARFALETRSV LPALSEGTEF
RIKRGYHPWL LISHRDREVL PLDLELGLNE QVLIISGPNA GGKSVAMKTA GLLSFMLLHG
YLLPCSESSV FPLFTSIGIE IGDEQSIEND LSTFSSHLRE VRRILDAAGR GSLVLIDELC
SGTDVEEGSA IARTIIEELL RRGTKAIVTT HLGDLKAYAH SREGVVNGAM EFDRRALQPT
FRFIKGVPGS SFAFAMMQRM GFPPAMVREA ESAIGEGHRG LEELLEDLQE LLASNRLLHL
ELEAQSASIL LREQAVTEAE SMLRRRDREQ RQKASKELQR ELHRARLEIR DILAEANAAA
GDPRAVQEAR RKLASRAGDA EKKEALLLDA PAPTLDRSIR PGDLVQLLDT SASGEIESLR
GDMAVVLCGT FRLTTSLSNL EKTSKRQVRK AAGAPAPRFV GWNATTSPVE STTLDLRGLT
GDEAAVKIER FLDALRLNRI ERATIIHGMG TGALRRRTEE VLRNHPHVHS WRLGGQGEGS
SGVTIVTLA
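
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation. The helper names clean_sequence and gc_percent are hypothetical, and the GC computation simply counts G and C over all characters, which may differ from how the record's GC content figure was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = (
    "ATGGCAGAGG GTACCTCCAG AAAACTTGAA "
    "TTCGACAGGG TCGTCGCCCA TACGGCGGGC"
)
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Pasting the full gene sequence listing into clean_sequence should give a string whose length equals the Gene Length field.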