Gene Lferr_1972 details

Gene Information

Locus tag: Lferr_1972
Symbol:
ID: 6877960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1970145
End bp: 1972469
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 63%
IMG OID: 642789841
Product: DNA mismatch repair protein MutS domain protein
Protein accession: YP_002220396
Protein GI: 198284075
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
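
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1970145, 1972469)
print(length)                  # 2325
print(protein_length(length))  # 774
```

Both values agree with the Gene Length and Protein Length fields in this record.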


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.656834
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTGG CCGGGGTTGC TCCTTGGCAG GACAGCTTCT GGCGGGCTCT GGATGCCAGT 
GATGTGCAGA GCGCGTGGTC ACAGCATTGT GGCCACATCT ATGGCCGCAG ACATGTTGCG
GCGACTACCC CCTGGGTGCC GCTTGCGGAA ATCCATGATC GTTTGAATTG GTTGGCCTGG
TTGTCGCAGC GGTTTTCCGC CGGTTTTCTT CTTCCCGTTT CTGATCTCCC CGATATCGAT
GCTCTGCTGG TTCTGGCACA GCGCCCTGGT GCCGTGCTCA GCGGGAGAGA TTTGCGCGCG
GTGCTCGATG TGCTGACGGC GCAGAAGGGC TATGCCGCAG CTTTGCGCGA GACTCGGGAT
GCCTTGACCG GGCTGGCAGG CGAGCTTGAT CCCCCGGTGA CCTTGTTGCG CCGTCTCGGC
GTGGCGTTGG ATGAGGAGGG TGAGCTCCTG GATACGGCGA GTGCGGATCT CGCCCTCCTG
CGCCAGCGCT TGCGAATCAG CCGCAACGAA TTGCAGCGTT TTCTGCAGGG CTTTCTGCGT
AATCGAGACT GGCAGGAATA CTGGCAGGAT CAGGTCATCG TGCAGCGTAA TGAGCGTTAC
GTACTCCCGC TCAAAGCTAG CCATAAAGGG CGTATCAAGG CGATCGTCCA TGACCGGTCG
GCAAGTGGTG AGACTCTTTT TGTGGAGCCG CTGGCGGCCG TCGATCTGAA TAATCAACTG
GTCCAGGACC GGCGTGCCGA AATTCAGGAG CAGGAGCGCA TTCTTCGTGC CCTGAGCGCA
GCGGTAGGGC TGGAAGTGCC GGGCATCGTC GCCGCCCTGC GGCATATTGG CCGGCTGGAT
GCGGTACGGG CGGGACTGGA ACTGGGGGAT GCCTGTGGCG GGATTTTGCC CAAGGTCGAT
GCGGCTGCGG CATTTGATCT GCGGGCCCTG CGTCATCCGC TGCTTTGTCT GCGCCATCCG
GGGCAGGTCG TGGGTAACGC CCTGCGTCTC GGGGCGGACG CGCAGCAACT GGTCATTACC
GGCCCCAATA CCGGCGGCAA GACGGCTATC CTGAAAGCAC TCGGACTCAA TCATCTGATG
GCCTACATGG GATTGCCGGT AACTGCGGAA GGTACATTGG GTTATTTCCC GAAGTGCTTT
GCGGTCATCG GCGACGCCCA GGATATCCAC ACGGATCTTT CCACTTTTTC GGCGCAGGTA
CAGCGTCTTC GGGAAGTGCT GGAGCATGCC GACGCCCACA GTCTCGTGCT TCTCGACGAA
CTGGGCAATG GCACCGACCC GCGGGAAGGG GGCGCGCTGG CTCAGGCGGT CGCAGAGGCC
TTGCTGGCGG CCGAATGCTG CACACTGCTC ACGAGCCATC TGGAAGTGAT GAAGCGTTAT
GCCCTGAGCC ATGCGGGTGT AGCCCTGGCG GGTATGGGCT TTGATGCAGA GTCTTTGAAA
CCCACCTACC GTCTGCTTTG GGGTGTAGGC GGCGCGAGCC AGGGTCTGGT CATCGCCCGG
CGCGTGGGGA TGCCTGCACC ACTGATGAAC CGCGCCGAGG CACTCTATGC CGATGACCGC
GAGAACTGGG AACGTTGGGA AGCGCAACGG GAAACCCTAT TGCAGGCCGC CCGGCAGGCT
ATGGATGAGG CCGTACTGGC GCGTGACGAA GCCACCAGTG TGGCCCGCAG CCTGGAACGC
GAACTGGAAG CAGCGAGGCA GGAGCGTGAC AAAGCCGCCG CCGCCGCCCG CGCCGAATGG
GAAGATATAC TGGCAACGGC GCGCCAGCAG GTGCGGCAAG CCATTGCCGC GCTCAAGTCC
GGGAGGGATA CCCAGGCGGC AACGGCGGCT TTGCAACGTC TGGAAGTCCC CTTCCGGGCG
GAGGAGCAGC AAGTGGACAG CCTGCCCGCG GTGGGGACCA GGGGTCTGTT CCTGCCACTC
CGGCAGGTGA CCCAAGTGCT GCGTGCGGAT CCCGCACAAC ACAGGGTGCA GATTCAATTA
CGGGGCAAGC AGTTGTGGGT GCCCGCCGCA CAATTTGCCG TGGACGCCGC GCTGCAGATC
CCAAAAGAAG CAGGCAGCAC CCAGTATGCC ACCCCCGACG ATCATCCCTG GCGTCTGGAT
TTGCGCGGAC AGCTTCGGGA AGACGCCCTG GCGGCGCTGC GTCGTCATGT GGATGGCGCC
GTAGCCGCCG GTCGTCGACA GGTCCAGATC CTGCATGGTA AGGGCAACGG GGTGCTCGCA
GAAATGGTGC GCGAGTTTGC CGGGCAAGAC CCTCGGGTCA GCCAGTGGCG TATGGCGCGG
CCAGAGCATG GCGGTGGCGG CGTCAGCGAG TTGGAGTTAC GCTGA
 
Protein sequence
MAVAGVAPWQ DSFWRALDAS DVQSAWSQHC GHIYGRRHVA ATTPWVPLAE IHDRLNWLAW 
LSQRFSAGFL LPVSDLPDID ALLVLAQRPG AVLSGRDLRA VLDVLTAQKG YAAALRETRD
ALTGLAGELD PPVTLLRRLG VALDEEGELL DTASADLALL RQRLRISRNE LQRFLQGFLR
NRDWQEYWQD QVIVQRNERY VLPLKASHKG RIKAIVHDRS ASGETLFVEP LAAVDLNNQL
VQDRRAEIQE QERILRALSA AVGLEVPGIV AALRHIGRLD AVRAGLELGD ACGGILPKVD
AAAAFDLRAL RHPLLCLRHP GQVVGNALRL GADAQQLVIT GPNTGGKTAI LKALGLNHLM
AYMGLPVTAE GTLGYFPKCF AVIGDAQDIH TDLSTFSAQV QRLREVLEHA DAHSLVLLDE
LGNGTDPREG GALAQAVAEA LLAAECCTLL TSHLEVMKRY ALSHAGVALA GMGFDAESLK
PTYRLLWGVG GASQGLVIAR RVGMPAPLMN RAEALYADDR ENWERWEAQR ETLLQAARQA
MDEAVLARDE ATSVARSLER ELEAARQERD KAAAAARAEW EDILATARQQ VRQAIAALKS
GRDTQAATAA LQRLEVPFRA EEQQVDSLPA VGTRGLFLPL RQVTQVLRAD PAQHRVQIQL
RGKQLWVPAA QFAVDAALQI PKEAGSTQYA TPDDHPWRLD LRGQLREDAL AALRRHVDGA
VAAGRRQVQI LHGKGNGVLA EMVREFAGQD PRVSQWRMAR PEHGGGGVSE LELR
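
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a nucleotide sequence. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code for elongation codons; alternative start codons are not handled, and characters other than A, C, G and T will raise a KeyError. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the usual TCAG codon order; table 11 shares these assignments
# with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons and drop a trailing stop codon, if any."""
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons).rstrip("*")


# First three blocks of the gene sequence in this record.
dna = clean_sequence("ATGGCTGTGG CCGGGGTTGC TCCTTGGCAG")
print(translate(dna))  # MAVAGVAPWQ
```

The printed peptide matches the first block of the protein sequence listed above. Applying the same functions to the full gene sequence is left to the reader.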