Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_38640 |
Symbol | mutS |
ID | 7762753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3907166 |
End bp | 3909733 |
Gene Length | 2568 bp |
Protein Length | 855 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806727 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002800979 |
Protein GI | 226945906 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.477357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCC TTTCTCAGCA CACCCCGATG ATGCAGCAAT ACTGGAAGCT CAAGCGCGAG CATCCGGATC AGTTGATGTT CTACCGCATG GGCGACTTCT ACGAACTGTT CTACGAGGAC GCCAAGAAAG CCGCCAAGCT GCTCGACATC ACCCTCACCG CCCGCGGTCA GTCGGCCGGC AAGTCGATTC CCATGGCCGG CATTCCTTTC CACTCGGTCG AGGGGTATCT GGCCAAGCTG GTCAAGCTCG GCGAGTCGGT GGCGATCTGC GAGCAGATCG GCGACCCGGC CACCACCAAG GGACCGGTGG AGCGCCAGGT AGTGCGCATC ATCACCCCGG GCACGGTCAG CGACGAGGCA CTGCTCGACG AGCGCCGCGA CAACCTGCTG GCGGCAGTGG TCGGTGACGA ACGGCTGTTC GGTCTGGCGA TACTGGACAT CACCAGCGGT CGCTTCAACG TTCAGGAGAT CCAGGGCTGG GAAAATCTGC TGGCCGAACT GGAACGCCTC AATCCGGCGG AGCTGCTGTA TCCCGACGAC TGGCCGGCCG GTCTGCCGCT GGAAAAGCGT CGTGGCGCAC ACCGCCGGGC GCCCTGGGAC TTCGACTTCG ACAGCGCGTA CAAGAGCCTC TGCCAGCAGT TCGCCACCCA GGATCTGAAA GGCTTCGGCT GCGATGGGCT GGGTCTGGCC ATCGGTGCCG CCGGCTGCCT GCTGGCCTAC GCCAGGGAAA CCCAGCGCAC AGCCCTGCCT CATCTGCGCG GTCTGCGTCA CGAGCGCCTG GACGATACCG TGATCCTCGA CGGCGCCAGC CGGCGCAACC TGGAGCTGGA CGTCAACCTT TCCGGCGGGC GCGATAATAC CCTGCAGTCG GTGATCGACC GTTGCCAGAC GGCCATGGGC AGCCGCCTGC TCGGTCGCTG GCTGAATCGC CCGCTGCGCG ATCGCGCAGT GCTGGAGGCG CGCCAGGACA CCGTCGCCTG CCTGCTGCAG GACTACCGCT TCGAGAGCCT GCAGCCGCAG CTCAAGGAGA TCGGCGATGT CGAGCGCATC CTCGCCCGCA TCGGCCTGCG CAATGCCCGC CCGCGCGATC TCGCCCGCCT GCGCGATGCT CTTGCCGCAC TGCCACAACT GCAGACGGCG CTGAGTCCGC TGGAAGCGCC GCACCTGCAG GCCCTGGCCG GCAACATCCG GACCTATCCG GAACTGGCCG AGCTGCTGAG ACGCGCCATC ATCGATAACC CGCCGGCGGT GATTCGCGAT GGCGGAGTGC TCAAGCAGGG CTACGATGCG GAGCTGGACG AGCTTCTCTC GCTCAGCGAG AACGCCGGCC AGTTCCTCAT GGACCTGGAG GCGCGGGAAA AGGCACGCAC CGGGCTGCCC AACCTCAAGG TCGGCTACAA CCGCATCCAC GGCTATTACA TCGAGCTGCC GCGGGTACAG GCCGAGCAGG CGCCGGCCGA CTATATCCGC CGCCAGACCC TGAAGGGCGC CGAACGCTTC ATCACGCCCG AGCTGAAGGC CTTCGAGGAC AAGGCCCTGT CGGCCAAGAG CCGTGCCCTG GCGCGGGAAA AGGCACTCTA CGAGGAACTG CTGGAGATCC TCATCGCCCA GTTGGCGCCG CTGCAGGAAA CCGCGACCGC CCTGGCCGAA CTGGATGTGC TGGCCAACCT CGCCGAGCGG GCCTTGAACC TCGACTTCAA TCGCCCACGC TTCGTCGAAG AACCCTGCCT GCGCATTCGC CAGGGCCGCC ATCCGGTGGT CGAGCAGGTG CTGGACACAC CCTTCGTCGC CAACGATCTG GAACTCGACG ACAACACCCG GATGCTGATC ATCACCGGCC CCAACATGGG CGGCAAATCC ACCTACATGA GGCAAACGGC GCTGATCGTG CTGCTCGCCC ACATCGGCAG CTTCGTTCCG GCGCAGAGCT GCGAGCTTTC CCTGGTGGAT CGCATTTTCA CCCGCATCGG CTCCAGCGAC GACCTGGCCG GCGGCCGTTC CACCTTCATG GTGGAGATGA GCGAGACGGC CAACATCCTG CACAATGCCA GCGAACGCAG CCTGGTGCTG ATGGACGAGG TCGGCCGCGG CACCAGCACT TTCGACGGCC TGTCACTGGC CTGGGCGGCG GCCGAGCACC TGGCCGGCCT GCGCGCCTGG ACCCTGTTCG CCACCCATTA TTTCGAGCTG ACCGTGCTGG CGGAAAGCCA ACCGGTAGTG GCCAACGTGC ACCTGTCGGC CACCGAGCAC AACGAGCGCA TCGTCTTCCT CCACCATGTG CTGCCGGGAC CGGCCAGCCA GAGTTACGGC CTGGCGGTCG CCCAACTGGC CGGCGTGCCT GGACCGGTGA TCAGCCGTGC CCGCGAACAC CTGGCGCGTC TGGAGGCCAC CAGCCTGCCC CATGAAGCGC CGCTCCGCGA AGCAGGCAAA CCCCAGCCGC CGATCCAGAG CGATCTGTTC GCCAGCCTGC CGCATCCCCT GATGGAGGAA CTGGCACGCC TCAAGCCGGA CGACCTGAGC CCGCGCCAGG CGCTTGAGCT ATTGTATTCG TGGAAAACGA GGCTCTAA
|
Protein sequence | MESLSQHTPM MQQYWKLKRE HPDQLMFYRM GDFYELFYED AKKAAKLLDI TLTARGQSAG KSIPMAGIPF HSVEGYLAKL VKLGESVAIC EQIGDPATTK GPVERQVVRI ITPGTVSDEA LLDERRDNLL AAVVGDERLF GLAILDITSG RFNVQEIQGW ENLLAELERL NPAELLYPDD WPAGLPLEKR RGAHRRAPWD FDFDSAYKSL CQQFATQDLK GFGCDGLGLA IGAAGCLLAY ARETQRTALP HLRGLRHERL DDTVILDGAS RRNLELDVNL SGGRDNTLQS VIDRCQTAMG SRLLGRWLNR PLRDRAVLEA RQDTVACLLQ DYRFESLQPQ LKEIGDVERI LARIGLRNAR PRDLARLRDA LAALPQLQTA LSPLEAPHLQ ALAGNIRTYP ELAELLRRAI IDNPPAVIRD GGVLKQGYDA ELDELLSLSE NAGQFLMDLE AREKARTGLP NLKVGYNRIH GYYIELPRVQ AEQAPADYIR RQTLKGAERF ITPELKAFED KALSAKSRAL AREKALYEEL LEILIAQLAP LQETATALAE LDVLANLAER ALNLDFNRPR FVEEPCLRIR QGRHPVVEQV LDTPFVANDL ELDDNTRMLI ITGPNMGGKS TYMRQTALIV LLAHIGSFVP AQSCELSLVD RIFTRIGSSD DLAGGRSTFM VEMSETANIL HNASERSLVL MDEVGRGTST FDGLSLAWAA AEHLAGLRAW TLFATHYFEL TVLAESQPVV ANVHLSATEH NERIVFLHHV LPGPASQSYG LAVAQLAGVP GPVISRAREH LARLEATSLP HEAPLREAGK PQPPIQSDLF ASLPHPLMEE LARLKPDDLS PRQALELLYS WKTRL
|
| |