Gene Avin_38640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_38640 
SymbolmutS 
ID7762753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3907166 
End bp3909733 
Gene Length2568 bp 
Protein Length855 aa 
Translation table11 
GC content66% 
IMG OID643806727 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002800979 
Protein GI226945906 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.477357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGCC TTTCTCAGCA CACCCCGATG ATGCAGCAAT ACTGGAAGCT CAAGCGCGAG 
CATCCGGATC AGTTGATGTT CTACCGCATG GGCGACTTCT ACGAACTGTT CTACGAGGAC
GCCAAGAAAG CCGCCAAGCT GCTCGACATC ACCCTCACCG CCCGCGGTCA GTCGGCCGGC
AAGTCGATTC CCATGGCCGG CATTCCTTTC CACTCGGTCG AGGGGTATCT GGCCAAGCTG
GTCAAGCTCG GCGAGTCGGT GGCGATCTGC GAGCAGATCG GCGACCCGGC CACCACCAAG
GGACCGGTGG AGCGCCAGGT AGTGCGCATC ATCACCCCGG GCACGGTCAG CGACGAGGCA
CTGCTCGACG AGCGCCGCGA CAACCTGCTG GCGGCAGTGG TCGGTGACGA ACGGCTGTTC
GGTCTGGCGA TACTGGACAT CACCAGCGGT CGCTTCAACG TTCAGGAGAT CCAGGGCTGG
GAAAATCTGC TGGCCGAACT GGAACGCCTC AATCCGGCGG AGCTGCTGTA TCCCGACGAC
TGGCCGGCCG GTCTGCCGCT GGAAAAGCGT CGTGGCGCAC ACCGCCGGGC GCCCTGGGAC
TTCGACTTCG ACAGCGCGTA CAAGAGCCTC TGCCAGCAGT TCGCCACCCA GGATCTGAAA
GGCTTCGGCT GCGATGGGCT GGGTCTGGCC ATCGGTGCCG CCGGCTGCCT GCTGGCCTAC
GCCAGGGAAA CCCAGCGCAC AGCCCTGCCT CATCTGCGCG GTCTGCGTCA CGAGCGCCTG
GACGATACCG TGATCCTCGA CGGCGCCAGC CGGCGCAACC TGGAGCTGGA CGTCAACCTT
TCCGGCGGGC GCGATAATAC CCTGCAGTCG GTGATCGACC GTTGCCAGAC GGCCATGGGC
AGCCGCCTGC TCGGTCGCTG GCTGAATCGC CCGCTGCGCG ATCGCGCAGT GCTGGAGGCG
CGCCAGGACA CCGTCGCCTG CCTGCTGCAG GACTACCGCT TCGAGAGCCT GCAGCCGCAG
CTCAAGGAGA TCGGCGATGT CGAGCGCATC CTCGCCCGCA TCGGCCTGCG CAATGCCCGC
CCGCGCGATC TCGCCCGCCT GCGCGATGCT CTTGCCGCAC TGCCACAACT GCAGACGGCG
CTGAGTCCGC TGGAAGCGCC GCACCTGCAG GCCCTGGCCG GCAACATCCG GACCTATCCG
GAACTGGCCG AGCTGCTGAG ACGCGCCATC ATCGATAACC CGCCGGCGGT GATTCGCGAT
GGCGGAGTGC TCAAGCAGGG CTACGATGCG GAGCTGGACG AGCTTCTCTC GCTCAGCGAG
AACGCCGGCC AGTTCCTCAT GGACCTGGAG GCGCGGGAAA AGGCACGCAC CGGGCTGCCC
AACCTCAAGG TCGGCTACAA CCGCATCCAC GGCTATTACA TCGAGCTGCC GCGGGTACAG
GCCGAGCAGG CGCCGGCCGA CTATATCCGC CGCCAGACCC TGAAGGGCGC CGAACGCTTC
ATCACGCCCG AGCTGAAGGC CTTCGAGGAC AAGGCCCTGT CGGCCAAGAG CCGTGCCCTG
GCGCGGGAAA AGGCACTCTA CGAGGAACTG CTGGAGATCC TCATCGCCCA GTTGGCGCCG
CTGCAGGAAA CCGCGACCGC CCTGGCCGAA CTGGATGTGC TGGCCAACCT CGCCGAGCGG
GCCTTGAACC TCGACTTCAA TCGCCCACGC TTCGTCGAAG AACCCTGCCT GCGCATTCGC
CAGGGCCGCC ATCCGGTGGT CGAGCAGGTG CTGGACACAC CCTTCGTCGC CAACGATCTG
GAACTCGACG ACAACACCCG GATGCTGATC ATCACCGGCC CCAACATGGG CGGCAAATCC
ACCTACATGA GGCAAACGGC GCTGATCGTG CTGCTCGCCC ACATCGGCAG CTTCGTTCCG
GCGCAGAGCT GCGAGCTTTC CCTGGTGGAT CGCATTTTCA CCCGCATCGG CTCCAGCGAC
GACCTGGCCG GCGGCCGTTC CACCTTCATG GTGGAGATGA GCGAGACGGC CAACATCCTG
CACAATGCCA GCGAACGCAG CCTGGTGCTG ATGGACGAGG TCGGCCGCGG CACCAGCACT
TTCGACGGCC TGTCACTGGC CTGGGCGGCG GCCGAGCACC TGGCCGGCCT GCGCGCCTGG
ACCCTGTTCG CCACCCATTA TTTCGAGCTG ACCGTGCTGG CGGAAAGCCA ACCGGTAGTG
GCCAACGTGC ACCTGTCGGC CACCGAGCAC AACGAGCGCA TCGTCTTCCT CCACCATGTG
CTGCCGGGAC CGGCCAGCCA GAGTTACGGC CTGGCGGTCG CCCAACTGGC CGGCGTGCCT
GGACCGGTGA TCAGCCGTGC CCGCGAACAC CTGGCGCGTC TGGAGGCCAC CAGCCTGCCC
CATGAAGCGC CGCTCCGCGA AGCAGGCAAA CCCCAGCCGC CGATCCAGAG CGATCTGTTC
GCCAGCCTGC CGCATCCCCT GATGGAGGAA CTGGCACGCC TCAAGCCGGA CGACCTGAGC
CCGCGCCAGG CGCTTGAGCT ATTGTATTCG TGGAAAACGA GGCTCTAA
 
Protein sequence
MESLSQHTPM MQQYWKLKRE HPDQLMFYRM GDFYELFYED AKKAAKLLDI TLTARGQSAG 
KSIPMAGIPF HSVEGYLAKL VKLGESVAIC EQIGDPATTK GPVERQVVRI ITPGTVSDEA
LLDERRDNLL AAVVGDERLF GLAILDITSG RFNVQEIQGW ENLLAELERL NPAELLYPDD
WPAGLPLEKR RGAHRRAPWD FDFDSAYKSL CQQFATQDLK GFGCDGLGLA IGAAGCLLAY
ARETQRTALP HLRGLRHERL DDTVILDGAS RRNLELDVNL SGGRDNTLQS VIDRCQTAMG
SRLLGRWLNR PLRDRAVLEA RQDTVACLLQ DYRFESLQPQ LKEIGDVERI LARIGLRNAR
PRDLARLRDA LAALPQLQTA LSPLEAPHLQ ALAGNIRTYP ELAELLRRAI IDNPPAVIRD
GGVLKQGYDA ELDELLSLSE NAGQFLMDLE AREKARTGLP NLKVGYNRIH GYYIELPRVQ
AEQAPADYIR RQTLKGAERF ITPELKAFED KALSAKSRAL AREKALYEEL LEILIAQLAP
LQETATALAE LDVLANLAER ALNLDFNRPR FVEEPCLRIR QGRHPVVEQV LDTPFVANDL
ELDDNTRMLI ITGPNMGGKS TYMRQTALIV LLAHIGSFVP AQSCELSLVD RIFTRIGSSD
DLAGGRSTFM VEMSETANIL HNASERSLVL MDEVGRGTST FDGLSLAWAA AEHLAGLRAW
TLFATHYFEL TVLAESQPVV ANVHLSATEH NERIVFLHHV LPGPASQSYG LAVAQLAGVP
GPVISRAREH LARLEATSLP HEAPLREAGK PQPPIQSDLF ASLPHPLMEE LARLKPDDLS
PRQALELLYS WKTRL