Gene Jann_0209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_0209 
Symbol 
ID3932646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp199959 
End bp202592 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content60% 
IMG OID637902551 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_508151 
Protein GI89052700 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.784326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGC AAGTGACGCC GATGATGGCG CAATATCTGG AGATCAAGGC AGCACATCCC 
GATGCACTGC TGTTCTACCG GATGGGCGAC TTCTACGAGA TGTTCTTTGA TGATGCCGTC
GCGGCGGCCG AGGCGTTGGA CATCGCGTTA ACCAAGCGCG GCAAGCACAA CGGCGAAGAT
ATCGCCATGT GTGGCGTGCC TGCCCATTCA GCGGAGAGCT ATCTACTGAC GCTGATCCGC
AAGGGCTTTC GTGTCGGTAT TTGCGAACAG ATGGAGGACC CCGCGACCGC AAAGAAACGC
GGTGGGTCCA AGGCCGTGGT GCGGCGGGAG GTCGTGCGGC TGGTGACACC CGGCACGCTG
ACCGAAGACA CATTGCTTGA GGCGCGGCGG CACAATTACC TCGCATCTTA CGCCGACGTT
CGCGGCGAAG GCGCGTTGGC GTGGTGCGAT ATCTCCACCG GAGACTTCCG CGTGATGCGC
TGTCCGGCGG TGCGTTTGGG GCCGGAATTG GCGCGGTTGT CGCCAAAGGA AGTGCTTGTG
TCCGACGGAT TGGATATCGC ACGCCAGGAC GCGATCATCG AGTTGGGCGC GGCCGTGACG
CCATTATCAC CCGGCTCTTT CGACAGCGCC TCTGCGGAAC AGCGGCTGGC GGGTTTGTTC
AGCGTCAGCA CCTTGGAGGG GTTCGGGAAC TTCGCCCGCG CCGAGGTGTC CGCCATGGGT
GCGCTGATCG ACTATCTGGA ATTGACCCAG AAAGGCGCAC TGCCGCTGCT TCGTGCACCG
CGACAGCAAT CGGCGGATCG TCTGCTACAG CTTGATACCG CCACCCGACG CAATCTGGAG
CTGACCCAAG CGCTATCTGG CGGGCGGGCC AATTCACTAC TGTCGGTGCT GGATCGGACA
CAAACTGCGG GTGGTGCGCG CTTGCTGCAA AGGCGCCTGA CGGGTCCATC CACGGACCTT
GACGTCATTC GGGCGCGCCA TGAAAGCGTT TCATTTTTTT TCTCTGACAC GCTGATTAGG
GACGATCTGG AGGCTGAACT TCGCCGTATC CCTGACCTGG ATCGCGCTCT ATCGCGTCTG
GCCTTGGATC GCGGCGGGCC ACGGGATTTG TCGGCCATCC GTGACGGATT GTCTGGTGCT
GCCCGACTGT CTGACAAATT AAAGATCGTG GACTTGCCGC CCCTTCTTGA AGGTGCCGTG
CAGGATCTTC AGGGGCACGA TGAGCTTTCC GCCCTTCTCG ACGAGGCGCT GGTGGCAGAA
CCCCCAGTTC AACTCCGTGA CGGAGGTTTG ATCGCGCCCG GTTACCACGC CGAATTGGAC
GAGGCGCGTA CCCTACGGGA TGAAGGGCGC AGCGTGATCG CCACGATGCA GGCGGACTAT
GTTGAGGCGA CTGGAGTGAA CGCCCTGAAG ATCAAGCACA ACAATGTGTT GGGGTATTTC
ATTGAAACCA CTGCCACCCA CGCCGAAAAG ATGTTGAACC CACCGCTGTC GGAACGGTTC
ATTCACCGCC AGACCACCGC CAACCAAGTG CGGTTTACAA CGGTGGAGCT TTCCGAGTTA
GAGACCAAAA TCCTGAACGC CGGAAATCGG GCGTTGGAAA TCGAGCGGGT GCTGTTCCAA
TCCCTGCGCG ACGCGATTTT GAATTGTCAG GATCAGATCG GCCAGGCCGC TCGCGGGTTG
GCAGAGTTGG ATCTATCGGC TGCGCTGGCC CGGCGTGCAC GGGAAGGGGA CTGGACGCAA
CCCGAGATGA CCGAGGATCG TGCGTTCATG ATTGAAGGGG CACGTCATCC GGTGGTGGAG
GCCGCATTGG CCAAGGACGG CACGTCATTT GTCGCCAACG ATTGTGACCT GAGCGCCGAG
GGTGGCGCCG CTATCACCTT GCTCACCGGG CCAAACATGG CGGGTAAATC GACGTATCTT
CGTCAGAACG CTCTTCTGGT GATTATGGCT CAAACCGGTA GTTTCGTGCC TGCGAAATGC
GCGAAGATCG GGCTTGTAAG CCAGGTGTTC AGCCGCGTCG GAGCCTCAGA CGATTTGGCG
CGGGGCCGAT CAACCTTCAT GGTCGAGATG GTGGAAACGG CGACGATCCT GAACCAAGCC
GACGACCGTG CTTTGGTGAT CCTTGATGAG ATCGGGCGGG GGACGGCGAC TTATGATGGC
CTGTCAATTG CCTGGGCGAC GTTGGAACAC CTGCATGAGA CCAACCAGTG CCGCGCGCTT
TTTGCGACGC ACTACCATGA AATGACGTCT CTTGCGTCCA AGCTGGACGG GCTGACCAAC
GCGACTGTGG CCGTGAAAGA ATGGGAGGGG GAGGTGATCT TCCTCCACGA AGTCCGCGAG
GGGGCCGCGG ACAGGTCTTA TGGCGTGCAG GTCGCAAAAC TCGCGGGGCT GCCAGACGCT
GTCATCGCAC GGGCGCAGGT CGTGTTGGAT GCGTTGGAGA AAGGCGAACG GGAGGGCGGC
GAACGCAAGG CTGTTATCGA CGACTTGCCT CTGTTCGCCA TGATGCCCGC GCCAGCGCCC
GCCCCATCGG CGCCGTCTTT GGTCGAAGAG AAACTCCGCG CCGTACATCC CGATGAGATG
ACGGCGCGCG AGGCACTGAA TTTGCTCTAT GAATTGAAGG CTGAGCTTAG CTAG
 
Protein sequence
MNAQVTPMMA QYLEIKAAHP DALLFYRMGD FYEMFFDDAV AAAEALDIAL TKRGKHNGED 
IAMCGVPAHS AESYLLTLIR KGFRVGICEQ MEDPATAKKR GGSKAVVRRE VVRLVTPGTL
TEDTLLEARR HNYLASYADV RGEGALAWCD ISTGDFRVMR CPAVRLGPEL ARLSPKEVLV
SDGLDIARQD AIIELGAAVT PLSPGSFDSA SAEQRLAGLF SVSTLEGFGN FARAEVSAMG
ALIDYLELTQ KGALPLLRAP RQQSADRLLQ LDTATRRNLE LTQALSGGRA NSLLSVLDRT
QTAGGARLLQ RRLTGPSTDL DVIRARHESV SFFFSDTLIR DDLEAELRRI PDLDRALSRL
ALDRGGPRDL SAIRDGLSGA ARLSDKLKIV DLPPLLEGAV QDLQGHDELS ALLDEALVAE
PPVQLRDGGL IAPGYHAELD EARTLRDEGR SVIATMQADY VEATGVNALK IKHNNVLGYF
IETTATHAEK MLNPPLSERF IHRQTTANQV RFTTVELSEL ETKILNAGNR ALEIERVLFQ
SLRDAILNCQ DQIGQAARGL AELDLSAALA RRAREGDWTQ PEMTEDRAFM IEGARHPVVE
AALAKDGTSF VANDCDLSAE GGAAITLLTG PNMAGKSTYL RQNALLVIMA QTGSFVPAKC
AKIGLVSQVF SRVGASDDLA RGRSTFMVEM VETATILNQA DDRALVILDE IGRGTATYDG
LSIAWATLEH LHETNQCRAL FATHYHEMTS LASKLDGLTN ATVAVKEWEG EVIFLHEVRE
GAADRSYGVQ VAKLAGLPDA VIARAQVVLD ALEKGEREGG ERKAVIDDLP LFAMMPAPAP
APSAPSLVEE KLRAVHPDEM TAREALNLLY ELKAELS