Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_05155 |
Symbol | |
ID | 9296523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 1169815 |
End bp | 1172424 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | putative DNA mismatch repair protein MutS |
Protein accession | YP_003715795 |
Protein GI | 298207616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAG CAAAGAAAGT CACACCATTA ATGCAGCAAT ATAATAGCAT CAAGACAAAG TATCCTGATG CCTTATTGTT ATTTCGCGTA GGCGATTTTT ACGAAACTTT TGGGGAAGAT GCTGTAAAAG CAGCACGCAT ATTAAATATA GTGCTCACTA ACCGAAATAA CGGTGGCGAG CGTACAGAGC TTGCAGGATT TCCACACCAT TCATTAAATA CCTACCTACC CAAATTAGTA AAAGCAGGAG AACGTGTGGC TATTTGTGAC CAACTAGAAG ATCCAAAAGC TACTAAAAGT ATTGTTAAAC GTGGCGTTAC AGAACTTGTT ACGCCAGGTG TTGCACTAAA TGATGAGGTG CTACAGAGCA ATTCTAATAA CTTCCTTGCT TCAGTTTACA TTGGAAAAAA GCAAATGGGT GTAGCGTTTT TAGATGTTTC AACAGGCGAA TTTCTTACAG CGCAAGGCTC TTCAGAATAT ATAGATAAAT TACTGCAAAA TTTTGCGCCT AGTGAAATAC TTATTGCAAA ACAAAAGAAA GCAGATTTTA CAGCAATCTT TGGGTCAGAT TTTCATACAT TTTATATTGA AGATTGGGTG TTTAAGACAG ACTATGCCCA CGAAACACTA CATCAACATT TTGGTGTAAA ATCATTAAAA GGCTTTGGTG TAGATCATTT AGAGGATGGT ATCATAGCCT CTGGAGCTAT ATTATATTAC CTAAGTGAAA CACAACATCA TAAATTAAAA CATATTACAA GCATAAGCCG CATTGCAGAA GACGCCTATG TTTGGATGGA TCGTTTTACT ATAAGAAATC TAGAGCTTTA TCAAGGCACA TCTTTACAGT CTGTAACTTT ATTAGATGTT ATAGATAAAA CAACATCTCC TATGGGAGGT AGAACATTAA AGCGTTGGTT GGCACTGCCA TTAAAAAACG CTGAAAAAAT AAAAAAACGT CACCGAGTTG TAAACTATTT CCTTAAGCAA AAAACATTAT TGAGTGATGT CACGTCTCAT ATAAAACAGA TTGGAGATAT AGAGCGTCTC ATTTCTAAAG TAGCTACCGC TAAAGTAAGC CCAAGAGAAG TTATTCAACT TAAAAACTCA TTAGATGCTA TTGTGCCTAT TAAGACATTA GCCCTTAAAT CTGAAAACGA TGCTCTAAAA GTTATAGGTG ATAATTTACA GTCTTGTGAT TTATTGCGAG GAAAAATAAC AGAAACCTTA AATGAAGAAG CACCAGTTAA TATACTAAAG GGTAGTACTA TAGCTAGAGG ATTTTCCAAA GAGCTGGATG AGCTTAGAGA TATACGTTTT TCTGGAAAAG AATATCTAGA TAAAATGCTT CAGAGAGAAA CAGAGGCTAC TGGTATTACA TCATTAAAAA TAGCAAGCAA CAATGTTTTT GGATATTATA TTGAAGTGAG AAATTCTCAT AAAGATAAGG TTCCAGAAAA CTGGGTTAGA AAACAAACTT TGGTAAATGC AGAGCGGTAT ATTACTGAAG AATTAAAAGA ATACGAAGCT AAAATTTTAG GAGCAGAAGA GAAGATTGTG CAAATAGAGC AAGAGTTGTT CTCTAAATTA GTTACTTGGA TTTCAGACTA CATAAAACCA GTACAGCAAA ATGCACATCT TATAGGAGAA ATAGACTGTC TTTGTGGTTT TGCTACACAA GCTATGCAGG AAAACTATTG TTTGCCAGAA ATCACAGAAG ACTATAGTTT AGAGATTACA GAAGGAAGGC ATCCCGTTAT TGAAAAACAG TTGCCACTTG GAGAACCCTA TATAACTAAC GATATCTTGC TTAATCGTGA TGATCAGCAA ATGATTATGA TAACTGGGCC AAATATGAGT GGTAAGTCAG CTATCCTAAG ACAAACGGCA CTAATTGTAT TATTAGCTCA AATGGGAAGT TTTGTGCCTG CTAAAGCTGC CAAAATAGGA TTAGTAGATA AGATTTTTAC TAGAGTAGGC GCAAGTGATA ATATTTCGAT GGGTGAAAGT ACATTTATGG TCGAGATGAA TGAAACTGCG AGTATTCTTA ATAATCTTTC AGATCGTAGT TTAGTGCTTT TAGATGAGAT AGGTCGTGGT ACAAGTACAT ATGATGGTAT ATCTATAGCT TGGGCAATTA GTGAATACTT ACATGAACAC CCAGCAAAGG CTAAGACACT ATTTGCAACT CATTATCATG AGTTAAATGA GATGACAGAA ACCTTTGAGC GCATTAAGAA TTATAATGTG TCTGTAAAAG AATTAAAAGA TAATGTACTC TTTTTAAGAA AACTAGTTCC AGGAGGTAGC GAACATAGCT TCGGAATTCA CGTAGCTAAA ATGGCAGGAA TGCCACAACA GGTATTGCAT CGAGCAAATA AAATATTAAA GAAATTAGAG AAAAGTCATT CTTCTGAAGA GTTAAGCGGA CAGATAAAAA AAGCAACAGA GCAAGAACCA CAATTAAGCT TCTTTAAGTT AGACGATCCT TTATTAGAAG ATATAAAGCA GGAAATCATA CAAGTAGACA TAAATACTTT AACGCCAGTT GAAGCATTAA TGAAGTTAAA TGAGATTAAA AGAATGCTTG TCCCAAAAGG AAATGATTAA
|
Protein sequence | MAKAKKVTPL MQQYNSIKTK YPDALLLFRV GDFYETFGED AVKAARILNI VLTNRNNGGE RTELAGFPHH SLNTYLPKLV KAGERVAICD QLEDPKATKS IVKRGVTELV TPGVALNDEV LQSNSNNFLA SVYIGKKQMG VAFLDVSTGE FLTAQGSSEY IDKLLQNFAP SEILIAKQKK ADFTAIFGSD FHTFYIEDWV FKTDYAHETL HQHFGVKSLK GFGVDHLEDG IIASGAILYY LSETQHHKLK HITSISRIAE DAYVWMDRFT IRNLELYQGT SLQSVTLLDV IDKTTSPMGG RTLKRWLALP LKNAEKIKKR HRVVNYFLKQ KTLLSDVTSH IKQIGDIERL ISKVATAKVS PREVIQLKNS LDAIVPIKTL ALKSENDALK VIGDNLQSCD LLRGKITETL NEEAPVNILK GSTIARGFSK ELDELRDIRF SGKEYLDKML QRETEATGIT SLKIASNNVF GYYIEVRNSH KDKVPENWVR KQTLVNAERY ITEELKEYEA KILGAEEKIV QIEQELFSKL VTWISDYIKP VQQNAHLIGE IDCLCGFATQ AMQENYCLPE ITEDYSLEIT EGRHPVIEKQ LPLGEPYITN DILLNRDDQQ MIMITGPNMS GKSAILRQTA LIVLLAQMGS FVPAKAAKIG LVDKIFTRVG ASDNISMGES TFMVEMNETA SILNNLSDRS LVLLDEIGRG TSTYDGISIA WAISEYLHEH PAKAKTLFAT HYHELNEMTE TFERIKNYNV SVKELKDNVL FLRKLVPGGS EHSFGIHVAK MAGMPQQVLH RANKILKKLE KSHSSEELSG QIKKATEQEP QLSFFKLDDP LLEDIKQEII QVDINTLTPV EALMKLNEIK RMLVPKGND
|
| |