Gene CA2559_05155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_05155 
Symbol 
ID9296523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp1169815 
End bp1172424 
Gene Length2610 bp 
Protein Length869 aa 
Translation table11 
GC content35% 
IMG OID 
Productputative DNA mismatch repair protein MutS 
Protein accessionYP_003715795 
Protein GI298207616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAG CAAAGAAAGT CACACCATTA ATGCAGCAAT ATAATAGCAT CAAGACAAAG 
TATCCTGATG CCTTATTGTT ATTTCGCGTA GGCGATTTTT ACGAAACTTT TGGGGAAGAT
GCTGTAAAAG CAGCACGCAT ATTAAATATA GTGCTCACTA ACCGAAATAA CGGTGGCGAG
CGTACAGAGC TTGCAGGATT TCCACACCAT TCATTAAATA CCTACCTACC CAAATTAGTA
AAAGCAGGAG AACGTGTGGC TATTTGTGAC CAACTAGAAG ATCCAAAAGC TACTAAAAGT
ATTGTTAAAC GTGGCGTTAC AGAACTTGTT ACGCCAGGTG TTGCACTAAA TGATGAGGTG
CTACAGAGCA ATTCTAATAA CTTCCTTGCT TCAGTTTACA TTGGAAAAAA GCAAATGGGT
GTAGCGTTTT TAGATGTTTC AACAGGCGAA TTTCTTACAG CGCAAGGCTC TTCAGAATAT
ATAGATAAAT TACTGCAAAA TTTTGCGCCT AGTGAAATAC TTATTGCAAA ACAAAAGAAA
GCAGATTTTA CAGCAATCTT TGGGTCAGAT TTTCATACAT TTTATATTGA AGATTGGGTG
TTTAAGACAG ACTATGCCCA CGAAACACTA CATCAACATT TTGGTGTAAA ATCATTAAAA
GGCTTTGGTG TAGATCATTT AGAGGATGGT ATCATAGCCT CTGGAGCTAT ATTATATTAC
CTAAGTGAAA CACAACATCA TAAATTAAAA CATATTACAA GCATAAGCCG CATTGCAGAA
GACGCCTATG TTTGGATGGA TCGTTTTACT ATAAGAAATC TAGAGCTTTA TCAAGGCACA
TCTTTACAGT CTGTAACTTT ATTAGATGTT ATAGATAAAA CAACATCTCC TATGGGAGGT
AGAACATTAA AGCGTTGGTT GGCACTGCCA TTAAAAAACG CTGAAAAAAT AAAAAAACGT
CACCGAGTTG TAAACTATTT CCTTAAGCAA AAAACATTAT TGAGTGATGT CACGTCTCAT
ATAAAACAGA TTGGAGATAT AGAGCGTCTC ATTTCTAAAG TAGCTACCGC TAAAGTAAGC
CCAAGAGAAG TTATTCAACT TAAAAACTCA TTAGATGCTA TTGTGCCTAT TAAGACATTA
GCCCTTAAAT CTGAAAACGA TGCTCTAAAA GTTATAGGTG ATAATTTACA GTCTTGTGAT
TTATTGCGAG GAAAAATAAC AGAAACCTTA AATGAAGAAG CACCAGTTAA TATACTAAAG
GGTAGTACTA TAGCTAGAGG ATTTTCCAAA GAGCTGGATG AGCTTAGAGA TATACGTTTT
TCTGGAAAAG AATATCTAGA TAAAATGCTT CAGAGAGAAA CAGAGGCTAC TGGTATTACA
TCATTAAAAA TAGCAAGCAA CAATGTTTTT GGATATTATA TTGAAGTGAG AAATTCTCAT
AAAGATAAGG TTCCAGAAAA CTGGGTTAGA AAACAAACTT TGGTAAATGC AGAGCGGTAT
ATTACTGAAG AATTAAAAGA ATACGAAGCT AAAATTTTAG GAGCAGAAGA GAAGATTGTG
CAAATAGAGC AAGAGTTGTT CTCTAAATTA GTTACTTGGA TTTCAGACTA CATAAAACCA
GTACAGCAAA ATGCACATCT TATAGGAGAA ATAGACTGTC TTTGTGGTTT TGCTACACAA
GCTATGCAGG AAAACTATTG TTTGCCAGAA ATCACAGAAG ACTATAGTTT AGAGATTACA
GAAGGAAGGC ATCCCGTTAT TGAAAAACAG TTGCCACTTG GAGAACCCTA TATAACTAAC
GATATCTTGC TTAATCGTGA TGATCAGCAA ATGATTATGA TAACTGGGCC AAATATGAGT
GGTAAGTCAG CTATCCTAAG ACAAACGGCA CTAATTGTAT TATTAGCTCA AATGGGAAGT
TTTGTGCCTG CTAAAGCTGC CAAAATAGGA TTAGTAGATA AGATTTTTAC TAGAGTAGGC
GCAAGTGATA ATATTTCGAT GGGTGAAAGT ACATTTATGG TCGAGATGAA TGAAACTGCG
AGTATTCTTA ATAATCTTTC AGATCGTAGT TTAGTGCTTT TAGATGAGAT AGGTCGTGGT
ACAAGTACAT ATGATGGTAT ATCTATAGCT TGGGCAATTA GTGAATACTT ACATGAACAC
CCAGCAAAGG CTAAGACACT ATTTGCAACT CATTATCATG AGTTAAATGA GATGACAGAA
ACCTTTGAGC GCATTAAGAA TTATAATGTG TCTGTAAAAG AATTAAAAGA TAATGTACTC
TTTTTAAGAA AACTAGTTCC AGGAGGTAGC GAACATAGCT TCGGAATTCA CGTAGCTAAA
ATGGCAGGAA TGCCACAACA GGTATTGCAT CGAGCAAATA AAATATTAAA GAAATTAGAG
AAAAGTCATT CTTCTGAAGA GTTAAGCGGA CAGATAAAAA AAGCAACAGA GCAAGAACCA
CAATTAAGCT TCTTTAAGTT AGACGATCCT TTATTAGAAG ATATAAAGCA GGAAATCATA
CAAGTAGACA TAAATACTTT AACGCCAGTT GAAGCATTAA TGAAGTTAAA TGAGATTAAA
AGAATGCTTG TCCCAAAAGG AAATGATTAA
 
Protein sequence
MAKAKKVTPL MQQYNSIKTK YPDALLLFRV GDFYETFGED AVKAARILNI VLTNRNNGGE 
RTELAGFPHH SLNTYLPKLV KAGERVAICD QLEDPKATKS IVKRGVTELV TPGVALNDEV
LQSNSNNFLA SVYIGKKQMG VAFLDVSTGE FLTAQGSSEY IDKLLQNFAP SEILIAKQKK
ADFTAIFGSD FHTFYIEDWV FKTDYAHETL HQHFGVKSLK GFGVDHLEDG IIASGAILYY
LSETQHHKLK HITSISRIAE DAYVWMDRFT IRNLELYQGT SLQSVTLLDV IDKTTSPMGG
RTLKRWLALP LKNAEKIKKR HRVVNYFLKQ KTLLSDVTSH IKQIGDIERL ISKVATAKVS
PREVIQLKNS LDAIVPIKTL ALKSENDALK VIGDNLQSCD LLRGKITETL NEEAPVNILK
GSTIARGFSK ELDELRDIRF SGKEYLDKML QRETEATGIT SLKIASNNVF GYYIEVRNSH
KDKVPENWVR KQTLVNAERY ITEELKEYEA KILGAEEKIV QIEQELFSKL VTWISDYIKP
VQQNAHLIGE IDCLCGFATQ AMQENYCLPE ITEDYSLEIT EGRHPVIEKQ LPLGEPYITN
DILLNRDDQQ MIMITGPNMS GKSAILRQTA LIVLLAQMGS FVPAKAAKIG LVDKIFTRVG
ASDNISMGES TFMVEMNETA SILNNLSDRS LVLLDEIGRG TSTYDGISIA WAISEYLHEH
PAKAKTLFAT HYHELNEMTE TFERIKNYNV SVKELKDNVL FLRKLVPGGS EHSFGIHVAK
MAGMPQQVLH RANKILKKLE KSHSSEELSG QIKKATEQEP QLSFFKLDDP LLEDIKQEII
QVDINTLTPV EALMKLNEIK RMLVPKGND