Gene Information |
Locus tag | CA2559_07070 |
Symbol | |
ID | 9296910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 1573946 |
End bp | 1575799 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | putative DNA mismatch repair protein |
Protein accession | YP_003716173 |
Protein GI | 298207994 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
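Note: the coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical and are not part of this record or of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1573946
END_BP = 1575799
GENE_LENGTH_BP = 1854
PROTEIN_LENGTH_AA = 617


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of residue-encoding codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1575799 - 1573946 + 1 = 1854 bp, and 1854 / 3 - 1 = 617 aa, which matches the listed Gene Length and Protein Length.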
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.748012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACA TCATTCAACT CTTACCAGAC CACGTTGCCA ACCAAATTGC AGCTGGAGAA GTAGTACAAC GACCAGCTTC TGTAGTTAAG GAGTTGCTTG AAAATGCATT AGATGCCGGA TCGGAAAGTA TACAGCTAAT TGTTAAGGAA GCAGGAAAAA TACTTATACA AGTTATAGAT GATGGTAAGG GAATGAGCGT GACAGATGCA CGATTATCTT TTGAACGCCA TGCAACATCT AAAATAAAGT CTGCTGAAGA CCTTTTCGCA ATAAATACAA AAGGTTTTAG GGGTGAAGCA TTAGCATCAA TCGCTGCGGT CTCTCACGTA GAGATGAAAA CCAAACAGAC TCAAGATGAA GTAGGGACTT ATATAAAAAT TGAAGGTTCA GAAATAACAA CTCAAGATGT TTGTGTAACA CCAAAGGGCA CAAGTATCTC TGTAAAAAAC TTATTTTATA ATATACCTGC ACGCCGAAAT TTCTTAAAAT CTGATGCTGT AGAAATGCGT CATATTATAG ATGAATTTCA ACGTGTAGCA CTTGCACATC CATCTGTTGC TTTTGATCTT CATCATAACG GCAGCAGTCT ATTTAGTTTG CCAAGTAGTA ATTATAGACA ACGCATAGTA AATATTTTAG GAACTAAGAC TAATGAACGT TTAGTGCCTG TTGAGGAGGA AACAGATATC GTTAAGATAT CTGGATTTGT CGGAAAACCA GAATTTGCAA AACGTACACG TGGCGAGCAA TTCTTCTTTG TAAATAACAG GTATATAAAG AGTTCTTATT TACATCATTC CATTGTAAGT GCCTTTGAAG GATTATTAAG AGATAAAAGC CATCCAAGCT ATTTTTTATA TCTAGATGTA GACCCTAAAA CTATAGATAT AAACATACAT CCCACAAAAA CAGAAATTAA GTTTGAAGAT GAGCATACAC TATACGCTAT GCTAAAAAGT GTGGTAAAAC ATAGTTTAGG ACAATTTAGT GTAGCACCAG TTTTAGATTT TGATAGGGAT AGTGATTTAG ATACACCTTA CAGTTATAAA GAAAAACACG TTAGCATCCC AAAAATAGAT GTAGATAGAA ATTTTAACCC TTTTCAGGAC TCACAACCTG CAACAGCTTC AAAACAATCT CGTAATTTTC AGCAAAAGCC AACACAATCT TGGGAAAGCT TATATGTAGG CGCAAATACA GAGGCAATTG AAGATAATTT TCAGACATCT GCTATAGAGT TTGAAAGTGA CGAGGTAACG GGCAATTTAT TTGAAAACGA AACAGAAGAG CATCAAGCTA ATAGTTTTCA GATACATAGA AAATATATAG TAAGCACAAT AAAAAGTGGT TTGTTGGTTG TAGATCAGCA TAGAGCGCAC ACTCGCGTAC TTTATGAGGA GCTTCTAAAA AATATAACAA TGAGCTCTGC AGTGAGCCAA CAGTTATTAT TTCCAATCGA GCTCCAGTTT AATGCTAACG AGCTAAGCTT ATTAAATGAA TTAAAAGATT CTTTAGTACA AACAGGTTTT GTGTTTGAGG GAAGCCAAGA ACATACATTA GTAGTTTCAG GAATACCCAC AATAATAAAT GAAAGTAACA TACAGGATTT ATTACAAAGG TTACTTAGTG ATTTAGAGCA GGAAGTGCCA GGAAATCAGT TTTCGCAAAA TGACACTTTG GCAAAAAGTA TGGCAAAAAG TATGGCTGTA AAAGCAGGCA CGGTTTTGAA TGTAGAAGCA CAACAACATT TGTTAAACCA ATTATTTGCT TGTAAAGAAC CTAGCGTTAC ACCACAAAAC 
AGAAAAGTGT TTGTAACACT TACCAGCAAC GATCTGGATA ATAAGTTTAT CTAA
|
Protein sequence | MSDIIQLLPD HVANQIAAGE VVQRPASVVK ELLENALDAG SESIQLIVKE AGKILIQVID DGKGMSVTDA RLSFERHATS KIKSAEDLFA INTKGFRGEA LASIAAVSHV EMKTKQTQDE VGTYIKIEGS EITTQDVCVT PKGTSISVKN LFYNIPARRN FLKSDAVEMR HIIDEFQRVA LAHPSVAFDL HHNGSSLFSL PSSNYRQRIV NILGTKTNER LVPVEEETDI VKISGFVGKP EFAKRTRGEQ FFFVNNRYIK SSYLHHSIVS AFEGLLRDKS HPSYFLYLDV DPKTIDINIH PTKTEIKFED EHTLYAMLKS VVKHSLGQFS VAPVLDFDRD SDLDTPYSYK EKHVSIPKID VDRNFNPFQD SQPATASKQS RNFQQKPTQS WESLYVGANT EAIEDNFQTS AIEFESDEVT GNLFENETEE HQANSFQIHR KYIVSTIKSG LLVVDQHRAH TRVLYEELLK NITMSSAVSQ QLLFPIELQF NANELSLLNE LKDSLVQTGF VFEGSQEHTL VVSGIPTIIN ESNIQDLLQR LLSDLEQEVP GNQFSQNDTL AKSMAKSMAV KAGTVLNVEA QQHLLNQLFA CKEPSVTPQN RKVFVTLTSN DLDNKFI
|
| |