Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_02951 |
Symbol | |
ID | 4780110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 273492 |
End bp | 275906 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083560 |
Product | DNA mismatch repair protein MutS family protein |
Protein accession | YP_001014124 |
Protein GI | 124025008 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0619291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTAA CAAAAAATCA TGATGATTCT AAAAAGGCAC AAATAATATC AGAGTCTTTG GATTTGCTTG AATGGCCAAC TGTTTGTAGC CATTTGTCTA CATTCGCTCT TACTCAACAA GGTCGTAAAA AATGTGAAAG CTTTGATTTG CCACGAAATC TATCTTTAAG CCAAGAGCTA TTGTCCCAAA CATTAGAAAT TGGGTCATTA GATAGTTCTC TTAATGAAGG AATATCTTTT GATGGTGTTC ATGATTTGGA AAATATACTT TTGATATGCT CCAAAGGAGG TATTGCTATT GGTGAGGATT TATTAAAAGT AGCTGATACT TTAAGAGCTG CTAGAAAATT ACGAAAACTA ATATTTGATC AAGTGATACG TCCACGACTT TCTGAATTGC TCAAAGATGT TGCAACTTTG CCAGATTTGC AAAAACTCCT CGAATTCGGG CTTGATGAAG GTGGGCGAAT TGCAGATCGT GCTAGCCCAA AGCTTTCTGA ATTACGACGT TATAGAAATT CCGTACGTCT TCAAAGAAAA GATATTCTAC AAGATATCAT CCGGAAATAT GGTGGATTAC TTCAAGATAA TATTATTTCA GAGAGGTATG GACGACCTGT TTTAGCGTTT AAGGCTGGGA CTTCTGATCA AATTAAAGGA ATGGTTCATG ATAGTTCGGC CTCTGGAAAC ACGATATATG TCGAGCCCCA AGTTGTCATA TCAATAGGAA ATCGTTTAGC TAAGATAGAT TCTGAAATCT CAGATGAAGA GAGGAGACTT TTAGCTGATT GGAGTAAAGA GGTTGGTCTT AATGCAATTG TAATAGCTCA TTTAGTAGAG ATCCTTTTGC AAATTGAGTT TGCATTGTCT CGAGCACGTT ATTCTAAATG GCTTAATGGG GTCCCTGCAA TTCTTGATCA AGAAGAACAT TCACTCTTTG AGATCAAAGA TTTTCGTCAT CCTTTATTAG TATGGAATGA CTTCCATGAG AAAAAGAATA CAGTAGTTCC AACTAGTTTT GATGTCGCCC CTGATTTAAA AGTTGTTGCG ATTACAGGCC CTAATACTGG AGGGAAAACA GTTGCTTTGA AAAGTATTGG TTTAGCAGTT TTAATGGCAA AAGCGGGGTT GCTTTTGCCA TGTACAGGCT CACCAAGATT ACCATGGTGT AAAAATGTTT TCGCTGATAT TGGTGATGAG CAATCTTTAC AGCAAAATTT GTCTACATTT AGCGGACATA TTCTTCGTAT AAGTCGAATA CTCGACGCTA TAGATGTGTT CCCTGGTACG ACTCTCGTTC TTTTAGATGA AGTTGGAGCT GGAACTGATC CAACTGAAGG CACAGCATTG GCCATGGCAC TCCTACAGGT AATGGCTAAT AGAGCAAGAT TAACTATCGC GACTACTCAT TTTGGACAAT TAAAAGCGCT CAAATATAGT GATTCAAGAT TTGAAAATGC TTCAGTTTCT TTTGATAGTG AAACTATACA ACCAACTTTT CATTTGCAAT GGGGAATTCC TGGTCGAAGT AATGCAATTG AAATTTCAAA GAGACTTGGT CTCGATGAGC AAGTAATCAT AAGTGCTCAA AAATTTATCA ATCCTGAAAG GGTTGATAAT GTTAATCAAG TTATTCAAGG CTTAGAAAAA CAACGCGAGC GTCAGCAATC AGCAGCTGAA GATGCTGCTG CATTATTGGC TAAAACCGAA TTACTGCATG AGGAATTACT TAATAGTTGG CAGAAACAAC GTCAACAATC GGAAGAGTTT AATGAGCAAG GAAGGTTCAA ATTGGAGTCA TCAATTCGTG AAGGTCAAAA AGAAGTTAGA CATTTAATTA AACGTTTGCG CGATCAAAAC GCTAGTGGTG AGACAGCAAG AATTGCCGGT CAACGATTAC GGCAAATAGA AAAGGGATAT CGAAACGACA AGCGAATTAA CCACACACAG AGTTGGACCC CAAAGATTGG GGAAAAAGTT AGATTGTCTT CTATTGGTAA AGCAGGTGAA ATAATTTCTT TTTCAGATGA TGGAATGCAA TTAACGGTGC TATGCGGAGT ATTTCGAAGC AAAGTCAATT TAACCGAAGT TGAAAGTCTT GATGGTCAAA AGGTCGAAAT AAACCAAAAT GTGCAAGTAA AAACTTCGCA GGTAAGAAAG AATTTATCTT TAGTAAGAAC TAAAAAAAAT ACCTTAGATG TAAGAGGGTT ACGCGTTCAT GAAGCCGAGG GGGTAATTGA AGAAAAATTG AGAAATTGTT CCGGAGCTTT ATGGGTTATT CATGGAATTG GTTCTGGAAA ACTGAAAAAA GGTTTGAGGA AATGGTTTGA TTCACTTCCA TATATTGAAA AAGTAGCCGA TGCTGAACCT CATGATGGCG GCCCTGGATG TAGCGTTGTG TGGATGGTTG ATTGA
|
Protein sequence | MGLTKNHDDS KKAQIISESL DLLEWPTVCS HLSTFALTQQ GRKKCESFDL PRNLSLSQEL LSQTLEIGSL DSSLNEGISF DGVHDLENIL LICSKGGIAI GEDLLKVADT LRAARKLRKL IFDQVIRPRL SELLKDVATL PDLQKLLEFG LDEGGRIADR ASPKLSELRR YRNSVRLQRK DILQDIIRKY GGLLQDNIIS ERYGRPVLAF KAGTSDQIKG MVHDSSASGN TIYVEPQVVI SIGNRLAKID SEISDEERRL LADWSKEVGL NAIVIAHLVE ILLQIEFALS RARYSKWLNG VPAILDQEEH SLFEIKDFRH PLLVWNDFHE KKNTVVPTSF DVAPDLKVVA ITGPNTGGKT VALKSIGLAV LMAKAGLLLP CTGSPRLPWC KNVFADIGDE QSLQQNLSTF SGHILRISRI LDAIDVFPGT TLVLLDEVGA GTDPTEGTAL AMALLQVMAN RARLTIATTH FGQLKALKYS DSRFENASVS FDSETIQPTF HLQWGIPGRS NAIEISKRLG LDEQVIISAQ KFINPERVDN VNQVIQGLEK QRERQQSAAE DAAALLAKTE LLHEELLNSW QKQRQQSEEF NEQGRFKLES SIREGQKEVR HLIKRLRDQN ASGETARIAG QRLRQIEKGY RNDKRINHTQ SWTPKIGEKV RLSSIGKAGE IISFSDDGMQ LTVLCGVFRS KVNLTEVESL DGQKVEINQN VQVKTSQVRK NLSLVRTKKN TLDVRGLRVH EAEGVIEEKL RNCSGALWVI HGIGSGKLKK GLRKWFDSLP YIEKVADAEP HDGGPGCSVV WMVD
|
| |