Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1174 |
Symbol | |
ID | 6377415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1506104 |
End bp | 1507990 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642682278 |
Product | hypothetical protein |
Protein accession | YP_001958237 |
Protein GI | 189502520 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0643431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAA ATATTGTCCG TCTACTTCCA GATAGTTTAG CTAACCAAAT AGCAGCTGGC GAGGTTATTC AAAGGCCTGC TTCTGTTGTT AAAGAGTTAG TAGAAAATGC AGTAGATGCA GCAAGTACAC ATATTAAAGT AGTCATTAAA GATGCTGGTA AAACCCTTAT ACAAGTTATT GATGATGGAA TAGGTATGTC AGAAGTAGAT GCTAGGATGA GTCTTGAAAA ACATGCCACT TCTAAAATAA GCCAAGCAGA CGACCTATTT AACATTCGCA CTATGGGCTT TCGAGGAGAA GCACTACCCT CTATTGCAGC CATAGCCCAA GTAGAGATAG AAACACGTAC CGAAGATGCA GAACTTGGCA CACGCCTTGT GGTAGAGGGT TCTAAAATAA AACTACAAGA ACCTGTTGCT ACTACAAAAG GCACTACCAT TAGCGTTAAA AACCTTTTCT TTAACGTCCC AGCACGAAGA AACTTTTTAA AATCAGAGCC TGTAGAAACC AAGCATATTA TAGAAGAGTT TCAACACATT GCTTTAGCAA GGCCAGACAT CTCCTTTTCA CTATATCAAA ATGAGCAAGA AACCTACCAC CTACCAGCCA CTAAGCTTGC CAATCGAATT GTACACCTTT TTGGAGAAAC TTATAAAAAG CAACTAATAC CTTGCCAAGA AGGCACTGAT ATTATACAGA TACATGGGTA TGTAGGAAAT CCTTCTTATG CTAAAAAGAC TAGAGGCGAG CAATTCTTTT TTGTTAACAA CCGTTTTATA AAAAGTACAT ATTTACACCA TGCTGTTAAG AGTGCATTTG AAGAACTAAT TCCTAAAGAT ACTTTTCCTT TTTATGTATT ATTTATTGAA ATTTCACCTG AACGTATTGA TGTAAATGTG CATCCTACCA AAACAGAGAT CAAGTTCGAT GATGAGCGGA TGGTATACTC TATATTACAA GCATCAGTAA GGCAAGCCCT AGCACATCAT ACCACACCGG CATTCGACTT TGAGCAAAAT ATTAATTGTG ATCCGCTCGG TTTACAAGAG CAACCAAGAC AAAAAAGCTT TACAACAAGT ACAGATAGGG CATATAGCTC ATTTAAAAAG TTTGACAATC CTTCTGTTAA TCAACAAGAA TGGGAACAAC TGTTTCAACG AATCAACGTA GATACTGATA GTACTACACC TATCCACGGG CAACAAGCAC AACTGTTAGA CCTAGAAAAT ATGTTGTCAA CAGTTAATGA GGTATCTTCT ATGGCTACAA TGCAAGAAAA CATCCAGCCA TTTTCTATGC TAACTCATGA AGGGACAGAT ACTGCTGCTA AGATGCAACT GCATGCTACT TATATTCTAG CATCTGTCAA GTCAGGTTTA TTGCTTATCA ACCAACATGC AGCGCATGAA CGTATTTTAT ATGAAAAATA TATCGAACAT CTACAAAATC ACCATGCAGG TACTTCTCAG CAATTACTGT TTCCACATCA AATAGAATTA AACCCAGCAG ATTTTGCACT TATACAGGAT TATGAGGGGA CACTAGGAAC ACTAGGTTTT GCAATAGAAG ACTTTGGCAA AAACAGCATC ATATTAGTTG GCTACCCAGC AGAAGCTGCA CAGCATAATC CAAAACAATT ATTAGAAGAC ATATTAGAAC AAATTAAGTG GAATAAAAGC CATCTCTCCC TGCCAATTCA AGAAAATGCA GCTCGTGCTT TAGCAAAACA TGCTGCTATC CAACCTGGAA AAAAATTGAC CATGGTAGAA ATAGATAGCT TAGTAGATCA ATTATTTGCT TGCAAGAACT CAACTCATGC GCCAGATGGT AGAAGAATAT GGGTTATTAT AACCTTAGAA GAGTTGGCTA GTTTATTGAA AACCTAA
|
Protein sequence | MSTNIVRLLP DSLANQIAAG EVIQRPASVV KELVENAVDA ASTHIKVVIK DAGKTLIQVI DDGIGMSEVD ARMSLEKHAT SKISQADDLF NIRTMGFRGE ALPSIAAIAQ VEIETRTEDA ELGTRLVVEG SKIKLQEPVA TTKGTTISVK NLFFNVPARR NFLKSEPVET KHIIEEFQHI ALARPDISFS LYQNEQETYH LPATKLANRI VHLFGETYKK QLIPCQEGTD IIQIHGYVGN PSYAKKTRGE QFFFVNNRFI KSTYLHHAVK SAFEELIPKD TFPFYVLFIE ISPERIDVNV HPTKTEIKFD DERMVYSILQ ASVRQALAHH TTPAFDFEQN INCDPLGLQE QPRQKSFTTS TDRAYSSFKK FDNPSVNQQE WEQLFQRINV DTDSTTPIHG QQAQLLDLEN MLSTVNEVSS MATMQENIQP FSMLTHEGTD TAAKMQLHAT YILASVKSGL LLINQHAAHE RILYEKYIEH LQNHHAGTSQ QLLFPHQIEL NPADFALIQD YEGTLGTLGF AIEDFGKNSI ILVGYPAEAA QHNPKQLLED ILEQIKWNKS HLSLPIQENA ARALAKHAAI QPGKKLTMVE IDSLVDQLFA CKNSTHAPDG RRIWVIITLE ELASLLKT
|
| |