Gene Aasi_1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1174 
Symbol 
ID6377415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1506104 
End bp1507990 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content37% 
IMG OID642682278 
Producthypothetical protein 
Protein accessionYP_001958237 
Protein GI189502520 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0643431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAA ATATTGTCCG TCTACTTCCA GATAGTTTAG CTAACCAAAT AGCAGCTGGC 
GAGGTTATTC AAAGGCCTGC TTCTGTTGTT AAAGAGTTAG TAGAAAATGC AGTAGATGCA
GCAAGTACAC ATATTAAAGT AGTCATTAAA GATGCTGGTA AAACCCTTAT ACAAGTTATT
GATGATGGAA TAGGTATGTC AGAAGTAGAT GCTAGGATGA GTCTTGAAAA ACATGCCACT
TCTAAAATAA GCCAAGCAGA CGACCTATTT AACATTCGCA CTATGGGCTT TCGAGGAGAA
GCACTACCCT CTATTGCAGC CATAGCCCAA GTAGAGATAG AAACACGTAC CGAAGATGCA
GAACTTGGCA CACGCCTTGT GGTAGAGGGT TCTAAAATAA AACTACAAGA ACCTGTTGCT
ACTACAAAAG GCACTACCAT TAGCGTTAAA AACCTTTTCT TTAACGTCCC AGCACGAAGA
AACTTTTTAA AATCAGAGCC TGTAGAAACC AAGCATATTA TAGAAGAGTT TCAACACATT
GCTTTAGCAA GGCCAGACAT CTCCTTTTCA CTATATCAAA ATGAGCAAGA AACCTACCAC
CTACCAGCCA CTAAGCTTGC CAATCGAATT GTACACCTTT TTGGAGAAAC TTATAAAAAG
CAACTAATAC CTTGCCAAGA AGGCACTGAT ATTATACAGA TACATGGGTA TGTAGGAAAT
CCTTCTTATG CTAAAAAGAC TAGAGGCGAG CAATTCTTTT TTGTTAACAA CCGTTTTATA
AAAAGTACAT ATTTACACCA TGCTGTTAAG AGTGCATTTG AAGAACTAAT TCCTAAAGAT
ACTTTTCCTT TTTATGTATT ATTTATTGAA ATTTCACCTG AACGTATTGA TGTAAATGTG
CATCCTACCA AAACAGAGAT CAAGTTCGAT GATGAGCGGA TGGTATACTC TATATTACAA
GCATCAGTAA GGCAAGCCCT AGCACATCAT ACCACACCGG CATTCGACTT TGAGCAAAAT
ATTAATTGTG ATCCGCTCGG TTTACAAGAG CAACCAAGAC AAAAAAGCTT TACAACAAGT
ACAGATAGGG CATATAGCTC ATTTAAAAAG TTTGACAATC CTTCTGTTAA TCAACAAGAA
TGGGAACAAC TGTTTCAACG AATCAACGTA GATACTGATA GTACTACACC TATCCACGGG
CAACAAGCAC AACTGTTAGA CCTAGAAAAT ATGTTGTCAA CAGTTAATGA GGTATCTTCT
ATGGCTACAA TGCAAGAAAA CATCCAGCCA TTTTCTATGC TAACTCATGA AGGGACAGAT
ACTGCTGCTA AGATGCAACT GCATGCTACT TATATTCTAG CATCTGTCAA GTCAGGTTTA
TTGCTTATCA ACCAACATGC AGCGCATGAA CGTATTTTAT ATGAAAAATA TATCGAACAT
CTACAAAATC ACCATGCAGG TACTTCTCAG CAATTACTGT TTCCACATCA AATAGAATTA
AACCCAGCAG ATTTTGCACT TATACAGGAT TATGAGGGGA CACTAGGAAC ACTAGGTTTT
GCAATAGAAG ACTTTGGCAA AAACAGCATC ATATTAGTTG GCTACCCAGC AGAAGCTGCA
CAGCATAATC CAAAACAATT ATTAGAAGAC ATATTAGAAC AAATTAAGTG GAATAAAAGC
CATCTCTCCC TGCCAATTCA AGAAAATGCA GCTCGTGCTT TAGCAAAACA TGCTGCTATC
CAACCTGGAA AAAAATTGAC CATGGTAGAA ATAGATAGCT TAGTAGATCA ATTATTTGCT
TGCAAGAACT CAACTCATGC GCCAGATGGT AGAAGAATAT GGGTTATTAT AACCTTAGAA
GAGTTGGCTA GTTTATTGAA AACCTAA
 
Protein sequence
MSTNIVRLLP DSLANQIAAG EVIQRPASVV KELVENAVDA ASTHIKVVIK DAGKTLIQVI 
DDGIGMSEVD ARMSLEKHAT SKISQADDLF NIRTMGFRGE ALPSIAAIAQ VEIETRTEDA
ELGTRLVVEG SKIKLQEPVA TTKGTTISVK NLFFNVPARR NFLKSEPVET KHIIEEFQHI
ALARPDISFS LYQNEQETYH LPATKLANRI VHLFGETYKK QLIPCQEGTD IIQIHGYVGN
PSYAKKTRGE QFFFVNNRFI KSTYLHHAVK SAFEELIPKD TFPFYVLFIE ISPERIDVNV
HPTKTEIKFD DERMVYSILQ ASVRQALAHH TTPAFDFEQN INCDPLGLQE QPRQKSFTTS
TDRAYSSFKK FDNPSVNQQE WEQLFQRINV DTDSTTPIHG QQAQLLDLEN MLSTVNEVSS
MATMQENIQP FSMLTHEGTD TAAKMQLHAT YILASVKSGL LLINQHAAHE RILYEKYIEH
LQNHHAGTSQ QLLFPHQIEL NPADFALIQD YEGTLGTLGF AIEDFGKNSI ILVGYPAEAA
QHNPKQLLED ILEQIKWNKS HLSLPIQENA ARALAKHAAI QPGKKLTMVE IDSLVDQLFA
CKNSTHAPDG RRIWVIITLE ELASLLKT