Gene Aasi_1814 details


Gene Information

Locus tag: Aasi_1814
Symbol:
ID: 6376977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1419274
End bp: 1421742
Gene Length: 2469 bp
Protein Length: 822 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003573196
Protein GI: 294661320
COG category:
COG ID:
TIGRFAM ID:
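
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1419274
END_BP = 1421742


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2469
print(protein_length(length_bp))  # 822
```

Both results match the Gene Length and Protein Length entries listed in the table.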


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.614936
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCGTA TTGTTTTGTG CATATTACTT ATTGTAGCAA GCAGCAGCTA CCAGACATGT 
GTAACAGAAG TAGTAAATCA AAGGCATGGG GAAGGTTTGA TAATGAACGT CACTTCAAGA
CAGGATAGAA AAATGGAAGT CTATATTAGC TTGGGAGCAA GGAGAAATGA AGCTATAATT
GGTAACTATA GATTAAGAGT TAGGTTGTTT TCTACAACAG AAAGAGATGC TACTAGTTTT
ATAGACTATA TAAATGGCGC AGGGCGGCAT GAGCGAGCAA CAAACATCGA CCAACCGTTG
AGCTATTTTA CAAGAGAAGA GAGGATAAAT TTGGAAAGTG AAGAGATTCT AATACCTTTT
ACATTTATTC CAGGGTTAGA AGTAACTAAT GCAAAAATAT GCTTTGAGCT TTTCAATGAA
GCAGGGATGT TAATTAGTTC AGCTAATGTA AGCTGGGAGC ATACTATAAT AGAGCCAGAA
AATAACTTAA TGCTGCTAGA AAGTGCTGAA ATACATAGAA TGGAGGAGTT AGAGGCACTA
GATGTGTTGC CAGTAAATTA TGGGAATACA TTTACAACAC TAACAGGTGA ACATAGCCAT
ATAAAAGTGG GGAGCAAGCC ACAAAAAGAA CTAACAACTG CTTTCTATCT AAACAATGCT
AAACACAAAG TGACATTAAG GGAGGTAGAA AATAGAGAAA AAATTGATGC ACGGTTAGAA
TTAGAAAAGA TGGTTATACC TGCTAGATAT AATAACATGA GCAACCAAGA GTTAGCTAAG
CGAGCGAATG AGAATGATTT ATATGCGCAA GAGCTAATGG TTAGAAAGGT TATGGCAGAA
GGCTCTTTTT TAAGCTTGCT AAAATTAGCT GATCCATATA ATTGGAAGGG AATTAGAGAA
AAAGTACAGC AAGATGGGCG TTACATTTAT ATATTGCTAA GCCGTCCAAA TCAGATCAAA
GATGATGCTA TTTGTAACTT GTTTGTTGAT AGTGTAATAG AGCATGCGCA AGCAGGAGAT
TCTTTGGTAC AAACCAATTT AGGAATGATG TATTTGGCAG GTAAAGGTGT ACAGATAAAT
AGTGACCAAG GACTAGCATG CCTTATTCAA GCAACTGAGA AAAATTTTGG ATTAGCTTAT
TATATGCTAG GACAAGTTTA TAAGCATGGG ATAGGTATTA AGAAGAATAC AAAAGAGGTA
GTTAAATGGC ATATAGAAGC AGCTAACCAA GGGATTATAC GCTCCAAGTA CAATTTAAGG
GATATATACA TAGCCAGTAT TTTATTTGAT AAAGCTGAGG TTGAAGAAGA AATGGACGAA
GAAGAGGGCT TGAAAAAAGA AATAGAGTGG CTTGCTAAAG TAGCAGATAA AGAAGGAGAT
AAAAATGCGC AAGCGGCATT AGGATTGATA TATTGTATAG GTAAAGGAGT AGAGCAAAAT
GTTGAATTAG GTTTAGAATG GCTTAATATG GCCACTGACT ATAGAAATAA GACTTTATTA
AGTGACCACT ATTTAATAAA GAGAATAGGA GACATATATT ACTATGGTAT GCTTGGTACA
GCTAAAGATT CTAAAAAAGC TATAGAGTTA TATATAAAAG CAGCTGAGGG TGGGATGGTT
GCAGCTCAGA AGCGTTTAGT AAAAGTGTAT TTTAAGGGAG AATATGTTGA ACAAGATTTT
GCTCAGGCAA TTTTTTGGGC TTTACAAGCA AAAGATAAAG CATGGTTATT GGATAAATTT
GAAATTAACT TAAATAATCC TTTAATGTAT AACCAAGTTA AAAGAGATTT TGAGGCTTTG
GGTAAAGATC TTCTTGTTAG TTGTCGACGT ATGCGAGCAC GAGAGAAGCA CTCACTTATA
GGGAGGCATG CTGATATGTT ACCTGACTTG TTTCAAGGAT TAAAAGAGAT AATAGATGAA
CTTATGAAGT GGGGACAACA ATTAAAGTCT CAATCTGGCT TGTTGATAAA TTTCATGAAT
TTTAAGGATC CAAGCTTTAA AAGCATAATA CAAGAACGTC AAACGGAAAC AGGTGTTATC
CCCTACGTAA AGGAATATGA ATATCAAGAA AAGAATTATA TAAGCTTTGG AGAATCAAAT
GTTCGGTTTG CAGACCAGCT TTTAGAGGAA CAGGTTTATC AAACTCATTA TAATGAAGCA
CTTAATCTAT TAGAGATAAT CACATCTATT TATAAAAAAG CACAGGTAAA GTTAATTTGT
GAAGCAAAAG AGATTAAAAA AGTACTATCT AAGTTGCATA TAGATGAGAA GATAAAGGAA
CAGATGTTTT TAAAAGAGTT GAATGGAAAA ATCAGAATAA TTAACTTATT TAATGTCAAA
TTAAAATTAC TTGAAGATAA GAAAAAACAG TTTATGAGTT TTTATCAACT ATTATTCAAA
CAAATTGAAA AAGGACTGTT CGTACGTAAT AAGCAGTTTA AAGCAAAACA CAGTTATATT
TTTGATTAA
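
The gene sequence above is laid out in space-separated groups of 10 bases, so it has to be joined into one string before any statistic such as GC content can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed percentage describes that line alone, not the whole gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string.

    Removes the spaces between 10-base groups and any line breaks.
    """
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGGCGTA TTGTTTTGTG CATATTACTT ATTGTAGCAA GCAGCAGCTA CCAGACATGT"

seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full multi-line gene sequence to `clean_sequence` and then to `gc_percent` gives a figure that can be compared with the GC content entry in the Gene Information table.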
 
Protein sequence
MRRIVLCILL IVASSSYQTC VTEVVNQRHG EGLIMNVTSR QDRKMEVYIS LGARRNEAII 
GNYRLRVRLF STTERDATSF IDYINGAGRH ERATNIDQPL SYFTREERIN LESEEILIPF
TFIPGLEVTN AKICFELFNE AGMLISSANV SWEHTIIEPE NNLMLLESAE IHRMEELEAL
DVLPVNYGNT FTTLTGEHSH IKVGSKPQKE LTTAFYLNNA KHKVTLREVE NREKIDARLE
LEKMVIPARY NNMSNQELAK RANENDLYAQ ELMVRKVMAE GSFLSLLKLA DPYNWKGIRE
KVQQDGRYIY ILLSRPNQIK DDAICNLFVD SVIEHAQAGD SLVQTNLGMM YLAGKGVQIN
SDQGLACLIQ ATEKNFGLAY YMLGQVYKHG IGIKKNTKEV VKWHIEAANQ GIIRSKYNLR
DIYIASILFD KAEVEEEMDE EEGLKKEIEW LAKVADKEGD KNAQAALGLI YCIGKGVEQN
VELGLEWLNM ATDYRNKTLL SDHYLIKRIG DIYYYGMLGT AKDSKKAIEL YIKAAEGGMV
AAQKRLVKVY FKGEYVEQDF AQAIFWALQA KDKAWLLDKF EINLNNPLMY NQVKRDFEAL
GKDLLVSCRR MRAREKHSLI GRHADMLPDL FQGLKEIIDE LMKWGQQLKS QSGLLINFMN
FKDPSFKSII QERQTETGVI PYVKEYEYQE KNYISFGESN VRFADQLLEE QVYQTHYNEA
LNLLEIITSI YKKAQVKLIC EAKEIKKVLS KLHIDEKIKE QMFLKELNGK IRIINLFNVK
LKLLEDKKKQ FMSFYQLLFK QIEKGLFVRN KQFKAKHSYI FD