Gene Aasi_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1844 
Symbol 
ID6377414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1503015 
End bp1506032 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content36% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573216 
Protein GI294661340 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0832694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTTTATT TTAGCAAGCA TATTATGAAA ATATTTCGTA CTATCAATTT ATGGATTGGC 
TGGGGACTCT TTTTTCTAGC CATGCTCGTC TATACACTAA CTATAGAGCC TACAGCTAGT
TTCTGGGATT GCTCTGAATA TATTGCTGCT GCTTATAAAT TACAGGTAAC GCACCCACCT
GGTGCTCCCC TATTTCTTCT AATTGGCAGG ATGTTCTCTT TTTTAGCTGG TAATAACACA
GAGAAAGTAG CTTTTTGGAT CAATATGAGT TCGGTAATAA CTAGTTCGGC TACTGTAATG
GTAGTATTCT GGATTATTTC TTTACTAGCT AGACGAATTA TAGGTAAGAC AACACAAGAT
TTACAACTTT ATGAAGCAGC ATCCATATGG GGTGCTGGCA TAATAGGTGT GCTGTCGCTA
ACTTTCTGCA GTACCTTTTG GTCCAATGCT ACAGAGGCTG AGACATATGC CTGCTCTACA
CTCTTAATGT CCCTTACAGT ATGGGCTATG TTAAATTGGG AGTATACAAC ACCCAGACCA
CGAAGCTATC AATGGCTTTT ATTAGTCGCT TACTTGATAG GATTAAGTTT AGGAATACGG
ATGTTTAGTG TATTGACAAT ACCTGCTTTG TGCCTCATAT TTTATTTTAA GCGAGTATCA
AAAATCACTT TGCTGGGTAC TACTATAACA CTTTTAATTG GTGGAATACT TTTAGCCTTT
ATATATACTG GCATTACCTT GAGCTTACCT ACCTGTGCCA TGCAGTTAGA ATTACTTTGC
GTTAATCAGT TAGGTTTGCC TTTTAAAAGT GGCATCATTA TACTAAGTAT TACACTAATA
GCTAGTTTAA CTTATGGTAT CATTTATACT ATACAAAAGC AGCATACCAC AATACATATA
GGATTGCTAT GTTTAGGATT TATTTTAATA GGATACTCTT CTTATGGACT AGTGCCCATT
CGTGCTCATG CCAACCCTCC TATCAATGAA GGGCATCCAA GCGATATTAT TAGTTTTATT
AACTATCTCA AGAGAGAACA ATATGGCCAT AGACCTTTGG TATACGGACC ACATTTTGCT
GCCCAGGTCA TAAGTGCTAA AAAAGGTGAC CCTATTTATA GAAATACTGG GAAAAAATAT
GAGATTATTG ACTATAAGCA TATCCCTATT TACGATGCTG GAGCCTATAC GCTTTTGCCT
AGGACTTGGA GTCAGCAAAA TTCTATGCAT ATAACAGCTT ATAGGAAGAT TCTTAATCTT
AAACCTTGGC AAAAACCTAG TTTGGGAGAT CAGCTATATT TTCTCATAAG GCACCAGCTA
GGACATTTCT ATTTACGTTA TTTCTTATGG AATTTTGCAG GACGTGCAAG CGACATGCAG
GGTGCTTCAT GGCTTACACC ACTAGATGCT TTTGAGAAAT TGCCGCCTAG CTTAACACAA
ATACCTGGAA GAAGTAATTA CTTATTCCTT CCATTCCTAT TAGGCCTAAT AGGAATGCTT
TTCCAGTATA GGCATGATAG ACGTTATTTC TGGGTAATAA CTATTTTATT TGTGATGCTA
GGAGCAGCAT TAGTAACTTT TTTAAATCCT CCTCCTATTG AACCACGTGA AAGAGACTAT
ATTTATGTAG GTTCATTCCT GTTTTTTACA ATCTGGATAG GCCTAGGTAC ATTAGCTGTT
GTAAACTATT TCAGGAAACT ATTTACACAA TATAAAATAG CTGTTACAAT AGGTATTATT
AGTTGCCTAG CAGTACCTAG CATTATGGCT ACGCAAGCTT GGCAAACACA TAATCGTTCT
CAGCGTTACT TTTCAGTAGA AAGCGCCAAA AATTTACTAG CCTCCTGTGC CCCTAATGCT
ATACTCTTTA CAGCAGGTGA TAATGATACT TTCCCGCTAT GGTATGTACA GGAAGTAGAG
GGCTTTAGAA CAGATGTACG AGTAGTTATC CTTAGCTATG CTAATGCAGC CTGGTATATT
AAGCAACTCA CACGCCCAGT AAACAATTCA GCACCACTAC CTTTATCTCT TCCATTTGAA
ATTTACCAGC AATATGGGCT TAATGATATT TTACCGTATG TACCACAACC CAATATACAA
GAATTAGATA TCATACAATA TCTCCAACTT ATCCGTGAAT CACATCCAGC CTTGCAAATA
CAGAACATAT TAAGAGAAAC TACCAATACA TTACCTTGCA AGAATATGTG TTTCCATATC
GATAAAACAG GAATAGCTGC TAAAGAAATT GTACCAACAC AATATGAATA TTTAATTCCT
GAAAAAATGA GCTGGTCTAT AAAAGGTAGA GGATTAGATA AAAGAGACTT GCTTATACTA
GATTTACTAG CGACTAACAA CTGGGAAAGG CCTATTTACT TTAATCATAG TTCATTACAT
ACCTTAAATA TAGACCTAAG TACCCATGTA ATGGTGGAAG GCTTAACACT CCGTTTAATG
CCTATACAGA ACAATATAGG TCACGAGCTA GTCAATACCG AAACAATGTA TAATAATATG
GTGAAAAACT TTTATTGGAA AGGAATGGAT AAGCCAGGAG TATATTATGA TGAAAATTAT
AGACTAGTAT TTATCCGTAA CCAACGTATG AGTTTTTGTA CGTTAGCTAA AGCATGTTTA
CATGAAGGAA AATTGCAACA AGCCAAAGAA GTACTATTAT ATGGCTTATC GGTAATACCT
GATGAAGTGG TACCATATGA TATAGCCAAC GTGTATATGA TACATTTGCT CTTTGAAGTA
GGAGAAAATG AACATGCCTT AAATATGATA AAAATTATAG GCAACAGAGC TGAAGAAATA
CTAACCTACA AAACAAGAAA AAGTAGTTTT ATAGATAGAG AAGTACAGGA ACAGATGGGG
ACATTATATG AAATAGCTAG AAGCCTAAGA GCAATAGATT ATCAAGAGTT AGCACAAGAA
TATGAAGACC TTTTAAACAA ATACCAAATT TTACTTGATG TACCTGATGA TAATAATAAT
GATATAGCTA GACGCTAA
 
Protein sequence
MFYFSKHIMK IFRTINLWIG WGLFFLAMLV YTLTIEPTAS FWDCSEYIAA AYKLQVTHPP 
GAPLFLLIGR MFSFLAGNNT EKVAFWINMS SVITSSATVM VVFWIISLLA RRIIGKTTQD
LQLYEAASIW GAGIIGVLSL TFCSTFWSNA TEAETYACST LLMSLTVWAM LNWEYTTPRP
RSYQWLLLVA YLIGLSLGIR MFSVLTIPAL CLIFYFKRVS KITLLGTTIT LLIGGILLAF
IYTGITLSLP TCAMQLELLC VNQLGLPFKS GIIILSITLI ASLTYGIIYT IQKQHTTIHI
GLLCLGFILI GYSSYGLVPI RAHANPPINE GHPSDIISFI NYLKREQYGH RPLVYGPHFA
AQVISAKKGD PIYRNTGKKY EIIDYKHIPI YDAGAYTLLP RTWSQQNSMH ITAYRKILNL
KPWQKPSLGD QLYFLIRHQL GHFYLRYFLW NFAGRASDMQ GASWLTPLDA FEKLPPSLTQ
IPGRSNYLFL PFLLGLIGML FQYRHDRRYF WVITILFVML GAALVTFLNP PPIEPRERDY
IYVGSFLFFT IWIGLGTLAV VNYFRKLFTQ YKIAVTIGII SCLAVPSIMA TQAWQTHNRS
QRYFSVESAK NLLASCAPNA ILFTAGDNDT FPLWYVQEVE GFRTDVRVVI LSYANAAWYI
KQLTRPVNNS APLPLSLPFE IYQQYGLNDI LPYVPQPNIQ ELDIIQYLQL IRESHPALQI
QNILRETTNT LPCKNMCFHI DKTGIAAKEI VPTQYEYLIP EKMSWSIKGR GLDKRDLLIL
DLLATNNWER PIYFNHSSLH TLNIDLSTHV MVEGLTLRLM PIQNNIGHEL VNTETMYNNM
VKNFYWKGMD KPGVYYDENY RLVFIRNQRM SFCTLAKACL HEGKLQQAKE VLLYGLSVIP
DEVVPYDIAN VYMIHLLFEV GENEHALNMI KIIGNRAEEI LTYKTRKSSF IDREVQEQMG
TLYEIARSLR AIDYQELAQE YEDLLNKYQI LLDVPDDNNN DIARR