Gene Aasi_0951 details


Gene Information

Locus tag: Aasi_0951
Symbol:
ID: 6377095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1224281
End bp: 1227706
Gene Length: 3426 bp
Protein Length: 1141 aa
Translation table: 11
GC content: 35%
IMG OID: 642682078
Product: hypothetical protein
Protein accession: YP_001958039
Protein GI: 189502322
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
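
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes that the Start bp and End bp values are 1-based and inclusive, and that the gene's final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1224281
end_bp = 1227706
gene_length_bp = 3426
protein_length_aa = 1141

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, the span 1227706 - 1224281 + 1 gives 3426 bp, and 3426 / 3 - 1 gives 1141 codons, in agreement with the listed protein length.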


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATA AGCATACTAC AGAAAAGTGG GTAGGTGGAC ATATTATGCT GTTGCCAACA 
CTTTTGATAT TAACGTTAAT GTTACCTTTG GAAGGTTGTG GAAATGGCTT CAATGTCTCT
TTTAGTAAAG AGCCTCAAAA AGTTAATAAT GATAACACGA CTGTTTCAAT TTTTGAAAAG
GAATTGACAA ATAACGGAGT AGAAGATGTA ACCAATCAGA TGCAAACACC ATCATCGGAT
GCCATAACTT TATATACTCA ACAGGAAGTT TTGCCACCCC AAGCAGAAAT ATTATCACTT
GCTCAAGGTT ACGAGGTTGG AGCTAACCAT TCGTCTGATA AAGATGAATA TTCACCTGTT
GAAGCTAAAA GCAGTAATGA TGCTGTTAAT AAACATGAGC AAGCAGGCTC GCTAAGCGTA
CAACAGCAAG ATAGTAAAGC TAATAATAGT ACTACTCAAA AATATCTTTT GCAAAAGAGG
CGGCATACAA TAAGACAAAA GACAACCCCA AAATATAATG AATTATTGGT GGAAGAAAGC
AAGATATTAG ATAGAATTTA TAAGACACAA CAAGGATACG AAATTAAATT TTATAAAGTA
GGAGATAAAC TAAGGGCCAA CATTAAGGTA ATAAGAACAG GAAAAAATTT GGATTTACCT
GTAGCTTCTA CATCAGTGAA AGGGGTAGAA TTAATTTTAA ATCAATTAGC TTATGTGTAT
AAGTATGGAG CATCTACAAC TAACCTTATA CTAGTTGCCT CACCATATCA AGATAATGGG
TATGGATATA TAGGTGGTGT ATTGGGAGGT AGTAATACCG GCAATAAAAA AGGCAATACA
CAAGAACAGA AGAAACAACA AAGTAATCTT GGTAAGAAAG ATGCAAAAGG GAAGGGAAAG
GCAGAAGAGG AAGAAGAGGA AGAAATGCTA GTAACGGCCC CCTTAGCAAG TGAGAAGAAG
AAAAGAGAAA GAGATAGAAT GCCTACTTTT GTTTCTCCAA ATCCTACGGA AACTAAGCGC
CAAAAAGGCA AACAGGAGAG TCAAAAACGC CAATCGAAAC CTAGGAAATT ATCTTTTGCT
ATAGATCCTG AAGATGTTAT AGATCAAGGA CCAACCACAG TAGAAGGACT TAAAAACAAC
CCTGATAAAT TACCATTACT TGAAGAGAAA GCCCATTTAG ACAATAGCTT AGAACTTCAA
CATTTACTAG GAGATTATTA TCTAAGTAAT TTTTATGATT ATATACGCAG ACATGAACAT
ACTCCCACCA GAGAAGACAA TCAAGACCTA GATCAAGCTA TTATATGGTA TAAAATGGCT
GCCGACCAAG GATACCAACC AGCTCAAATA AAACTTTCAG AACTAAGGAG AAATGATCAC
TATACAAATC CTAGCCTGGA ACTAATGCAA GCATATCACC AAGCTATACA GAAAGCTGGT
TGTGACGACA AAGACTTTGA GACTAAAGCT AATCAAACTT CTGGTCGATT GATATTATCA
AATCTAATTG CCATGGATCT TAATCCTACT AAAGGCCCCA CACAAGCAAA ATGGGGCGGA
CCTTTAGCAA GCCCAGAGAC AAACTCTAAA ATATATGAAA CTTTATATGA ACCTTTAGTT
TATAAAAGAT ATGAGGATGG TGTAATGTCG ATCGACCCAT CTGGAGATGG TCCAGATGAA
ACAGCTTATG TAGTTATGAA AAGATATAAA AATTATTATT TTATTATAGC GCTAGGTGGG
ATGGCTGGTA AATACACTAA AAAAGAAGAT AATCCTCAAG CTACTGTGGG AAATAGCCCA
CAAGTTATTG AAAAATTAGT ATCAGTTGCT TTAGCAAATA AGGTTAATTA CATCCATATT
GAAAAAAATA ATGATAATTC TTTTGCAAAT CTTTTAGAAG GGCATTTAAA AAAAATAGGT
TCTGAGATAA AAGTTATCAA ATTTCAGCAA AGCACCAATA AAGAGAACAA AATACAAGAT
ATAATAGGGC CTTTATTGGA TAATAAGTAT TTAATCATTG ATAAAAAGGC TTTGAAAGAT
GATTTTAGTT CTATACTACA ACACGATTTA AACTTTAAAT TTTTCTATCA ATTAATGACT
ATTGATAGCG AAACTTCTAG TTCCACAGAC ATAGGTTTTC CTAAACCAGA ACATGATGAT
CGGCTTGATG TGGTAGCAAA TGCAATACGC TTTTTACAGA GAAGAATAGA AATTCGAGAA
GTATTTCAAG ATAAAATAAA CCAACTGACC GAAAACGCAG AGCTAGGAGA TTTATCGGCA
CAAGCTGAAT TAGGTGAAAT ATATAGAAAT GGGATGGGCG TTGAAGAAAA CTACCAAGAA
GCATTAAAAT GGTATACTAA AGCGTTAAAC GAGTACAAAG AAGCATTAAG TGCTAACAAG
ACAACTGAGC AAAGCGAAGA AGAGAAAAAT GACTTTAATA ATGTATTATT TGGTTTAGGA
GAAATGTATA GAAAAGGGCT GGGAAGTGAT CCAAATTATA AGGAAGCATA TAAGTTATAC
CAGGAGCTTG CTTACGAAAA TGTAGATGCA AGAGGTTATT ATGGATTAGC GAAGCTGTAT
GAAAAGCATT TGATAAAAGA CGAAAGTATA GATATTAATC AAAAAATATT AGGCTTGTAC
ACCGATGCAG CAAGGAGAGG TCATACAAGA GCACAATTTA AACTAGCTAC GATTTACCAA
AATGGTGAAG CTTGGGGAAT TTCAGTAAAT CCTAAGAAAG CACTTGGTTG GTATACAGAA
GCTGCCTCAA AAGGAGATCG GGAAGCTTGT TTTAAATTAG CAGACATGTA TTATAAAGGC
GAAGGAATAC AAGCAGCAAA TTATGAAGAA GCATTTAAAT GGTATATGAC TCTTGCACCC
CAAGGTGAGA TAAAGGCGCA GCTTAAGGTA GCTAAGATGT ACCGAAAAGG CATAGGAATA
AAACAAGACT ATATAGAAGC ACTTAAATGG TATACGAGGG CTCTTCGCCG AGGAAATGTA
AAATCGCAAT ATAATATTGC TAAAATATAC CAAAAAGGTT GGTCAGGGCA TAAGGATGAG
GATAAAGCAT TAAAATATTA CGAAAAAGCA GCTAGACAAG GATTGTTTAA TGCTCAAGTT
GAAGCAGGTA TAATATACCA AAAAAGAGAG AATTATGAAA AAACGATTGA GTTGTATAGT
AAAGCAACTG AGAATATAGA TGCTGTTAAG AATAATAAAA AGTTTGCACA TGTACAATTT
AACCTCGGAT TATTGTATGA AAAGCAAGAA AATTATGATA AAGCATTTCA ATATTATGAA
AAAGTAGCTT CACAAGGATA TGCTAGTGCA AATACCAAGC TAGGTTGGAT GTACCAACAT
GGAAAGGGAG TAGAAATAAA TATGGAGAAG GCGCTGGAAT ATTACAGCAA AGGGGTCCAA
TTTTAG
 
Protein sequence
MKYKHTTEKW VGGHIMLLPT LLILTLMLPL EGCGNGFNVS FSKEPQKVNN DNTTVSIFEK 
ELTNNGVEDV TNQMQTPSSD AITLYTQQEV LPPQAEILSL AQGYEVGANH SSDKDEYSPV
EAKSSNDAVN KHEQAGSLSV QQQDSKANNS TTQKYLLQKR RHTIRQKTTP KYNELLVEES
KILDRIYKTQ QGYEIKFYKV GDKLRANIKV IRTGKNLDLP VASTSVKGVE LILNQLAYVY
KYGASTTNLI LVASPYQDNG YGYIGGVLGG SNTGNKKGNT QEQKKQQSNL GKKDAKGKGK
AEEEEEEEML VTAPLASEKK KRERDRMPTF VSPNPTETKR QKGKQESQKR QSKPRKLSFA
IDPEDVIDQG PTTVEGLKNN PDKLPLLEEK AHLDNSLELQ HLLGDYYLSN FYDYIRRHEH
TPTREDNQDL DQAIIWYKMA ADQGYQPAQI KLSELRRNDH YTNPSLELMQ AYHQAIQKAG
CDDKDFETKA NQTSGRLILS NLIAMDLNPT KGPTQAKWGG PLASPETNSK IYETLYEPLV
YKRYEDGVMS IDPSGDGPDE TAYVVMKRYK NYYFIIALGG MAGKYTKKED NPQATVGNSP
QVIEKLVSVA LANKVNYIHI EKNNDNSFAN LLEGHLKKIG SEIKVIKFQQ STNKENKIQD
IIGPLLDNKY LIIDKKALKD DFSSILQHDL NFKFFYQLMT IDSETSSSTD IGFPKPEHDD
RLDVVANAIR FLQRRIEIRE VFQDKINQLT ENAELGDLSA QAELGEIYRN GMGVEENYQE
ALKWYTKALN EYKEALSANK TTEQSEEEKN DFNNVLFGLG EMYRKGLGSD PNYKEAYKLY
QELAYENVDA RGYYGLAKLY EKHLIKDESI DINQKILGLY TDAARRGHTR AQFKLATIYQ
NGEAWGISVN PKKALGWYTE AASKGDREAC FKLADMYYKG EGIQAANYEE AFKWYMTLAP
QGEIKAQLKV AKMYRKGIGI KQDYIEALKW YTRALRRGNV KSQYNIAKIY QKGWSGHKDE
DKALKYYEKA ARQGLFNAQV EAGIIYQKRE NYEKTIELYS KATENIDAVK NNKKFAHVQF
NLGLLYEKQE NYDKAFQYYE KVASQGYASA NTKLGWMYQH GKGVEINMEK ALEYYSKGVQ
F
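
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is an illustrative example, not the method used to produce the record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it treats the space-separated 10-base blocks as plain formatting. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only short fragments copied from the start and end of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments of
# NCBI translation table 11, which match the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two 10-base blocks of the gene sequence; trailing partial codon is ignored.
first_blocks = "ATGAAATATA AGCATACTAC"
# Last four codons of the gene sequence, including the stop codon.
last_codons = "GTCCAATTTTAG"

print(translate(first_blocks))  # MKYKHT, the first six residues of the protein
print(translate(last_codons))   # VQF, the last three residues of the protein
print(round(gc_fraction(first_blocks), 2))
```

Applying `gc_fraction` to the complete gene sequence would give a value to compare with the GC content field of the record.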