Gene Aasi_1245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1245 
Symbol 
ID6377661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1591170 
End bp1592573 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content34% 
IMG OID642682340 
Producthypothetical protein 
Protein accessionYP_001958296 
Protein GI189502579 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA CAAATTTAAC TACAACTAAC CTTCAGAGTA TTCTTGTAAT CCTTGTTATG 
GGCTATATAT GTGCTTGTAA GGAACCTGTC CCAACAGATT CTACAAGCAG GCAAGAAAAA
AATATATATA CGCAAAACAT AAGTTTAGCT TCTTTAGATG CAAATAGTCC AGATAGTTTG
TGGGAGCAAT TTTTTGCACA ACGGGGTATG CGGAGGGTAG GTGATCACTT GTTTCTGGAA
GAAGATAAAT CCATTTCTTT ATTCAGAGGA AGGTTTGGTA CAATGTCTAT TACAGATGCG
GAGGCTAAAG AAGCAATAAT AGGATTTATA AAATATCTTC ATTATCGTAA TATAACTATG
CGTTCAATAA ACATTGTAAA GGATGTATTA TGTGAACTAG ATCTAGCAAT TATTGAATTT
GCTAGTATTA CTGAGCAGTT TAGACATTTA GAAAATCTTA CCTTACATAA TAACCGCATA
CAAGACATGC CCGCACTTAT AGCATCTTTG CCTAACTTAA AAGAATTACA CTTAGATAAC
AATAACATCG GCCAATTACC TGAACAGTTA GGTGCTCTTA CACAGTTAGA AAAGTTTCAG
GTATCCAACA ATCAACTATG TGAGCTTCCT AGCTCTATAA CTCAACTTAC TAATTTAACT
GAATTAGATT TAAGTAATAA TCAGTTTAGC TATTTCCCTT TGCCTATATA TACTGCTAAG
CAAATAAGCA TACAAGCTGA TGACGTTGGT GAGGTAATGC CTGTTACAAA TCCGTTTATT
AGAATAAGAA CCCTTTCTTT GAAAAATAAC CAGCTTCGAG AGATTCCTGA GTATATAGGC
TTATTTACCA ACCTTGAGCG ACTTTATTTG GACAGTAATA AAATTCATAG GTTACCGCAT
ACAATGACAC AACTTACGAA TCTTTCACGG CTTGTACTAG CTCATAATGA ACTGAAAGAA
TTACCTGGTT GTATGTATTC ATTTATTACA TTGGGGCGGC TTAATGCATG TCGAAATGCC
TGGTATCAAC CGAAAGATTT AATAAAGCTT AATTATAAAG AAATTAGAAA AGAGGCACAG
AAAATGCTAC CTACTAGCTT GGCTACACTT TGTATGCGTT GTATAGTTGG GCCTGTTGAG
CGCTTAAATA ATTATACACC AAGAGAATTG AAAAACTTAC TTCCTATAGG CTTAAGCTAT
AGATATTTAC CTTATATGTA TCAAAAAGCA TCTTACTGGA AAGAAGGGGA AAAATATGTG
TGCTTTATGC CGCTTGGAGA TCTCAATATT CCGTTCTCTA TGGATTTCTC ATTTGTTACA
CTTGCTGATT TAGATAACTT ATTCGAAAAA ATAATTAAGA GGAGTAAAGA GGAAATTGTC
TTTCAGAAAA AAGGGTATTG TTAA
 
Protein sequence
MKRTNLTTTN LQSILVILVM GYICACKEPV PTDSTSRQEK NIYTQNISLA SLDANSPDSL 
WEQFFAQRGM RRVGDHLFLE EDKSISLFRG RFGTMSITDA EAKEAIIGFI KYLHYRNITM
RSINIVKDVL CELDLAIIEF ASITEQFRHL ENLTLHNNRI QDMPALIASL PNLKELHLDN
NNIGQLPEQL GALTQLEKFQ VSNNQLCELP SSITQLTNLT ELDLSNNQFS YFPLPIYTAK
QISIQADDVG EVMPVTNPFI RIRTLSLKNN QLREIPEYIG LFTNLERLYL DSNKIHRLPH
TMTQLTNLSR LVLAHNELKE LPGCMYSFIT LGRLNACRNA WYQPKDLIKL NYKEIRKEAQ
KMLPTSLATL CMRCIVGPVE RLNNYTPREL KNLLPIGLSY RYLPYMYQKA SYWKEGEKYV
CFMPLGDLNI PFSMDFSFVT LADLDNLFEK IIKRSKEEIV FQKKGYC