Gene Aasi_1767 details

Gene Information

Locus tag: Aasi_1767
Symbol:
ID: 6376961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1255049
End bp: 1258162
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003573165
Protein GI: 294661289
COG category:
COG ID:
TIGRFAM ID:
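
The start, end, and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; neither assumption is stated in the record, but both reproduce its figures. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1255049
end_bp = 1258162

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 3114
print(protein_length)  # 1037
```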


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.636968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTAT TAGTAATTGG AAACGCCATG AAAATGTTTA AAATTAATCC TTCACTATTT 
CTAAGTAGAA AAACCATTCT TTTTATACTT AGCTTACAGC TTATTGGTTG TGGCCATAAC
TCACCTGATA ATCAGAAAGA AGCACGTTTG CTCATGGAGA TAAATCCCCA CCATTTAATA
GGTGATCAGA AAGCCATTGA AGCCTGTTTT TCATTAGCTG AGCATAAGCA ACATGTTATG
CTTAATAAGT ATAGGTTAAA AATTAGCTTA AGTGGTAGCG CGAACCAATT GCTGTTTGAA
AAAGACACCG AAACTACAAG AAGTTTCCAG AATACGACCC AGAATTTAAC TTATTTTACA
TCCCAAAGAG AGCTAAACCT GGAAGATGAA CTATTAATTG TCCCCTTCAC CTTATTGCCA
GCTGCAACTA CTGATAATGT AATCATTAAG TTTGAACTTT TAGATGAAGA TGGTAGCATC
ATACAAAAGC ATAGGGTTGA TTGGAGTAGC CATACCAAAC AGACGAGCTT ATTACTTCCA
AGCAGTGATC TACCAGAAGG GGAAATTAAA ATTTTAACTA GTACTTTAAG TAAACGTCCT
TTGTTGCCTA CTTTACCATA TCCAGTTGAA AAGACTAAGG AAATGCAGGA AGAAGAATAC
CATGATCTGA TATGTAGTTC TGGTTCAGAA ATAATAGATA GATCCTTACA ATATCCAGCC
CAAAAGAAAT ATAAGAAAAG CTTACCAAAC CCTAAAGAAT CGACAGAATC TTATGATACT
TTGAGTATAC GTGCATTAGC TGAGCGAGCC GATCATAACG ATACTCAAGC ACAAGAAGAA
ATTATTAGGC GTTGCTTACA AGTGAGTATA AATCCGCTTA TAAAAAAAAT ACTCCATCCT
TTTAGCTGGC CAGGTATACA AGAAAAAGCA AAACAGAAGC AAGAATATGC ATACTTACTG
CTATGCTTTT CAAAACAAGC AACAGATTAT ACGATTTATC AAACATTGAT GGAACATGTA
AAAGAACAGG CACAAGCAGG AGATCCTTTA GCTCAAACCA ATTTAGGGTA TATGTATAGT
GAAGGATTAG GTTTTCCAGT TGATGCTCGG AAAGCTATTG AATGGTATAC TAAGGCTGCT
CATCAAGAGT TTGCCATAGC ACAATGTCTT TTAGGGGATA TATATTATTT TGGAAAAATA
GTTTCATGTA ACTATCAAAA TGCATTGAAA TGGTATAAGA AAGCTGCTGG AAAGGGGTAT
GCTAAGGCAC AAAATGCTTT AGCATATATG TACGAGGAGG GATTAGGCAT CCAAAATAAG
AGTGAAAGAG CTGTCGAGTG GTATACCAAG GCAGCTATGC AAGGAAATAT AACTGCTCAG
TATAACCTGG GTAGAATATA TTACAATGGT AAAGGTGTAA GGCGGGCCTA TAACAAAGCA
TTTAAGTGGT ACCATAAGGC TGCTAACCAA GGCAATATAA AAGCACAAAC TAAATTAGGA
TATATGTATG CTAAGGGCTT GGGGATTGAG CAAAATCTTG GAAACTCAGT AAAATGGTAT
AACAAAGCGG CTAATAAAGG GAATATAACC GCTCAATTTA AGTTAGGCCT TCTATACAAA
AAAGGAGAAG GGGTTGCTCA GGATTACCAT AAAGCATCTG AGTGGTTTAC TAAAGCGGCT
AACCAAGGGC TTGTAAAAGC TCAATATAGC TTAGGATGTC TCTACTATAA TCTAGGTGAA
AGCATTGAGC ATAACTATCA ACAAGCTTTT AAATGGCTTA GTAAAGCTGC GAATGAAGGT
CATGCGGAAG CTCAATTCAG CCTAGCACGT CTCTTTGAAG ATGGATTAGG GGTCGAACAA
GATAAACAGG AGGCTATAGA GTGGTTTACT AAGGCAGCTA ACCAGGGTCT TGTAAAAGCT
CAATATAGTC TAGGTCTTCT CTATGAAACA GATGAAGACA TTGGACATGA TTACCACAAG
GCATTCGAAT GGTACAGTAA AGCAGCAAAT CAAAATGACG CAGTGGCACA ATCTAGTTTA
GCATTTCTCT TTATAGATGG CTTGGGGGTT GAGCGAAATG TGCAGCAAGC CATAGAATGG
TTTACTAAGG CTGCTCAACA GGGGGTTGTA GAGGCACAGT ATAATTTAGG GATTATCTAT
AAAAGGGGGG AGGATATTGA GCGTAACTAT CAAAAATCAT TCGAGTGGTT CACTAAAGCT
GCTAGTCAAG GCAGTGTAGC TGCACAAAAT AAGCTGGGAA GTATTTACAA AAAGGGTTTA
GGAAGAGAGA AAGATCTGAG CCAAGCCATC TTCTGGTGGA TGAAGACACG AAATACAGAC
AAGCTGATGC ATATCTTTAA TGTTAATGCT ACACTTCTTC CTTTAGTAGC TACTCTGCCA
GATAATCAAA CTACTAGCGT TGAAGCAGAA ATGCAGTCTG ACGAAGCAGC TGAACTTTTG
AGTACTTGTC AGGCAATGTT AATACAGAAT AAGTATGATT TAGCTGGCAA GCAAGCTAGT
TTAAGACTTG GTTATTGTGA AGAACTCGAA AAAATAATTT TACAGCTTAT CGATTGGAAA
CAGCAACTAT CTATACAAAC TGGCTTAATG GTTAGCTGTT TATCCTTGCA AAAGTCAGAG
GATACAACGG CTATAGAGAA CTATCAACAG CGGACAGGCA TTGTGCCTTT TGTTAAGCAA
CATATCTTAG CTGAAAATAA AACTTGTCTT TCTTTTGGCC AGGCGAATGT AGAATTAGCT
GACCAAATTA TTACAGAGCT ACAGCATAAG CGTACTTATA GAAGTTTTTA TGGGTTAATT
AACCAACTTA AAGAGTCTTA TGAGTTGGCT CGTCAAAAAG TTACAGTTAA GGTAAGTGCT
ATACAAGAGC AACTACAGAA GTCTGTTGTA GAAGAATCAG AAAAAGCAGT GCTTCTAGCC
ACATTAGCTA GAAAAATGAA ATTAGTAGAT ACATTTAATA CTACTTTACA AACATTAGAA
GAAATACCTA TACAGTTTAA TACTTACTAT AACTTGCTAT TAGAAGAAAT TGATAAAGGA
CAGGCTATCC GTAATCAAAA GTTTAAAGAA GAATATATTT ATCTTTTTCA ATAG
 
Protein sequence
MHLLVIGNAM KMFKINPSLF LSRKTILFIL SLQLIGCGHN SPDNQKEARL LMEINPHHLI 
GDQKAIEACF SLAEHKQHVM LNKYRLKISL SGSANQLLFE KDTETTRSFQ NTTQNLTYFT
SQRELNLEDE LLIVPFTLLP AATTDNVIIK FELLDEDGSI IQKHRVDWSS HTKQTSLLLP
SSDLPEGEIK ILTSTLSKRP LLPTLPYPVE KTKEMQEEEY HDLICSSGSE IIDRSLQYPA
QKKYKKSLPN PKESTESYDT LSIRALAERA DHNDTQAQEE IIRRCLQVSI NPLIKKILHP
FSWPGIQEKA KQKQEYAYLL LCFSKQATDY TIYQTLMEHV KEQAQAGDPL AQTNLGYMYS
EGLGFPVDAR KAIEWYTKAA HQEFAIAQCL LGDIYYFGKI VSCNYQNALK WYKKAAGKGY
AKAQNALAYM YEEGLGIQNK SERAVEWYTK AAMQGNITAQ YNLGRIYYNG KGVRRAYNKA
FKWYHKAANQ GNIKAQTKLG YMYAKGLGIE QNLGNSVKWY NKAANKGNIT AQFKLGLLYK
KGEGVAQDYH KASEWFTKAA NQGLVKAQYS LGCLYYNLGE SIEHNYQQAF KWLSKAANEG
HAEAQFSLAR LFEDGLGVEQ DKQEAIEWFT KAANQGLVKA QYSLGLLYET DEDIGHDYHK
AFEWYSKAAN QNDAVAQSSL AFLFIDGLGV ERNVQQAIEW FTKAAQQGVV EAQYNLGIIY
KRGEDIERNY QKSFEWFTKA ASQGSVAAQN KLGSIYKKGL GREKDLSQAI FWWMKTRNTD
KLMHIFNVNA TLLPLVATLP DNQTTSVEAE MQSDEAAELL STCQAMLIQN KYDLAGKQAS
LRLGYCEELE KIILQLIDWK QQLSIQTGLM VSCLSLQKSE DTTAIENYQQ RTGIVPFVKQ
HILAENKTCL SFGQANVELA DQIITELQHK RTYRSFYGLI NQLKESYELA RQKVTVKVSA
IQEQLQKSVV EESEKAVLLA TLARKMKLVD TFNTTLQTLE EIPIQFNTYY NLLLEEIDKG
QAIRNQKFKE EYIYLFQ
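
As a sanity check on the two sequences above, the following sketch translates the first line of the gene sequence and compares it with the start of the protein sequence. It is a minimal illustration, not the method used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The names `strip_blocks` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments, with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text):
    """Remove the spaces and line breaks used to lay out sequences in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, as laid out in the record.
first_line = "ATGCATTTAT TAGTAATTGG AAACGCCATG AAAATGTTTA AAATTAATCC TTCACTATTT"
print(translate(first_line))  # MHLLVIGNAMKMFKINPSLF
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function should translate the whole reading frame up to the terminal TAG.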