Gene Aasi_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1962 
Symbol 
ID6377487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1873800 
End bp1876895 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content34% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003573286 
Protein GI294661410 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACT TACTACGTTA TACTGCTATA CTTATGATAT GGTCTATAAG TAATCTAGCC 
ATATATGGAC AAGAACTGCT AGAAAAGCCC ATATCTATTT CAGATTTTTA TAAAGGGTTA
GAATTATATG AAAAACAACA ATATGAGGCA GCACAGCACT ATATGGATAG GTATATAACC
GAACATACAG CCTACATAGG AAATGAATAC GTTATAGAAG CAACTTACTA TGCAGCATTT
TGTGCTATTA AACTAGATAG AATAGATGGA GAAGTACGTC TTCAGCAATT TGTAGAAAAA
TATCCTTATC ATCCGAAAGC TGCTCTAGCA TATTATGAAT TAGGTAATCT ACGTTGTTAT
CAACAAGACT ATGCTAAAGG TATTACCTAT TACTTATCAG TCAACAAAGA ACAGTTGGCT
AACACTTTAC ATACTGAATT GCAATACAGG TTAGCGTATG CATACCTAAA TGAAAGAGAC
TTTGGCCAAG CTTTGTCTTA TTTTAATGCT ATTAAAAATC ATGATACTCC CTACACACCT
GCCAGCAACT ACTATGCTGG ATATCTTGCC TTAAAAAAAG GGGATTATGA GAGTGCATTA
ATAGACCTAA GAAAAGCTGG GAACCATGAG GCATATGAAG CTGTAGTCCC TTATATGATA
ATGGAAGTAC TTTACCAAGC GAAACGTTTC CAAGCAGCCA TTAATTATAT TAAAGACGTA
CAGACTAAAC AACCAACATT AAAAAATTAC GAAGATATAG AATTACTTAC TGCCGAATCT
TATTTCTTCT TAAAGGATTA TGCATCAGCT ACCCGACATT ACGAAAATTA TATTCACCTT
CAACCTTCAG AGGTAACGCA TGAAGTCTTT TATCGGCTAG CTTACTCTTT ATATAAATCA
GGAGAAAACT ACAAAGCGCT CAAATATTTA AAAGAGTTAG CATTGCAAGA TGATTATCTG
GCTCAGCTAG CGAGCTACTA TATGGGGTTG ATATATATCA AAACGAGCCA AAAGAATTTA
GCATTAGCAG CATTTGACCA AGCAAGGCAG ATGAACTTTA TTAATGAAAT ACAAACAGAA
GCATCTTTCC AATATGCACA ATTAAGCTAT GAGTTAGGAA AACTTACCAT ATCCATTGAC
GCATTACAAA AATTTAAAAG ATCTTACCCT AACAGCCCGC ATATAACTAC AGTTGATCAG
TTACTAAGTC AAGTGTACTT TCATACCAAC CACTATGATT TAGCAATTGC TCATATTGAG
AGCTTACAAG AAAAGCCAGA AACTGTTTTA CAAGTGTATC AAAAAGCAAC TTTTTATAAG
GGCAATGCTT ACTTCAACCA GGAAGCTTAT GATAAAGCCA TTACTTGGCT ACAAAAATCT
TTATATTATC CTTTAGATAC AGATATAACA CTACAAACAC ATTTATGGCT TGCAGAAAGT
TATGTAGCAC AGCAAGCTTA CGAGCAAGCG ACTACGCATT ACCAAACAGT ACTTGCAGCA
ACCGACAAAA AGAATACAAA TTATTACCAA GATGCACTTT ACGGGCTTGG CTACGTTTTA
TTTAATACAG AAAAATACAA GGCAGCATTA CCATTATTTT TACAATATAT CAATATACCT
AATATAACTA ACGATAATAA TTGGCGTTTA GATGTATTAG TTAGAACAGC AGATTGCTAT
TATGCTATTA AAGATTACCA TAAGGCGCTA GATTTATATA CTAAAACTGA AGATAATTAT
CCTGCACATA ACCGTTATCA GAAAGCTCTT ATTTATGGGT TGCTTGGCAA GTTCGTAGAA
GCCAAACAAA ACTTAGAAAG TATTATTAAT ACCTGTCCAC ATACTGCCTA TTATGAAAAA
GCATTATTTG AATATGCATA CCTAGCGTTG CAACATCAGG AGTATGATCT AGCAATCAAA
AGTTTTACCA ACTTTATTCA AAAGAAACCT TATAGTACTC TTGTGCCAGA TGCTTTGCTT
CATAGAGCAG TTGCTAAGGT AAACTTAAAA CAATATGCAG AAGCCGGAAA AGATTATGAA
ACATTACTTA AAGACTATCC AACTCATCCA AATGCACAGA GTGCATTATT AGAGCTTCCC
AATTTAGTTG TACAAGAAGG AAAACCAGAG AAGCTACAAC AATATTTAGC TAGCTACAAG
GCTGCTAATC CAAGCAGTGA GACACTAGCA GCCATAAGCT TTGAAGCTGC TAAAAATTTA
TTTTATAGTC AGAACTACAC ACCAGCAGTT CAACAACTTA AAGAGTTTAT AACCAGTTAT
CCTAATAGTA CATTGATAGA TGAAGCTAAC TTTTTAATAG CGGAAGCTTA TTATAGATTA
GCAGAAGATG AGCAAGCGCT TATACAATAT CATATTACTA GTAAAAATAA ACAGACACCC
TTTTATAATA GAATCTTATT ACGTATTGCA TCGCTTGCTT ATAAGCACAA GGATTTTAAT
ACAGCACTTA CACATTACAA GCAGCTTAAA GAAAGTGCTA GCAATAAAAA AGAAACTTAT
TATGCCTTAG AGGGAATCAT GAAGACTAGT GATGCGCTAC AACAATATGA AGAAGTCAAC
AAAGCAGCTT CACAAATTAT TAATCAAGGT AACATAACAA TTAATGCTGT AAGCCAAGCA
GCTTTATATC TAGGAAAAAC TGCCCTAAAG CAAGCTAAGT ACCAAGAAGC TCATGAACAT
TTTAAACAAA TTGTAAAAAA TGGACAAGAT ATGTATGCAG CAGAAGCTCA ATATCTAATA
GCATACACTT ATTACCAACT AAGAGAATTT AAACAATCAT TAGAAGCATT GTTTATCCTT
AACAAACAGT TTGCTGAATA TACTGAATGG ACTAACCAAG GGTTCTTGTT GATGGCAGAT
AATTATATAG CCTTACAAGA ATTTTTCCAA GCAAGAGCTA CACTTCAGTC TATTATAGAA
AATGCTACTG ATTCAAGCTT TGTTAACACA GCACAACAAA AACTACAGCA ACTTATACAG
CAAATAGAAG CAGATAGCCT TGAACAAGCA CAGGCAAAAA CTACAACCAC GCAACCATTG
CAGGATGAGG ATAATGAATT TAAAACATTA GAATAA
 
Protein sequence
MTNLLRYTAI LMIWSISNLA IYGQELLEKP ISISDFYKGL ELYEKQQYEA AQHYMDRYIT 
EHTAYIGNEY VIEATYYAAF CAIKLDRIDG EVRLQQFVEK YPYHPKAALA YYELGNLRCY
QQDYAKGITY YLSVNKEQLA NTLHTELQYR LAYAYLNERD FGQALSYFNA IKNHDTPYTP
ASNYYAGYLA LKKGDYESAL IDLRKAGNHE AYEAVVPYMI MEVLYQAKRF QAAINYIKDV
QTKQPTLKNY EDIELLTAES YFFLKDYASA TRHYENYIHL QPSEVTHEVF YRLAYSLYKS
GENYKALKYL KELALQDDYL AQLASYYMGL IYIKTSQKNL ALAAFDQARQ MNFINEIQTE
ASFQYAQLSY ELGKLTISID ALQKFKRSYP NSPHITTVDQ LLSQVYFHTN HYDLAIAHIE
SLQEKPETVL QVYQKATFYK GNAYFNQEAY DKAITWLQKS LYYPLDTDIT LQTHLWLAES
YVAQQAYEQA TTHYQTVLAA TDKKNTNYYQ DALYGLGYVL FNTEKYKAAL PLFLQYINIP
NITNDNNWRL DVLVRTADCY YAIKDYHKAL DLYTKTEDNY PAHNRYQKAL IYGLLGKFVE
AKQNLESIIN TCPHTAYYEK ALFEYAYLAL QHQEYDLAIK SFTNFIQKKP YSTLVPDALL
HRAVAKVNLK QYAEAGKDYE TLLKDYPTHP NAQSALLELP NLVVQEGKPE KLQQYLASYK
AANPSSETLA AISFEAAKNL FYSQNYTPAV QQLKEFITSY PNSTLIDEAN FLIAEAYYRL
AEDEQALIQY HITSKNKQTP FYNRILLRIA SLAYKHKDFN TALTHYKQLK ESASNKKETY
YALEGIMKTS DALQQYEEVN KAASQIINQG NITINAVSQA ALYLGKTALK QAKYQEAHEH
FKQIVKNGQD MYAAEAQYLI AYTYYQLREF KQSLEALFIL NKQFAEYTEW TNQGFLLMAD
NYIALQEFFQ ARATLQSIIE NATDSSFVNT AQQKLQQLIQ QIEADSLEQA QAKTTTTQPL
QDEDNEFKTL E