Gene Aasi_0353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0353 
Symbol 
ID6377298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp414318 
End bp415592 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content39% 
IMG OID642681523 
Producthypothetical protein 
Protein accessionYP_001957507 
Protein GI189501790 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.326353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATAA AAATAACCAA AGAAGCAGTA TTAGATTATC ATGCACAAAA GCCTGCAGGT 
AAACTAGGTA TACATGCCAC CAAGCCATTA CAAACTCAAT ATGACCTATC AATAGCATAT
TCACCAGGAG TGGCCATCCC TTGCCAAGCT ATTGCCGAAG ATAAGCAACA AGTATATAAT
TATACTGCTA AAGGAAACTT AGTAGCAGTT ATTTCTAATG GCACAGCCAT ACTAGGGTTA
GGCAATCTAG GCCCTGAAGC AGCTAAACCT GTTATGGAAG GTAAGGCTAT TTTACTTAAA
AAATTTGCAG GTATTGATGC GTTTGACATT GAAATTGACG CAACAGAGCC TGCGGATGTG
ATACATATTA TCAAGGCTTT AGCACCTACC TTTGGCGGTA TTAACTTAGA AGACTTTAAA
GCACCTGAAT GTTTTGAAAT TGAAACTGCA TTAAAAGAAC AATTATCTAT ACCAGTCATG
CACGACGACC AGCATGGCAC TGCTATTATA GCAGGTGCTG CACTAAAAAA TGCACTCTTG
TTGGTAAAAA AAGAGATTGG TAATATTCAA GTAGTTATCA ACGGTGCTGG TGCTGGTGCT
ATTGCATGTG CTAAGCTTAT TGTAGCATTG GGTGTAAAAC CTGGTAACTT GGTAATGTGT
GATACACAAG GGGTTATTCG CAAAGATAGA GAAGAGCTGG CAGGAGAGAA ATCAAGATTC
GCTACTGATA GGTCTGTCCA TACTTTAGTA GAAGCCTTGA AAGGAGCTGA TGTGTTTATG
GGACTTTCAA AAGGCAATAT CCTACAGCCA GAACATATTC TTGACATGGC AGAGCGTCCT
ATTGTATTTG CTTTAGCCAA TCCAAATCCA GAAATTAATT ATGATTTGGC AGTGAACACA
CGGAAAGATA TTATCATGGC TACGGGAAGA TCTGATTATC CTAATCAGAT TAACAATGTG
CTAGGGTTTC CTTATATTTT TAGAGGAGCG TTAGATGTAT GGGCTACAGC TATTAATGAG
CCTATGAAGC TAGCAGCTGT AGAAGCTTTA GCCGCACTTG CACAACAGCC TGTTCCCAAT
CAAGTGAAAA AGGCTTATGG GGTAGAGTCG CTTGAATTTG GATCTACTTA TATATTACCT
AAACCAATAG ACCCTCGACT TATAACAACT GTTTCTCCAG CTGTAGCACA GGCTGCCATA
TCATCAGGAG TAGCAAAAAA ACATATAGGT GACTGGGAAG CTTATAAAGA ATCTCTCAAA
CAATATATTG AATAA
 
Protein sequence
MSIKITKEAV LDYHAQKPAG KLGIHATKPL QTQYDLSIAY SPGVAIPCQA IAEDKQQVYN 
YTAKGNLVAV ISNGTAILGL GNLGPEAAKP VMEGKAILLK KFAGIDAFDI EIDATEPADV
IHIIKALAPT FGGINLEDFK APECFEIETA LKEQLSIPVM HDDQHGTAII AGAALKNALL
LVKKEIGNIQ VVINGAGAGA IACAKLIVAL GVKPGNLVMC DTQGVIRKDR EELAGEKSRF
ATDRSVHTLV EALKGADVFM GLSKGNILQP EHILDMAERP IVFALANPNP EINYDLAVNT
RKDIIMATGR SDYPNQINNV LGFPYIFRGA LDVWATAINE PMKLAAVEAL AALAQQPVPN
QVKKAYGVES LEFGSTYILP KPIDPRLITT VSPAVAQAAI SSGVAKKHIG DWEAYKESLK
QYIE