Gene Aasi_0816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_0816 
Symbol 
ID6376949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1033488 
End bp1034840 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content35% 
IMG OID642681957 
Producthypothetical protein 
Protein accessionYP_001957919 
Protein GI189502202 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.646747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TAGTCGTTTT AGGAGCAGGG GAAAGTGGAA CAGGAGCTGC TCTATTGGCA 
CAAGCAAAAG GCTATCAAGT ATTTGTCTCA GATAAAAACA TAATTACAAA AGGCTATAAA
CAGCAGCTAT TAGACCATAA AATTGAATTT GAGGAGTGCA AACATACCTG GGATAAGATA
AAAATTGCCG ATGAAGTTAT TAAAAGCCCT GGTATCCCTA ATCATATAGC TATCATACAA
GCACTTCAAG AAGCAAGTAT ACCTATCATA GACGAAGTAG AGTTTGCGAG TCGCTATACA
AAGGCTTCGC TTATTGCCAT AACTGGCTCA AACGGTAAAT CTACAACCAC CCATTTAGCT
TATCATTTAC TGAAAGCAGG TGGACTAAAT GTAGGTATAG CAGGAAATAT AGGGACAAGT
TTTGCAAGAA AAGTTTTATT AGAAGAGCAT AATTATTATG TACTAGAGCT CAGCAGTTTT
CAATTAGAAC ATCTACAGAC CTTTAAAGCA GATATAGCTT GTATACTCAA TATTACACCT
GATCATTTAG ATCGTTATGA CAATCAACTA AATAATTATA TAGCCGCTAA GTTTAGGATA
TTACGTAATA TGATTAAAGA GGGGTATTTT ATTTATAATC AAGATGATAC TAACATACGT
GAGTACCTTG TTAAACAAAC TACTAATTTA CCTCAACTCT ATCCTATTTC TTTGACCCAA
TCGACCCATC ACGGTGCTTA TGTAGAAAAT AACCATCTTC ATTTTGTAGG AGACAACTAC
AATTTTCAAC TACCTACCCA AACCTTACCC CTGCTAGGCA AGCATAATAT ATATAATACT
ATGGCTGCTA TCTCGATAGC TAGTCTATTA GGCCTCTCTT ACGCTTCTAT CTTAGATGGG
CTAAGAACTT TTAAAGGGCT ACCTCACCGC ATGGAATGGA TTGCTAACGT TAATCAAGTA
AGTTTTTATA ATGATTCTAA AGCCACCAAT GTAGAATCAG CTGCCGTAGC TCTAGAAAGC
TTTACAAGTC CGATTATCTG GATAGCAGGT GGCTATGATA AAGGGAATGA TTATAATGTA
CTAAAACCAA TAGTAAAAAA TTCTGTAAAA GCACTCATTT GCTTAGGAAA AGATAATCAA
ACTATCTGCA AAGCTTTTAA GGAATCCAAT ATTCCTATTT ATGAAACACA AGTTATGCAG
GAAGCAGTAA GCCTAGCATA TACCCTAGCA AAACCACAAG ATATAGTGCT ATTGTCACCT
GCTTGTGCTA GCTTTGATTT ATTTAAAAAT TTTGAAGATA GAGGAGAACA GTTTAGGTAT
GCAGTACAAC AACTTTTAAC TTCAATAAAT TAA
 
Protein sequence
MKKLVVLGAG ESGTGAALLA QAKGYQVFVS DKNIITKGYK QQLLDHKIEF EECKHTWDKI 
KIADEVIKSP GIPNHIAIIQ ALQEASIPII DEVEFASRYT KASLIAITGS NGKSTTTHLA
YHLLKAGGLN VGIAGNIGTS FARKVLLEEH NYYVLELSSF QLEHLQTFKA DIACILNITP
DHLDRYDNQL NNYIAAKFRI LRNMIKEGYF IYNQDDTNIR EYLVKQTTNL PQLYPISLTQ
STHHGAYVEN NHLHFVGDNY NFQLPTQTLP LLGKHNIYNT MAAISIASLL GLSYASILDG
LRTFKGLPHR MEWIANVNQV SFYNDSKATN VESAAVALES FTSPIIWIAG GYDKGNDYNV
LKPIVKNSVK ALICLGKDNQ TICKAFKESN IPIYETQVMQ EAVSLAYTLA KPQDIVLLSP
ACASFDLFKN FEDRGEQFRY AVQQLLTSIN