Gene Aasi_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1103 
SymbolmurC 
ID6377743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1416540 
End bp1417943 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content35% 
IMG OID642682215 
ProductUDP-N-acetylmuramate--L-alanine ligase 
Protein accessionYP_001958175 
Protein GI189502458 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.651159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATT ATAAATACGT TTATATATTA GGTATTGGTG GTATTGGTAT GAGTGCGCTT 
GCTAGATGGT TTCATCGAGA AGGTTCTCAA GTATTTGGTT ATGATAAAGG ACAAACGTTG
CTTACTGACC AGCTAATACA AGAAGGCATA TCAATACACT ATGAAGCAAC CATAGATGCT
ATTCCTACTG CTATTTTGGA TCATCCACAC GAGACTTTAG TTTTATATAC ACCTTCAATT
GCTGCAACTC ATCCTATTTG GCAATATTTA TCTGACCATG GATACAATTT GTGTAAAAGA
GGTGATATAC TAGATATCAT TACAAAGAAT CACTTCACAT TAGCCATAGC AGGTACTCAT
GGTAAGACAA CCAGTACTTC TTTAGCTGCA CATGTCTTAT GTCATGCCAA TAGAAATATA
ACTGCTTTTT TAGGGGGTAT TGCCAAAAAT TATAATAGTA ATTTTATAGC AAGTTCTAAT
ACAGAACAGG AAAGCTCCAT TGTTATAGAA GCAGACGAAT TTGATCGCTT TTTCTTAAAA
TTGCATCCCG ATATAGCAAT TATTACTACA GTAGACCCAG ATCATTTGGA TATATATAAA
GACCAAGATG GCTTTGAAGA AGGATTTAAA CAATTTCTAA GTAGGCTTCC TAAACAAGGA
ATTGCTATCC TACACCATGA TGTTGCAAAG AGATTAATAG ATGATGAGTC ATATCCTACC
CAAGCACAGG TTATTCAATA TGCATTGAAA GATTCACCGA TTTGCGCTGA TAATATATAT
ATTAGTGAAA CAGGAAGGTT TTGCTTTGAC TATATAAGTG AAAAGTTTGT TATTAGAGAT
ATACAATTGC CGGTACCAGG GTACCATAAT ATAGAAAATG CGTTAGCAGT TATTACGGCA
TGCTTGCATA TAGGAATAGC ACCTGAAATT ATTCGAGAAG CCATACATAC CTTCCAGGGA
ATAGCAAGAA GGTTTGACCT CATTATACAA CGTAATGACT TAGTTTTTAT TGATGATTAT
GGACATCATC CTGTTGAGAT AACAGCATTA CTTACTACTA TCCGAAAACT ATATCCAAAC
AAAAAGGTTA CAGTAATTTT TCGTCCTAAC CAGTATAGTA GGACAAAAGA TTTTTTACAG
GAAATTGCAC AAAGCCTTAG CTTAGCAGAC TGTGTATTGA TACTAGATAT TTATAGCGAC
CGTGAAGTGC CTATTGAAGG AATAAATTCA GAAGCTATCT TGCAGCATAT AACTTTGCCT
TATAAGTACG CTTGTACAAA AGAAGATTTG GTTTTGCAGC TTGCACAGGT AGATAAATTA
GAAGTGGTTG TAAATCTAGG AGCAGGTGAT GCAGATGAGT TTATACAACC GGTCAGAGAG
TTTTTACTTA AAAATTACAC TTAA
 
Protein sequence
MEHYKYVYIL GIGGIGMSAL ARWFHREGSQ VFGYDKGQTL LTDQLIQEGI SIHYEATIDA 
IPTAILDHPH ETLVLYTPSI AATHPIWQYL SDHGYNLCKR GDILDIITKN HFTLAIAGTH
GKTTSTSLAA HVLCHANRNI TAFLGGIAKN YNSNFIASSN TEQESSIVIE ADEFDRFFLK
LHPDIAIITT VDPDHLDIYK DQDGFEEGFK QFLSRLPKQG IAILHHDVAK RLIDDESYPT
QAQVIQYALK DSPICADNIY ISETGRFCFD YISEKFVIRD IQLPVPGYHN IENALAVITA
CLHIGIAPEI IREAIHTFQG IARRFDLIIQ RNDLVFIDDY GHHPVEITAL LTTIRKLYPN
KKVTVIFRPN QYSRTKDFLQ EIAQSLSLAD CVLILDIYSD REVPIEGINS EAILQHITLP
YKYACTKEDL VLQLAQVDKL EVVVNLGAGD ADEFIQPVRE FLLKNYT