Gene Acid345_2854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2854 
Symbol 
ID4070373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3393342 
End bp3395132 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content64% 
IMG OID637984872 
Productamidase 
Protein accessionYP_591929 
Protein GI94969881 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAT CTCGCCGCGA CTTCCTAACG CAAACTTCGC TCGCCGTCAT CGGCGCTGCC 
GTTGCCTCGT CCGCACAACA ACCGACGGAA CCGACACCCG GGGCGCCACC GGCATTTGGG
ACGGCGCCGC CGGTGGGGCC GGAGGTCTCG GCGGAGACGT TCGCTCAGGC CGAGAAGCTG
GTGCAGGTGC AGATGTCGGC GGAACAACGC GCCCAGGCCG CAAGCAACTG GCGGATGTCG
ATGGCCGCGA TGTACGAGCG TCGCGTGGGC CCGAAGAGGA CAGCGCTGGA AGACACGCTA
CCGCCGGCGA CGCACTGGAA CCCGATGATC GCGGGCGTGC AGCCGGGGCC GACGCGCGAT
GTGTTCGTGC GGAGCCAAGC CGACCCCGGG CCGCTTCCGA CGAGCGACGA AGAGATTGCA
TTCGCGCCGG TAACGAAGCT ATCGAGGTGG ATCGAGCAGC GGAAGCTGAC TTCCGAGCGC
CTGACCAGGC TCTACCTCTC GCGACTGGAG AAGTTCACTC CCAAGCTTAA GTGCGTGATC
ACACTGACCA CAGACTTGGC AATGAAGCAG GCGCAAGCCG CGGACAAGGA GATCGCGGCG
GGCAAGTATC GCGGGCCACT GCACGGCATT CCGTGGGGCG CGAAAGATCT GCTCGACACG
GCGCACATCC GCACAACGTA CGGCGCCGAG CCGTTCCGCA ACCGTGTTCC CAGTGCAGAC
GCGACCGTGG TGAAGCGGCT GCATGACGCC GGAGCAGTGC TGGTGGCCAA GCTCAGCCTC
GGCGCGCTGG CTCTCAACGA CGTTTGGTTT GGTGGGCAGA CGGTGAATCC GTGGCTGCCG
GAGGAAGGGG CTTCGGGATC GAGTGCGGGG CCGGGTGCGG CGACGGCGGC GGGACTGGTC
GGCTTTGCCA TTGGGAGCGA AACCGGAGGC AGCATCGTGG CGCCGTCGAT GCGCTGCGGC
GTTACGGGAT TGCGTCCGAC TTACGGCCGT GTGGCGCGTA CCGGTGCGAT GACGCTCTGC
TGGTCGCTCG ACAAGCTTGG GCCGATGACG CGCAGCGTAG AAGATGCGGT GCTGGTGCTG
CAAACCATCA GCGGGCCGGA TGCGGAGGAT GTGGCCAGCG TGCCGAGTCA TCTTGATTTC
GACGCAGGTG CCGCTGTCAC TGGGCTGAAG GTCGGTTACT TCCCTGCATG GATGAAGGAA
GCTCCGGCGA CCGACGTGGA TCGCGCTGCA CTCGAAACCG TGAAGAAGCT TGGCATGGCG
GCAGTCGAAG TATCGCTTCC CGATTGGAAT TACGACTGTC TCGACACCAT TCTCTTTGCC
GAGAGCGCCG CGGCCTTCGA AGAACTGACC CTCAGCGGCG CCGTAGATGC GCTCAAAATG
CAGACGCCGG ATTCCTGGCC GAATACTTTC CGGCAGTCGC GATTCTTGTC TGCCGTGGAT
TTCGTACAAG CCGACCGCAT GCGGCGGAAG GTCGCGATGG AGATGGCCCG CGTGATGTCC
GAGGTGGATT TGCTGCTGGT GCCTTCCCTG CGCGACGAGA TGCTCACGCT CACCAACTTC
ACCGGACATC CCTCGCTCAC ATTGCGAGCG GGATTTGTGG AAGTCGGCGA AGCGCGCAGC
GACTGGGCGC CGGACCCGAA GCATCCGTTG CCGAAGTTCA ATCCGCCGCG ACGGGTGCCG
CATGGGGTGA CCTTAATCGG CCGGCTGTTC GATGAAGGGA CGCTGGGGAG AGTCGGGATC
GCGATGGAGA AGGCATTTGG GGTGGAGGAG AGGCCTAACG GGTACGCGTA G
 
Protein sequence
MSKSRRDFLT QTSLAVIGAA VASSAQQPTE PTPGAPPAFG TAPPVGPEVS AETFAQAEKL 
VQVQMSAEQR AQAASNWRMS MAAMYERRVG PKRTALEDTL PPATHWNPMI AGVQPGPTRD
VFVRSQADPG PLPTSDEEIA FAPVTKLSRW IEQRKLTSER LTRLYLSRLE KFTPKLKCVI
TLTTDLAMKQ AQAADKEIAA GKYRGPLHGI PWGAKDLLDT AHIRTTYGAE PFRNRVPSAD
ATVVKRLHDA GAVLVAKLSL GALALNDVWF GGQTVNPWLP EEGASGSSAG PGAATAAGLV
GFAIGSETGG SIVAPSMRCG VTGLRPTYGR VARTGAMTLC WSLDKLGPMT RSVEDAVLVL
QTISGPDAED VASVPSHLDF DAGAAVTGLK VGYFPAWMKE APATDVDRAA LETVKKLGMA
AVEVSLPDWN YDCLDTILFA ESAAAFEELT LSGAVDALKM QTPDSWPNTF RQSRFLSAVD
FVQADRMRRK VAMEMARVMS EVDLLLVPSL RDEMLTLTNF TGHPSLTLRA GFVEVGEARS
DWAPDPKHPL PKFNPPRRVP HGVTLIGRLF DEGTLGRVGI AMEKAFGVEE RPNGYA