Gene Amir_3958 details


Gene Information

Locus tag: Amir_3958
Symbol:
ID: 8328151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4640945
End bp: 4642042
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 68%
IMG OID: 644944434
Product: Pectate lyase/Amb allergen
Protein accession: YP_003101671
Protein GI: 256378011
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3866] Pectate lyase
TIGRFAM ID:
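
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon; both are assumptions about this record's conventions, not something the page states. The function name is illustrative only.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return (span_ok, protein_ok) for a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    # Inclusive span of the coordinates should equal the reported gene length.
    span_ok = (end_bp - start_bp + 1) == gene_len_bp
    # One codon per residue, plus one stop codon that is not translated.
    protein_ok = gene_len_bp % 3 == 0 and gene_len_bp // 3 - 1 == protein_len_aa
    return span_ok, protein_ok
```

With the values listed for Amir_3958 (4640945, 4642042, 1098, 365), both checks pass: the span is 1098 bp, and 1098 / 3 = 366 codons, one of which is the stop.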


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGAT CAGTCGTCCG GCGAATCCCC GCGGTCGTGG CCGCGCTGCT CGTCTCGACC 
ACCGCGGGCG CCGCCCTCAC CGCCGGGTCG GCCTCCGCCG CGGGCGCCGC GACCGGGTAC
GCCTCCGCCA ACGGCGGCAC CACCGGCGGC CAGGGCGGGG CGACGGTCCG GGCCACCACG
GGCACCCAGA TCCACCAGGC GCTGTGCGGC CGGGCCGGCA CGAGCACCCC CATCACCATC
GAGGTGCAGG GGACCATCAA CCACGGCAAC ACCAGCAAGG TCTCCGGAAG CTGCGACACC
GCCGCCGGGG TCATCGAGCT CAAGAAGATC AGCAACGTCA CGATCATCGG CGTCGGCTCC
GGCGCGGTGT TCGACCAGAT CGGCATCCAC ATCCGCGAAT CCCGCAACAT CATCATCCGG
AACGTGACGA TCCAGAACGT CAAGAAGTCC GGATCGCCCA CGTCCAACGG CGGCGACGCC
ATCGGCATGG AGCGGGACGT GCGCAACGTG TGGGTCGACC ACGTCAACCT GATCGCCTCG
GGCGGCGAGT CGGCGGGGTA CGACGGGCTT TTCGACATGA AGGACAACAC CCAGTATGTG
ACCCTGTCCT ACAGCACCCT GCGCAATTCC GGTCGCGGCG GTCTGGTCGG TTCCAGCGAG
AGCGACCGCT CGAACAGCTT CATCACCTAC CACCACAACC TGTACCAGAA CATCGACTCC
CGGACCCCGC TGCTGCGCGG CGGCACGGCG CACATGTACA ACAACAACTA CGTGAGCCTG
AACGAGTCCG GCATCAACTC GCGCGCGGGC GCGAAGGCCA AGGTCGAGAA CAACTACTTC
AAGAACTCCC GCGACGCCCT CGGCACCTTC TACACCGACG AGGCGGGCTA CTGGCAGGTC
AGCGGGAACA CGTTCGACAA CGTCACCTGG TCCACCCCGG ACGACGAGAC CAACCCGGCG
GGGCCGAACC CGCAGTCCAC CACCTCGGTC ACCGTGCCCT ACAGCTACCG GCTCGACCAG
ACGAGCTGCG TGCCGACCAT CGTCGCCCGC ACGGCGGGGG CCAACACGGG CCTGAAGGAG
TCGGACGGCT CCTGCTGA
 
Protein sequence
MKRSVVRRIP AVVAALLVST TAGAALTAGS ASAAGAATGY ASANGGTTGG QGGATVRATT 
GTQIHQALCG RAGTSTPITI EVQGTINHGN TSKVSGSCDT AAGVIELKKI SNVTIIGVGS
GAVFDQIGIH IRESRNIIIR NVTIQNVKKS GSPTSNGGDA IGMERDVRNV WVDHVNLIAS
GGESAGYDGL FDMKDNTQYV TLSYSTLRNS GRGGLVGSSE SDRSNSFITY HHNLYQNIDS
RTPLLRGGTA HMYNNNYVSL NESGINSRAG AKAKVENNYF KNSRDALGTF YTDEAGYWQV
SGNTFDNVTW STPDDETNPA GPNPQSTTSV TVPYSYRLDQ TSCVPTIVAR TAGANTGLKE
SDGSC
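
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any processing. The following is a minimal Python sketch of how the gene sequence could be stripped of whitespace, checked for GC content, and translated for comparison with the protein sequence. It assumes the amino-acid assignments of translation table 11 (identical to the standard code for all 64 codons) and does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not converted to M (a simplification).
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

Applied to the first line of the gene sequence, `translate_cds` yields `MKRSVVRRIPAVVAALLVST`, which matches the first 20 residues of the protein sequence.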