Gene Amir_4199 details


Gene Information

Locus tag: Amir_4199
Symbol:
ID: 8328392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4949211
End bp: 4950635
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 68%
IMG OID: 644944663
Product: Ricin B lectin
Protein accession: YP_003101900
Protein GI: 256378240
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3866] Pectate lyase
TIGRFAM ID:
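
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4949211, 4950635)
print(length, protein_length_aa(length))  # 1425 474
```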


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.876848
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAAGA AGCACCTGTT CAAGGCCGCC GCCGCGGCGG TCGGGCTGGG CGCGGTGATC 
GCGCTGACCG TGGCCATGCC CTCGGCGCAG GCGGCCGTGC CCGCCGCCGG GAGCACCTAC
ACCATCGCGG TGAAGAACAG CGGGAAGTGC CTGGACGTCC TGGACGGCAA CACCGCCAAC
GGCGCGCTCG TGGCGCAGTG GGACTGCTGG GGCGGGACCA TGCAGCAGTG GACGCTGCGC
GGGTCCGGGT CGGGGACGTA CGCGCTGGCG AACGTCGCGA CCGGCAAGTG CCTGGACATC
CCGTACGGCA CGACCGAGCA CTACGTGCAG GCCCAGCAGT GGAGCTGCTC CGGCGACACG
ATGCAGCAGT GGCGGCTGAC CGCGTCCGGG TCCGGGACCT ACCAGCTGGT CAACGTGGCC
AGCGGGTTGT GCCTGGCCAA CAAGGACGCC GGTCAGGGCC TGGGCGTGGC GATCGTGCAG
GAGGGGTGCA CGGCCAACTC GAACAAGCAG TGGCTGTTCA CGCCGGTGTC CGGCCGGACC
TGGTCGGGCA CGCCCGACGG GTTCGCGGGC GCGGCGGGGA CGACCGGTGG CGCGGGCGGG
ACCGTGGTGA CCGCGACGAC CTTCGCCGAC CTAGTGAAGT ACGCGTCGGC CAGCACACCA
CACGTGATCC GGGTGGACCG GGCGATCACG GTGACGCCGT ACGGGAAGGA GATCCCGGTG
ACGTCGAACA AGACCATCGT CGGGGTCGGC ACGTCCGGGC AGATCGTGAA CGGCGGGTTC
ACCCTCAACG GCGTGTCGAA CGTGATCATC CGGAACCTCA CCATCCGCGA CACCCGCGTG
GCCTCGGACG ACCCGGACGA CAAGGACTTC GACTACGACG GCATCCAGAT CGACAGCTCC
ACCAAGGTCT GGATCGACCA CAACACCATC ACGCGGATGA ACGACGGCCT GATCGACAGC
CGCAAGGACA CCACCGACCT GACCGTGTCC TGGAACGTGC TGGCGGACAA CAACAAGTCC
TTCGGCATCG GCTGGACCGA CAACGTCACC GCCCGCATCA CGATCCACCA CAACTGGATC
CGCGACACCG ACCAGCGCAA CCCCAGCACC GACAACGTCG CCTACGCGCA CCTGTACAAC
AACTACCTGC AGAACGTGAA GTCCTACGGC AACTACGCGC GCGGCGCCAC GAAGATGGTC
CTGGAGAACT CGTACTTCGA CAAGGTCAAG GACCCCTACT ACAAGGACGA CACCGCCCAG
CTCAAGCAGA GCGGCAACGT GGTCGTCAAC TCCAGCGGCA AGCAGCAGAG CGGCGGGGCG
GCCTTCGACC CGAAGACGTT CTACAGCTAC GCGCTCGACC CGGCCGCCGA GATCCCGAAG
ATCCTCGGGA CGTACGCGGG GCCCCAGGGC AACATCGGGG GCTGA
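
The GC content reported for this gene can be recomputed from the sequence itself. The sketch below assumes GC content is simply the fraction of G and C bases among all bases, with the space-separated 10-base blocks joined before counting. The function name `gc_fraction` is hypothetical, and the example uses only the first three blocks of the gene rather than the full sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 30 bases of the gene sequence above (18 of them are G or C).
first_blocks = "ATGCGCAAGA AGCACCTGTT CAAGGCCGCC"
print(f"{gc_fraction(first_blocks):.0%}")  # 60%
```

Passing the whole gene sequence to the same function should give a figure comparable to the GC content listed in the gene information.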
 
Protein sequence
MRKKHLFKAA AAAVGLGAVI ALTVAMPSAQ AAVPAAGSTY TIAVKNSGKC LDVLDGNTAN 
GALVAQWDCW GGTMQQWTLR GSGSGTYALA NVATGKCLDI PYGTTEHYVQ AQQWSCSGDT
MQQWRLTASG SGTYQLVNVA SGLCLANKDA GQGLGVAIVQ EGCTANSNKQ WLFTPVSGRT
WSGTPDGFAG AAGTTGGAGG TVVTATTFAD LVKYASASTP HVIRVDRAIT VTPYGKEIPV
TSNKTIVGVG TSGQIVNGGF TLNGVSNVII RNLTIRDTRV ASDDPDDKDF DYDGIQIDSS
TKVWIDHNTI TRMNDGLIDS RKDTTDLTVS WNVLADNNKS FGIGWTDNVT ARITIHHNWI
RDTDQRNPST DNVAYAHLYN NYLQNVKSYG NYARGATKMV LENSYFDKVK DPYYKDDTAQ
LKQSGNVVVN SSGKQQSGGA AFDPKTFYSY ALDPAAEIPK ILGTYAGPQG NIGG
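
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below assumes that table 11 assigns internal codons the same amino acids as the standard genetic code and differs mainly in which codons may act as start codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in the usual TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCGCAAGA AGCACCTGTT CAAGGCCGCC"))  # MRKKHLFKAA
```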