Gene Amir_3796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3796 
Symbol 
ID8327988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4454686 
End bp4456479 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content76% 
IMG OID644944284 
ProductRicin B lectin 
Protein accessionYP_003101522 
Protein GI256377862 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.475116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGGT CCACCCCCTG CGGAGTCGGC GCGCGACGCT CACCCGGCCG CCGGGCGGGC 
AGACCACCCG GCAGGCTCGC CGCGCTGGTC GCGGCGGCGC TGGTCGCGGT GGGGCTCGTC
CCCGCCCAGG CCGCGACCGG TTCGGCGGAA CCGGCGTTAG CCGCAGCGGG CGCGGAGACA
GCGGGCCGGT CGGCGCCTGA CCAGGTCGCG CCGGCAGCGG GCTCCCCCGC CGACGTGCGC
CCCGCCGACG TGCGCCCCGC CGCCCCCGGC AGCCCGGCGA CCACCCCGCC GATGGGCTGG
AACTCCTGGA ACACCTTCGG CTGCAACATC AGCGAGAGCA CGATCCGCGA CGGCGCCGAC
GCGCTGGTCT CCTCCGGGAT GCGCGACGCC GGCTACCAGT ACGTCGTCGT GGACGACTGC
TGGTTCGACG TCCAGCGCCT GCCCGACGGC AGCCTGCGCG GCGACCCCAC CCGGTTCCCC
AGCGGCATGA AGGCGCTCGG CGACTACATC CACGCGCGCG GCCTGAAGTT CGGCATCTAC
CAGGTGCCCA CCGACCGCAC CTGCGCCCAG CGCGGCGGCG CCTACCCCGG TTCCACCGGC
AGCGTCGGGC ACGAGGAGCT GGACGCCCGC ACGTTCGCCT CCTGGGGCGT GGACTACCTC
AAGTACGACT GGTGCTCCCC CGAGGGCGAC CGGGACGAGC AGGTCGCCCG GTTCGCGCTG
ATGCGCGACG CGCTGCGCGC CACCGGCCGA CCGATCGTGT ACAGCATCAA CCCGAACAGC
TACCACGCGA TCACCGGTTC CACGTACGAC TGGGGCGAGG TCGCGGACCT GTGGCGCACC
ACCGAGGACC TGCTGGACAT CTGGCGCAAC GAGAACACCA ACAGCTACCC GATGGGCGTG
GTGAACGTCG TCGACGTGAA CGCCCCGCTC GCCGCGCAGG CCGGGCCCGG CCGGTGGAAC
GACCCGGACA TGCTGGTGGT GGGCAGGCCG GGCCTGACGA CGCAGCAGTC GCGGGCGCAC
TTCGCGCTGT GGGCGCTGAT GGCCGCGCCG CTCATGGCGG GCAACGACGT GCGCGCCATG
CCCGCCGAGA TCTCCTCCAT CCTGCGCACG CCCGGCCTCG TCGCGGTGAA CCAGGACGCG
CTGGGCGCGG GCGGTCGCCG GGTGCGCGAC GACGGCGACA CCGAGGTGTT CGCCAAGCCC
CTGGCCGACG GGTCGGTCGC GGTGGGCCTG TTCAACCGGG GCGCGCAGCC CGCGCGGATC
AGCGCGGGAC CGGCCGAGGT CGGGCTGGCC GGGACGTCGC TGGCGCTGAC CGACCTGTGG
ACCGGGGCGA CCTCGACCGG CGCGCGGATC ACCGCGGACG TGCCCTCGCA GGGGGTCGCC
GCGTTCCGCG TCACCGGGGC GGGGCCGCTG GCCCAGCAGA CCGGGACGCT GCTGGGCGTG
GCGTCGCGGC GCTGCCTGGA CGTGCCGGGC GCGGTGACGA CGCCCACGGC GCGGCCCGCG
CTGTGGGCGT GCCACGGTGC GGCGAACCAG CTGTGGACGC TGTGGCAGGG CGGCGAGGTG
CGGATCTACG GGGCGCAGTG CCTGGACCTG CTGGAGACGG GTGGCGCGCC GGGCACCCCG
GTGGTGACGG CGCTGTGCGA CGGGCGGGCC TCGCAGCGGT GGACGCGCGA CGGACAGCGG
CTGGTGTCCA CCTCCGAGGG CACCTGCCTG GACGCGCGCG GCGTCGCGGC GGGCACGGCG
GCGGGGGTCC AGCCGTGCGA CGGGCGCGCC TCGCAGCGGT GGCTGCTGTC CTGA
 
Protein sequence
MPRSTPCGVG ARRSPGRRAG RPPGRLAALV AAALVAVGLV PAQAATGSAE PALAAAGAET 
AGRSAPDQVA PAAGSPADVR PADVRPAAPG SPATTPPMGW NSWNTFGCNI SESTIRDGAD
ALVSSGMRDA GYQYVVVDDC WFDVQRLPDG SLRGDPTRFP SGMKALGDYI HARGLKFGIY
QVPTDRTCAQ RGGAYPGSTG SVGHEELDAR TFASWGVDYL KYDWCSPEGD RDEQVARFAL
MRDALRATGR PIVYSINPNS YHAITGSTYD WGEVADLWRT TEDLLDIWRN ENTNSYPMGV
VNVVDVNAPL AAQAGPGRWN DPDMLVVGRP GLTTQQSRAH FALWALMAAP LMAGNDVRAM
PAEISSILRT PGLVAVNQDA LGAGGRRVRD DGDTEVFAKP LADGSVAVGL FNRGAQPARI
SAGPAEVGLA GTSLALTDLW TGATSTGARI TADVPSQGVA AFRVTGAGPL AQQTGTLLGV
ASRRCLDVPG AVTTPTARPA LWACHGAANQ LWTLWQGGEV RIYGAQCLDL LETGGAPGTP
VVTALCDGRA SQRWTRDGQR LVSTSEGTCL DARGVAAGTA AGVQPCDGRA SQRWLLS