Gene Amir_3106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3106 
Symbol 
ID8327296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp3585645 
End bp3587291 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content73% 
IMG OID644943626 
ProductRicin B lectin 
Protein accessionYP_003100866 
Protein GI256377206 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0930028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTC GCTCGCGCAG ACGATTAACG ACCCTGCTCC TGGCCGCGCT CGCCGCGCTG 
ACCGGGGTGA CCGCGCCCAC CGCGCACCCC CCGGCGCAGG CCGCCCCCGG AAGCCCCGCG
CTGACCCCGC CGCTGGGCTG GAACAGCTGG AACAGCTTCG GCTGCGGCAT CACCGAGGGC
CAGGTGCGCC AGGCCGCCGA CGCGATGGCG TCCTCCGGGA TGCGGGACGC CGGCTACCGC
TACGTGGTCG TGGACGACTG CTGGTTCGAC CCGCAGCGCG ACAGCGCGGG CAACCTGCGC
AACCACCCGA CGAAGTTCCC CTCCGGCATG AAGGCGCTGG GCGACTACAT CCACGGCAAG
GGCCTGAAGT TCGGCATCTA CCAGGCCCCC AACGAGAAGA CCTGCGCCCA GGGCACAGGC
GCGCACCCCG GCGCCACCGG CAGCAAGGGC CACGAGGCGC AGGACGCCCG CTCGTTCGCC
TCCTGGGGCG TGGACTACCT GAAGTACGAC TGGTGCTCCG GCGCGGGCAC CCGCGACGAG
CAGATCGCCC GGTTCACGAT CATGCGCGAC GCGCTGCGCG CCACCGGCCG CCCGATCGTC
TACAGCATCA ACCCCAACAG CTTCCACGCC ATCACCGGCG ACAAGCACGA CTGGGGCGAC
GTCGCGGACC TGTGGCGCAC CACCGAGGAC CTGCTGGACG TGTGGCAGAA CGGCAACACC
AACAGCTACC CGATGGGCGT GGGCAACGTC CTGGACGTCA CCGCGCCGCT GGCCGCCCAG
ACCGGCCCCG GCAACTGGAA CGACCCCGAC ATGCTCGTCG TCGGCAGGCC GGGGCTCACC
CTGACCGAGT CCCGCGCGCA CTTCGCGCTG TGGGCGCTGA TGGCCGCGCC GCTCATGGCG
GGCAACGACA TCCGCACCAT GTCCCCCGAG ATCAGCGCGG TGCTGCGCAA CCCCGGTCTG
ATCGCGGTCA ACCAGGACCC GCTGGGCGCG GGCGGTCGCC GGGTGCGCGA CGACGGGGCC
ACCGAGGTGT TCGCCAAGCC CCTGTCCGAC GGGTCGGTCG CGGTCGGCCT GTTCAACCGG
GGCGGCGGCG CCACCACCGT CGCCACCACG GCCGCGCAGA TCGGGTTGTC CGGCACCGGG
TTCACCCTCA CCGACCTGTG GACCGGCGGC ACGTCCACCA GCTCGGGCGC GATCTCGGCG
ACCGTGCCCG CCCACGGCGT CGCCGCCTTC CGCGTCACCG GCGGAACCCC GCTGGCGGCC
ACCACCTCGC GGCTGCGCGG GACCGGCTCG GGCCGCTGCC TGGACGTGGA CAACGCCTCC
ACGGCGGCGG GCGCGACCGT GCTGGTCTGG GACTGCCACA CCGCCGCCAA CCAGCTCTGG
ACGACCTGGG CGGGCGGCGA GGTGCGGGTG TTCGGCGACA AGTGCCTGGA CGCCTACGAG
CAGGGAACGG TCAACGGCAC GCGCGTGGTG ACCTGGCCGT GCAACGGGCA GGACAACCAG
CGGTGGGTCG TCGGCTCGGA CGGCTCGGTG CGCAACACCC GCGCCGGGCT GTGCCTGGAC
GTCGACGGCG CGGGCACGGC GAACGGCACG CGGCTGGTGC TGTGGACGTG CAACGGGCAG
GGCAACCAGC GGTGGTCCCG GACCTGA
 
Protein sequence
MPVRSRRRLT TLLLAALAAL TGVTAPTAHP PAQAAPGSPA LTPPLGWNSW NSFGCGITEG 
QVRQAADAMA SSGMRDAGYR YVVVDDCWFD PQRDSAGNLR NHPTKFPSGM KALGDYIHGK
GLKFGIYQAP NEKTCAQGTG AHPGATGSKG HEAQDARSFA SWGVDYLKYD WCSGAGTRDE
QIARFTIMRD ALRATGRPIV YSINPNSFHA ITGDKHDWGD VADLWRTTED LLDVWQNGNT
NSYPMGVGNV LDVTAPLAAQ TGPGNWNDPD MLVVGRPGLT LTESRAHFAL WALMAAPLMA
GNDIRTMSPE ISAVLRNPGL IAVNQDPLGA GGRRVRDDGA TEVFAKPLSD GSVAVGLFNR
GGGATTVATT AAQIGLSGTG FTLTDLWTGG TSTSSGAISA TVPAHGVAAF RVTGGTPLAA
TTSRLRGTGS GRCLDVDNAS TAAGATVLVW DCHTAANQLW TTWAGGEVRV FGDKCLDAYE
QGTVNGTRVV TWPCNGQDNQ RWVVGSDGSV RNTRAGLCLD VDGAGTANGT RLVLWTCNGQ
GNQRWSRT