Gene Amir_3301 details


Gene Information

Locus tag: Amir_3301
Symbol:
ID: 8327491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 3874243
End bp: 3875700
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 70%
IMG OID: 644943813
Product: glycoside hydrolase family 62
Protein accession: YP_003101053
Protein GI: 256377393
COG category:
COG ID:
TIGRFAM ID:
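
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that adds no residue to the protein. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3874243
end_bp = 3875700
gene_length_bp = 1458
protein_length_aa = 485

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which contributes no amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the 1458 bp span corresponds to 486 codons, which agrees with the listed 485 aa protein plus one stop codon.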


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0111017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCTCG CAAGGCTGTT CACCCCGCGC CGGGCCCGGA TCGGCGCCAT GGCCACCACC 
TCGCTCGCCA TGGTGGCCGC GCTCGCCGCC CTCCCACCCC AGGCCGCCGC GGCGCCGGGG
TGCTCGATCG CCTACGCGGC GACCTCCCAG TGGCAGGGCG GTTTCACCGC GAGCGTCGCG
ATCACCAACC TCGGTGACGC CGTCGACGGC TGGACGCTCA CCTGGACCTA CGGCTCGGGC
CAGCAGGTCG CGCAGGCCTG GAACGCCGAG GTGTCCCAGT CCGGCGGCCA GGTGAGCGCC
CGCAACGCCT CCTACAACGC CGCCATCCCC ACCGGGGGCC GCGTCGAGTT CGGGTTCACC
GCGACCTCCA CCGGGTCCAA CCCCGACCCG ACCTCGTTCA GCCTCAACGG GACCACGTGC
ACGGGAGGAG TCGGGCCCAC GACCACCACG CCCCCGACCA CGCCGCCCAC CACCACGACC
CCGCAGCAAC CGGGCGGCTC GCTGCCGAGC AGCTTCCGGT GGTCCTCCAG CGGCGCGCTG
ATCGGCCCGA AGCCCGACTC CTCGCACGCC ACGGTCTCCG TCAAGGACCC CAGCGTGGTG
CGCCACAACG GCCGCTACCA CGTGTTCGCC TCGGTCTACA CCAACGGCTA CAACCTCGTG
CACACCAGCT TCACCGACTG GTCCCAGGCC GCGTCCGCCC CGCACCACTA CCTGGACCGC
TCCGGGATCG GCACGGGCTA CCGGGCCGCG CCGCAGGTGT TCTACTTCGC CCCGCAACGC
CTGTGGTACC TGGTGTACCA GACCGGGTCC AACGCCTCGT ACTCGACGAC CGCCGACATC
GAGAACCCCG CGTCCTGGTC CGCGCCGAGG AACTTCTACG CCAACGGGAT GCCGCAGATC
ATCCGGGACA ACATCGGCAA CGGCTACTGG GTCGACTTCT GGACCGTCTG CGACACGGCC
AAGTGCCACC TGTTCTCCTC GGACGACAAC GGCCACCTGT ACCGCTCGGA GACCAGCCTC
GCCCAGTTCC CCAACGGCTT CACCAACACC GTGATCGCCA TGCAGGACAG CAACCGCAAC
CGGTTGTTCG AGGCGTCCAA CATCTACAAG GTCGCCGGCA AGAACCAGTG GCTGATGCTC
CACGAGGCGA TCGGCTCGGA CGGCCGCCGC TGGTTCCGCT CCTGGACCGC CCCGGCGATC
GCCGGACCGT GGACCGCGCT GGCCGACAGC GAGTCGAACC CGTTCGCCAG GGCCAACAAC
ACCACGTTCC CCGGTGGCCA GTGGACGCGC GACATCAGCC ACGGCGAGCT GGTGCGCAGC
GGGACCGACC AGACCATGGA GATCAACCCC TGCAAGCTGA GCTACCTGTA CCAGGGCCTG
GACCCCAACG CGTCCGGCGA CTACAACCGC CTGCCCTGGC GGCTGGGCCT GCTGACCCAG
ACCAACTCCC CCTGCTGA
 
Protein sequence
MLLARLFTPR RARIGAMATT SLAMVAALAA LPPQAAAAPG CSIAYAATSQ WQGGFTASVA 
ITNLGDAVDG WTLTWTYGSG QQVAQAWNAE VSQSGGQVSA RNASYNAAIP TGGRVEFGFT
ATSTGSNPDP TSFSLNGTTC TGGVGPTTTT PPTTPPTTTT PQQPGGSLPS SFRWSSSGAL
IGPKPDSSHA TVSVKDPSVV RHNGRYHVFA SVYTNGYNLV HTSFTDWSQA ASAPHHYLDR
SGIGTGYRAA PQVFYFAPQR LWYLVYQTGS NASYSTTADI ENPASWSAPR NFYANGMPQI
IRDNIGNGYW VDFWTVCDTA KCHLFSSDDN GHLYRSETSL AQFPNGFTNT VIAMQDSNRN
RLFEASNIYK VAGKNQWLML HEAIGSDGRR WFRSWTAPAI AGPWTALADS ESNPFARANN
TTFPGGQWTR DISHGELVRS GTDQTMEINP CKLSYLYQGL DPNASGDYNR LPWRLGLLTQ
TNSPC
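
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the method used to produce this record: it builds a codon table from the amino-acid string that NCBI lists for translation table 11 (the same amino-acid assignments as the standard code), strips the spacing used in the listing, and translates only the first 30 and last 18 bases as copied from the Gene sequence block. The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers. Alternative start codons, which table 11 allows, are not handled; that does not matter here because the sequence begins with ATG.

```python
# Codon table built from NCBI's base order (T, C, A, G) and the
# amino-acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 18 bases, copied from the Gene sequence block.
first_bases = "ATGCTCCTCG CAAGGCTGTT CACCCCGCGC"
last_bases = "ACCAACTCCC CCTGCTGA"

print(translate(first_bases))  # compare with the start of the protein sequence
print(translate(last_bases))   # compare with the end of the protein sequence
print(round(gc_fraction(first_bases), 3))  # GC fraction of this fragment only
```

The two translated fragments should correspond to the first ten and last five residues of the protein sequence. The GC fraction printed here covers only the 30-base fragment, so it need not equal the whole-gene GC content listed in the record; passing the full gene sequence to `gc_fraction` would give the comparable figure.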