Gene Amir_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4021 
Symbol 
ID8328214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4720433 
End bp4721644 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content76% 
IMG OID644944493 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003101730 
Protein GI256378070 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0221723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGCAGC ACGGCAGGGA CCCCCGGCGC AGCCCGGCGC GGCGCGGCGC GTCCGCCGCC 
AAGCTCGGGT CCGTCGCGCT CGGCGCGGTG CTCGCCGCCT CCGTGCTGAC CGGCGTCGCG
CTCAACCGGG ACGACACCGT CACCGGCGCC GCCGTCCCCG AGGGCGGGAC CACCGAGACC
GGCGAGACCA CGACGACCAC GACCTCCACC CCGCCGCCCG ACCCGTGCGC CCCCGTCCTG
GCCGGGCTGA CCCCGCGCGC CGGGCTCGCC CAGCTGCTCC AGGTCGGCGT CAACCCGCGC
GGCCCGCAGG ACGCGCTGTC CATCGTCGGC TCCGAGCAGG TCGGCGGCAT CTTCGTCGGC
GGCGACGACG TCGGCCTGCT GTCGGGGGAC GCGCTGGCCG CCGTGCACGC CGCCTCCACG
CTGCCGCTCA CCGTCTCGGT GGACGACGAG GGCGGCCGGG TGCAGCGGAT CGACGCGCTC
GACGGCGACA TCCCCAGCGC CCGCACCATG ACCCGCACCC TGTCCACCGA GCAGGTCCGC
GAGCTGGCGC GCAAGCGCGG CGAGGCGATG AAGGCGCGCG GCGTCAACAC CGACCTCGCC
CCCGTGCTCG ACCTGACCTC CCAGGCCGCG AACACCGTGA TCGGCGACCG CTCGTTCAGT
GTCGACCCGG CCACCGCCGT CTCCTACGCC GAGGCGTTCG CCGAGGGCCT GCGCCAGGCC
GGGGTCGTCT CGGTGGTCAA GCACTTCCCC GGCCACGGCA ACACCTCCGG CGACTCGCAC
CTCGGCTCGG TCACCGCGCC CCCGCTCGCC CAGCTGCGCG CCCACGACCT CGCGCCCTAC
CGGGAGCTGC CCAGGTTCGG CGAGGACGTG CAGGTCATGG TCGGCCACAT CGCCGTCCCC
GACCTGACCG GCGGCCTGCC CGCGAGCCTG AGCCCGGCCG CCTACGAGCT GCTGCGCGGC
GAGTTCGCGT TCGACGGCCT GGTCATGACC GACGACCTGG GCGCGATGCG CGCGGTGACC
GACCTGGCCG ACCTGCCCGA CGCGGTGCTG CGCGCGCTGG TCGCGGGCGC GGACGTGGCG
CTGTGGTCGT CCGGCGGCCG GGTCGGCGAG GTGCTCGACC GGCTGCAGGC CGCCGTCGCG
AGCGGCGAGC TGAGCGCCGA GCGGGTGGAC CGCTCGCTGC GCCGCGTGCT CAAGTCCAAG
CACCTCTGCT AG
 
Protein sequence
MEQHGRDPRR SPARRGASAA KLGSVALGAV LAASVLTGVA LNRDDTVTGA AVPEGGTTET 
GETTTTTTST PPPDPCAPVL AGLTPRAGLA QLLQVGVNPR GPQDALSIVG SEQVGGIFVG
GDDVGLLSGD ALAAVHAAST LPLTVSVDDE GGRVQRIDAL DGDIPSARTM TRTLSTEQVR
ELARKRGEAM KARGVNTDLA PVLDLTSQAA NTVIGDRSFS VDPATAVSYA EAFAEGLRQA
GVVSVVKHFP GHGNTSGDSH LGSVTAPPLA QLRAHDLAPY RELPRFGEDV QVMVGHIAVP
DLTGGLPASL SPAAYELLRG EFAFDGLVMT DDLGAMRAVT DLADLPDAVL RALVAGADVA
LWSSGGRVGE VLDRLQAAVA SGELSAERVD RSLRRVLKSK HLC