Gene Amir_3665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3665 
Symbol 
ID8327855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4280158 
End bp4281126 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content70% 
IMG OID644944155 
ProductPeptide-aspartate beta-dioxygenase 
Protein accessionYP_003101395 
Protein GI256377735 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3555] Aspartyl/asparaginyl beta-hydroxylase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.536869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCT CGACGAGTCA GGGGCCGAGC GCTTCCAGCG CGATCGGTGC GCGCGCCTAC 
CTGGTGCAGA TGGCGCGCCG CCACTACACC GGTCTGATCG ACGCCTGGCA CTACGCGGGC
GACGCCGAGA AGGCGATGGC GTGCGCGGAA GCGGCGGTTC GGCAGGGCGT CTGGGAGCAC
CCGCTGCAGC GGGCCAGGGA GTACGTCCCA GGGCTCACCG CGCAGCCGCT GCATGACCCC
GGGCAGTTCT GGTTCACCTC CTACCTGGAG GAGGGCTACC CGCAGATCCG GGCGGAGATC
GAGCAGGTGA TCGGCGCCGC GCTCGACCCG GTCGTGCCCA CCACCGACGA CGCGGGCCTG
ATCCGCAAGG GGGACTGGAA GCAGGCGTAC CTGTACCGGG ACGGCCGCTT CCAGGCCCGG
AACTGCGCGC GGTTCCCCGT CACCATGGGG ATCCTGGAGA AGATCCCGGA CGTCACGGTG
CTCAGCCCAG GGGTGATCTC GGTGTCCAGG ATCTCGCCCG GCACGCACAT CATGCCGCAC
TGCGGAGCCA CGAACGCGCT GCTCCGCATC CACCTGCCGA TCAAGGTCCC CGCCGGGGTC
GGAATCCGGG TCGGTGACCG GGAGACGCGG TGGGAGGAGG GTAAGTGCCT GGTCTTCGAC
GACTCCTTCG AGCACGAGGT GTGGCACCGG GGCAGCGAGG ACCGGGTCGT GCTGATCGTC
GACGTGCTGC ACCCCGAGCT GAAGGGCGAC CACCGGGAGC GCCTGCTGCG GCACCGGCAC
AACTTCGAGG AGCAGATCCT CTCCTTCATG CGTGACCGCG GCCTGCGGCA GGTGCGCGTC
CAGGACGGCG AGGTCCAGCT CACCCCGGAC GACTCCGTCC GTCAGCTGGT CGAGACCTAC
CTTTCGGACA CCGGCATCAC CGGGGTCGAG CTGGACGGCG ACGGGGTGCG GTGGGAACGC
CGGGACTGA
 
Protein sequence
MTTSTSQGPS ASSAIGARAY LVQMARRHYT GLIDAWHYAG DAEKAMACAE AAVRQGVWEH 
PLQRAREYVP GLTAQPLHDP GQFWFTSYLE EGYPQIRAEI EQVIGAALDP VVPTTDDAGL
IRKGDWKQAY LYRDGRFQAR NCARFPVTMG ILEKIPDVTV LSPGVISVSR ISPGTHIMPH
CGATNALLRI HLPIKVPAGV GIRVGDRETR WEEGKCLVFD DSFEHEVWHR GSEDRVVLIV
DVLHPELKGD HRERLLRHRH NFEEQILSFM RDRGLRQVRV QDGEVQLTPD DSVRQLVETY
LSDTGITGVE LDGDGVRWER RD