Gene Amir_7049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_7049 
Symbol 
ID8331270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp8190298 
End bp8193360 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content73% 
IMG OID644947478 
Producthypothetical protein 
Protein accessionYP_003104687 
Protein GI256381027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACCT GGGCCAAGCG CGGTCTCCAG TCCGCACTTG TCACCGGTGG TTTGCTGATG 
CTCGGCACGG GCATCGCCTC CGCTGAGGAG CGGGTCAGCC CCGACCTCGC CCCCTCCATG
CTCGACAGCA TCGGCCAGGA GCTGGCCGAC GTCGACATCG AGATCCCCGT CCTCGTCGAG
AAGAACGCGA TCGGCGTCCT GGGCGACGCG GTCGAGCTGC CCGAGATCGA CGAGAAGCTC
GTCGTCCCCT CGCTCAACAC GATGATGGGC AAGCAGTCCA ACCCGCTCGT CAAGGACAGC
CGGGGCATCG GCTCCGACCA GATCATCCGC AACAACCGCA TCGTCGGCGA CCTCGTCGTG
CCGGTCATGG TGTGCGGCAA CGCGATCGGC GCGCTGGGCG ACGCCTACGT CGAGGCCGAC
TGCGAGCAGG AGGCCAGCAG CACCCGCACG ACCATCACCG ACGGCTCCCG CGACTCGCTC
GCGGGCAACG TCGTCGCCAT CGCCGCCGCC GTGCCCACCT CGGTCACCGG CAACGCGATC
GCCGGCCTGG GCAACGCCGA GGCCAACACC ACCGCGTCGC AGTCCGCCGT CGCGGGCGGC
GACATCACCA CCTCCGGCGA GCGCGGCTCG CTGTCCGGCA ACGTCGTCGC GGGCCAGTGG
GCCGTGCCGG TGCAGCTGAA CAACAACGCG GTCGGCACGA TCGGCAACGC GCTCGCCAAC
GGCTCGTCCG ACGTCACCGC CGAGGTGCCC GGCACCATCA CCACGAGCGG CGCGGACAGC
TTCGGCGGCG GCAACGCGCT GCTGGCCCCG CTGGCGCCCA TCGCCGCCGT GAACGGCAAC
GCCGCGGGCG TGCTCGGCAA CGGCGACGTG CTCGCCGAGA ACTCGGCCAC CGCCACGGCG
GGCAGCGAGC ACACCGGCAT CTTCGACCTG CCCACCTACG CGGAGACCTC GGGCAACCGG
GGCACCCTGG CGGGCAACAT CCTCCAGCCG CAGATCGCCG GGCCCATCTC GGCCGACGAC
AACGCGGTCG TGGTCGCGGG CAACTCCACC GCCACCTCCT CGGCGTCCAA CTACTCGCAG
GTCGGCGGCT TCACCAGCAC GACCGGCGAG AACGGCACGC TGTCCGGCAG CATCGGCGAC
CTGCCGATCG GCCTGCCCGC CGCGGTCGGC GGCAACGGCG CGGCGGTCCT CGGCAACGCG
GCGGCCGACC ACACCAACGA CTCCACGACG CTGGTGGGTG GCAACACCTT CACCAACGGC
GACGGCGCCG TGCTCGGCTC CAACGTGGTC TCCGCCCCGG TGACCGCCCC CGCCGACCTG
TGCGGCACCG GCCTCTCGGC GGGCGGCAAC GCCGACGGCA ACTGCAACAA CCTCGTCACC
ACCGAGGTCG GCGGCCTCAA CGCCAGCCGG GGCAACGACA GCGTCGTGTC CGGCAACCTG
GTGTCGCTGC CCTCGTTCGT GCCCGCCGAG TCCTTCGGCA ACGCGTTCTC CGTGCTGGGC
CAGGCCGACG GCACGGCCGA CGAGGTCAAG GGCTCCGCGG TCGGCGGCGA GCCGAACACC
CGCGACGACG ACGGCACGGT GTCCAGCAAC GCGGTCGGCC TGCCCACCGC GCTCGGCCTG
CAGATGCACA GCATGGCCGG TGGCATCCTG AGCAACACCA ACGCGACCGC CACCAGCGAC
AGCACCTTCG ACGTCGGCGG CCCGGCCGAG GCGACCGGCA AGCACGGCTC GATCTCCGGC
AACATCGGCT ACCTGCCGAC CAACAACATC GCGCAGGTCT TCGGGTCCGC GGTGGGCGTC
GGCGGCAACC ACTACAGCAA CAGCGACAAC TTCCTCACCT CCACGGTCGG CGGCGACGCC
GTCTCCACCG GTGAGGACGG CGCGATCGCG GGCAACGTCG CCTCGGTCCC GCTGGCGTCC
ACGCTCCAGT CCTTCGGCCA CTCGGTCGTG GTCGCGGGCA ACGGCGACGT CGTCACCACC
AACACCGCCG ACGTCACCAC CGGCGGCAAC GTCGCCACCA ACGGCGACAA CGCCTCGGTG
TCCGGCAACG GCATCGCCCC GCAGATCGGC CTGCCCGCGC AGGTGTTCGG CATCGCCGGT
GGCGTGGCGG GCAACGGCAA CACCACCCAC ACCAACGACA GCACCCTCGT CGTCGGCGGC
GACCACGACA CCTCGGGCCT GGACTCGGCC GCGTCCGGGA TGCTGGTGAC CGCGCCGATC
TCCGGCAACC CGGCCGTGCA CGGCGACGCC GTGTCGGTGC TGGGCCTGGC CGACAGCGTC
ACCGACTCCA CCGCCGTGAC CCAGGTCGGC GGCAGCACCG AGACGGTGGG CTCCGGCTCG
CTGTCCGCCG CCGAGGTGTA CGCGCCGGTC GAGGCCGCCG CGACCGTGGT GGACGTGCCC
GCGCAGGTCC TCGGCACCGC CACCACGCTG GTGTCCAGCA CTCACGACGT CGAGACCGGC
GACGAGGTCG ACGGCGCGGC CGACACCAAG GGCATCAACC TGCCCAGCTC GGTCGACCGC
CACCTGTCGA TCACGAGCCT GCCGCTGCTG AACAACGTCT TCATGATGGA GCGCTCGTTC
CAGGGCGACC TGCCGGTCGT GGGCGGCCTC GCGGGCGGTC TGCCCGTGGT CGGCGGCCTG
ACCGGCGGCC TGCCCGTGGG CAACCTGACC GGTGGCGGTC TGCCGCTGGT CGGTGGCCTG
GCCGGTGGCC TGCCGGGCAT CAGCGGTCGC AGCGCCCAGG GCGGCGGCCT GCCGGTCGTC
GGTGGCCTGA CCGGCGGTGG CCTGCCCGTG GTCGGCGGTC TGACCGGTGG CGGTCTGCCC
CTGGTCGGTG GCCTGACCGG TGGCGGCCTG CCGGTCGTCG GCGGTCTGAC CGGCGGTGGC
CTGCCCCTGG TGGGCGGCCT GACCGGTGGC CAGCGCTCCT CGGCCCCGAC CCTGCCGACC
GACCAGCTGA CCAAGGCCAC CAGCGGCCTG AAGGTCAACC CGCTGGAGCA GCTCGGCGGC
CAGAAGGGCG GCTCGCCGCT CGGCTCGCTG CTGGCGATCC CGTCGATGTT CGGCTCGCTC
TGA
 
Protein sequence
MQTWAKRGLQ SALVTGGLLM LGTGIASAEE RVSPDLAPSM LDSIGQELAD VDIEIPVLVE 
KNAIGVLGDA VELPEIDEKL VVPSLNTMMG KQSNPLVKDS RGIGSDQIIR NNRIVGDLVV
PVMVCGNAIG ALGDAYVEAD CEQEASSTRT TITDGSRDSL AGNVVAIAAA VPTSVTGNAI
AGLGNAEANT TASQSAVAGG DITTSGERGS LSGNVVAGQW AVPVQLNNNA VGTIGNALAN
GSSDVTAEVP GTITTSGADS FGGGNALLAP LAPIAAVNGN AAGVLGNGDV LAENSATATA
GSEHTGIFDL PTYAETSGNR GTLAGNILQP QIAGPISADD NAVVVAGNST ATSSASNYSQ
VGGFTSTTGE NGTLSGSIGD LPIGLPAAVG GNGAAVLGNA AADHTNDSTT LVGGNTFTNG
DGAVLGSNVV SAPVTAPADL CGTGLSAGGN ADGNCNNLVT TEVGGLNASR GNDSVVSGNL
VSLPSFVPAE SFGNAFSVLG QADGTADEVK GSAVGGEPNT RDDDGTVSSN AVGLPTALGL
QMHSMAGGIL SNTNATATSD STFDVGGPAE ATGKHGSISG NIGYLPTNNI AQVFGSAVGV
GGNHYSNSDN FLTSTVGGDA VSTGEDGAIA GNVASVPLAS TLQSFGHSVV VAGNGDVVTT
NTADVTTGGN VATNGDNASV SGNGIAPQIG LPAQVFGIAG GVAGNGNTTH TNDSTLVVGG
DHDTSGLDSA ASGMLVTAPI SGNPAVHGDA VSVLGLADSV TDSTAVTQVG GSTETVGSGS
LSAAEVYAPV EAAATVVDVP AQVLGTATTL VSSTHDVETG DEVDGAADTK GINLPSSVDR
HLSITSLPLL NNVFMMERSF QGDLPVVGGL AGGLPVVGGL TGGLPVGNLT GGGLPLVGGL
AGGLPGISGR SAQGGGLPVV GGLTGGGLPV VGGLTGGGLP LVGGLTGGGL PVVGGLTGGG
LPLVGGLTGG QRSSAPTLPT DQLTKATSGL KVNPLEQLGG QKGGSPLGSL LAIPSMFGSL