Gene Amir_3610 details

Gene Information

Locus tag: Amir_3610
Symbol:
ID: 8327800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4223112
End bp: 4225286
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 73%
IMG OID: 644944105
Product: extracellular solute-binding protein family 1
Protein accession: YP_003101345
Protein GI: 256377685
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
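
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4223112, 4225286)
print(length_bp)                  # 2175
print(protein_length(length_bp))  # 724
```

Both values agree with the Gene Length and Protein Length fields listed in the record.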


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATC TCGAGCTGCC CGACCGGCCC AGGGGCCGCA GACCCCGGCG GCGGGTGCTG 
CTCATCGCCC TGTTCGCCGT GGTGACCGCC GTGGTCGCCT CGGTGTCCGC CTGGCTGGCC
GCGGACGTGG GCGAGCGGTG GCGGGCGGGC GACGTCCTCC ACCTGTGCGC GACCGGGGTC
ATGCTGGTGC TCGCCGGGGT GCTGATCACC CTGCTCCCCG GGGCGGTCAA GGAGTTCTGG
CGCTGGTCCG GGGTGTTCGC CGCCGTCGCC GAGCGGGTGA TCAGGGCGCT GCGCCCGCTC
AAGGCCGTCG GCGTGGTCGT CACGGTGCTC GTGCTGGGCG TCGGCGGCTC CCTGCTGGCC
CCGGCTCCCG ACACGGACGC GACCGCGCTG GAGGAGGGCG AGCTGTGGCT CATGGCGGGC
GAGGACCTGA GCCCGAGCGA CCCGCGCGCC GTGCTGCTGG AGCAGTGGAA CCAGGCGCAC
CCCGAGACCC CGGCGAGGCT GGTGGACGCC GCCGGGGCCA CGGACGAGCA GCGGGAGCGG
ATGCTCGACG ACGCCCGCCC CGACGGGCCG CACCGCGCGG ACGTGTACCT GCTGGACGTG
GTGTGGCTGC CGGAGTTCGC GGGCGAGGGG CACCTGCGCG AGCTGGACGG GTCCACGATC
GGCGGCGGCA CGGCCGACTT CCTGCCCGTG GTGCTCGACA CCTGCCAGGA CGGCGGGAAG
CTGTGGGCGC TGCCGCTCAA CACCGACGTC CGGCTGCTCT ACCACCGCAC CGACGTCCCC
GGCCTGGTCG TGCCGACCAG CTGGGACGGC CACTCCGGCT CCGCCGCGAC CAGGGCGCAC
GGCGGGCTGG AGGCGGCCGA CGCGCCGCAG CTGGCCGAGG AGATGCTCAC CGTGGAGGCG
CTGGAGGCGA TCTGGGCGGC GGGCGGCAAC GTGGTCACCC GTGACGGGGA GGTCACCCTG
ACCCCGGACG GGAGCAGGGT GCAGTTCACC GAGGACGACT ACGAGGGGAT GCGCAAGCTC
AACGCGATGG CCAGGACCGA GGGCGTGGTG CCGCGGGGCG TGCGCCCCGA GGAGACCGGG
GCCGAGGACG CGCTCACCCT GTTCGCGGAC GGGCGGACCG CGTTCCTGCG GAACTGGGCC
GTCACCCACC GGCAGCTCAC GGGCAGGCAG GACAGGCCGA GCACGACGCA CGTCGGGTTC
GACAGCGCCG TGCAGCCCGC GCCGAGCGTG CTGGGCGGGC AGGACCTGGC GATCTCCCGG
CACACCGCGA AGCCGAAGGC GGCGAAGGCG CTGCTGGAGT TCCTGACCAG CACGCAGAGC
CAGCAGATCC TGGCCGAGGT CGGCGGGTTC GCGCCGAGCC GGACGTCGGT GTACGAGCTG
TCGGACAGCA GGCACCTGGC GAACGTGCGC ACCGCGCTCA AGGACGCCAG GCCCCGGCCG
AGGACCGAGC GGTACACCGA GTTCAGCAAG CTCTTCCGGG AGGCGGTCGA GAAGGTGGTG
GCGCCGGAGG ACTCGATCCC GGACGAGACC GCGCGGAAGC TGGCGGACGT GCTGCGCGGG
CGATCCGGCG CGGGTGATCA GACGCGGGTG GGGCGGGGGT TTCGCGGCGG GCGGGAACGG
TGTTTCGAAG GGGTTCGAAA GTCCGCTCCG ACGGGCGGTG GGCGCGCCAG GCTGAGGGCC
GACGCACCGA CCGAGACCGT TGGAGTGATG ATGCGAAGGA CCTTGACCGC GTTGTTCGTC
GCCGTCGTGG CGCTGCTGTC GTCGCTGGTG TCCGCGAGCG CGGAACCCGG CTCGGTGGAC
AGATCCGCCA CGCGGGTGGA GGCGCCGGTG ACCGTGGCCG CACCAGTTCA GCAGGACGTG
TCGGCGCAGG CGCTGTCGTG CACGGCGGGC GACCTGTGCG CGTGGAACGG CGTCGGGGGC
GGGAGCCGTT GCAGCTGGAC CAACAGGGAC AACGACTGGT GGTACGCCCC GACCACCTGC
TCGTGGTCGT CGGGCAGCGC GGTGTGGTCC GTCTACAACA ACGGGCGCAA CACCGCCTAC
GACAGGGTGT GCCTCTACCC CGAGGCGAAC TACGGGGGCA GCACCGCGTA CTACGTCCTG
CGGGGGCAGC AGGCCGAGGG GTGGCCTGGC GTGATCATCC GCTCCCACAG GTGGGTCAAC
GGGTCCTGCT GGTGA
 
Protein sequence
MADLELPDRP RGRRPRRRVL LIALFAVVTA VVASVSAWLA ADVGERWRAG DVLHLCATGV 
MLVLAGVLIT LLPGAVKEFW RWSGVFAAVA ERVIRALRPL KAVGVVVTVL VLGVGGSLLA
PAPDTDATAL EEGELWLMAG EDLSPSDPRA VLLEQWNQAH PETPARLVDA AGATDEQRER
MLDDARPDGP HRADVYLLDV VWLPEFAGEG HLRELDGSTI GGGTADFLPV VLDTCQDGGK
LWALPLNTDV RLLYHRTDVP GLVVPTSWDG HSGSAATRAH GGLEAADAPQ LAEEMLTVEA
LEAIWAAGGN VVTRDGEVTL TPDGSRVQFT EDDYEGMRKL NAMARTEGVV PRGVRPEETG
AEDALTLFAD GRTAFLRNWA VTHRQLTGRQ DRPSTTHVGF DSAVQPAPSV LGGQDLAISR
HTAKPKAAKA LLEFLTSTQS QQILAEVGGF APSRTSVYEL SDSRHLANVR TALKDARPRP
RTERYTEFSK LFREAVEKVV APEDSIPDET ARKLADVLRG RSGAGDQTRV GRGFRGGRER
CFEGVRKSAP TGGGRARLRA DAPTETVGVM MRRTLTALFV AVVALLSSLV SASAEPGSVD
RSATRVEAPV TVAAPVQQDV SAQALSCTAG DLCAWNGVGG GSRCSWTNRD NDWWYAPTTC
SWSSGSAVWS VYNNGRNTAY DRVCLYPEAN YGGSTAYYVL RGQQAEGWPG VIIRSHRWVN
GSCW
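
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that step, assuming the codon-to-residue assignments of NCBI translation table 11, which match the standard code. The `translate` helper is hypothetical. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Residues for the 64 codons enumerated in TCAG order (third base varies
# fastest). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 bases of the gene sequence listed above.
print(translate("ATGGCTGATC TCGAGCTGCC CGACCGGCCC"))  # MADLELPDRP
print(translate("GGGTCCTGCT GGTGA"))                  # GSCW
```

The two excerpts reproduce the first ten and last four residues of the listed protein sequence.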