Gene Amir_2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_2404 
Symbol 
ID8326593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp2663250 
End bp2664524 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content71% 
IMG OID644942949 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003100190 
Protein GI256376530 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000132449 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTAA CCCGAGCACT GGCCGCCGCA GCGGCGCTCT CCCTGCTCGC GTCCTGCGGA 
GGCCCGGCTG CCCCGCAGGA GCCCACCGGC CCGATCACCC TCACCTGGTG GCACAACGGC
ACCTCCGACC CGATCAAGAC GGTCTGGCAG GACGTCGTCA CCGACTACCA GGGCAAGAAC
CCGGACATCA CGATCGAGGC CCAGCCCATC CAGAACGAGG GCTTCTCCAC CAAGATCCCC
CTGGCGCTCC AGTCGCCCAC CCCGCCGGAC GTCTACCAGC AGTGGGGCGG CGGCGACCTG
GCCTCGCAGG TCACCTCCGG CAAGCTCGCC GACATCACCG ACGCGAGCAA GCCGTGGATC
TCCACCATCG GCGACTTCGC CAAGGGCTGG CAGGTCGACG GCCGCCAGCT CGGCGTCCCG
TTCGCCCAGC ACGTCGTGGG CTTCTGGTAC CGCAAGGACC TGTTCTCCCA GGCCGGGATC
AGCGCCCCGC CCACCACGAT GGCCGAGCTG AACGCCGCCG TCGCCAAGCT CAAGTCCGCC
GGGCTCGCCC CGATCGCCGT CGGCGGCAAG GACCGCTGGC CGGACGCCTT CTACTGGAAC
TACTTCGCCG TCCGCGAGTG CTCGCAGCAG ACCATCGAGT CCTCGGTGAA GAACCTCAAG
CTCGACGACC CGTGCTGGGT CAAGGCGGGC CAGGACCTGG TGGACTTCCT GCGGACCGAG
CCGTTCCAGG AGGGCTTCAA CGGCACCCCC GCCCAGCAGG GCGCGGGCAG CTCGGCGGGC
CTGGTGGCCA ACGGCAAGGC CGCCATGGAG CTGCAGGGCG ACTGGAACCC CGGCACCATG
TCCTCCCTCA CCGAGGACAA GGACCTGGAC AGCAAGGTCG GCTGGTTCCC GTTCCCGACC
GTCCCCGGTG GCCAGGGCGA CCCGGCGGCC GTGCTCGGCG GCGGTGACGG CTTCTCCTGC
ACCACCCGCG CGGCAGCCGC CTGCGCGAAG TTCCTGGAGT ACCTGATCGG CCCGGAGATC
CAGAACAAGC TCGCCGCCGC GGGCACCGGC CTGCCGGTCA ACGAGGCCGC CGTCACCGCG
CTCAAGACCG AGAACCTCAA GACCGTCGCC GAGCACGGCC GCAATGCGCC GTACGTGCAG
ATGTACTTCG ACCGGGCCTT CCCGACCGAC GTGGGCGCCG CGCTGAACGA GGCCGTCGCG
AACCTGTTCG CGGGCCAGGG CTCGCCTCAG GGCATCGTCG ACGCGGTCAA CGAGGCGGCG
GACGGGGCGA AGTGA
 
Protein sequence
MRLTRALAAA AALSLLASCG GPAAPQEPTG PITLTWWHNG TSDPIKTVWQ DVVTDYQGKN 
PDITIEAQPI QNEGFSTKIP LALQSPTPPD VYQQWGGGDL ASQVTSGKLA DITDASKPWI
STIGDFAKGW QVDGRQLGVP FAQHVVGFWY RKDLFSQAGI SAPPTTMAEL NAAVAKLKSA
GLAPIAVGGK DRWPDAFYWN YFAVRECSQQ TIESSVKNLK LDDPCWVKAG QDLVDFLRTE
PFQEGFNGTP AQQGAGSSAG LVANGKAAME LQGDWNPGTM SSLTEDKDLD SKVGWFPFPT
VPGGQGDPAA VLGGGDGFSC TTRAAAACAK FLEYLIGPEI QNKLAAAGTG LPVNEAAVTA
LKTENLKTVA EHGRNAPYVQ MYFDRAFPTD VGAALNEAVA NLFAGQGSPQ GIVDAVNEAA
DGAK