Gene Amir_4609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4609 
Symbol 
ID8328807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5490270 
End bp5491553 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content71% 
IMG OID644945056 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003102288 
Protein GI256378628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCCCACT CCATCAGCAG GCGCACCCTG CTGCGCGCCG CGATCGGTGG CGGTGTCGCC 
GCGCCGTTCC TGGCGGGGTG CGGCGCGTTG ACGCCCGCGT CCGGCAAACC CGGCGCCCTC
TCGGTGCACA CCCAGCTCAG CGGCGCCGTC GCGGGCGCCA AGGTGTTCGC CGACGCGGTC
GCCGCCTACG AGCGGCGGAC CGGTCGTCCG GTCGCGCTGC TCAAGAACGG CTCCGACCTG
CCCATCGTCT TCGAGACCAG CTCGCTCGCC GGTGCGGAGG CCGACGTCGC CCTGGTGAAC
CTGCAGGGCC GCACGCTCTC CTGGACCGGC CTGGGGGCGA CGATCCCGCT CACCGGGCTG
CTCGACGAGT GGGGGCTGCG CGACAAGATC ATCCCCGAGG CGCTGGCCGA GTGGACCGAC
GGGGACGGCA ACCTGCGCGC GTTCCCGTTC ACCCGCACCA ACTGGCCCGT CTCGTACAAC
ACCAGGCTGC TGGAGCAGGC GGGCGTCCAG ATCCCGACGA CCTCCGACGA GCTGATCGCC
GTGGCGCAGG CGTTGCGCGC CAAGGGGATC GGGCCGGTGA CGGTCGGCGG GTCCGACTGG
AGCGGGCAGA AGATGTTCCT CCAGGTCATC CAGGGCTTCC TCACCCCCGA CGAGGCGAAG
GAGGTGTTCG GCAGCGGCAA GCTCTCCGAG AGCCCCGCCG CCATCGCCGG GGTGGAGCAC
TTCGTGGAGC TGCGGGACGC GGGCGTCTTC GTCGACGACG TGCAGGGCTA CACCTCCGAC
TCGGAGCTGA CCCAGTTCAA CACCGGCAAG GCGGCGATCG TGCCCGCCAT GTCGTCGGCG
CTGGCGAAGG TGCCCGCCGA GCGGGCGAAG GAGGTCGTCG TCGGCGGCTG GCCCAAGCCC
TCGCGCGGCG GCGTGCTGGA GCACCCGAGC GTGATCCGCA GCTTCAACGG CCACGGCATC
TGGATCAGCC GCAGGGGCGC GGAGAAGCTC GACCTGATCA AGCCGTTCGT GCAGGACCTG
TACAGCGACG AGGTGATCGA CTCGATGATC CTGGGCTCCG GCCGGGACAT GAGCCGGATC
ACGGACACGG TCAGCGAGGA CTTCCCGCTC GTCGCGCAGG CCTCCCGCCT CACCGACCAG
CAGGTCTCCC CCGTCATGCT GCCCGACCTG GTCATCCCCC AGTCAGCGTT CGAGCCGATG
ATCCAGGCCA CGGCAGCCGC CTACGGCCCG ATCCCCGCCG AACGGATCAT CGAGGTCTTC
GAACGGGCCT ACGCCACGGT GTGA
 
Protein sequence
MSHSISRRTL LRAAIGGGVA APFLAGCGAL TPASGKPGAL SVHTQLSGAV AGAKVFADAV 
AAYERRTGRP VALLKNGSDL PIVFETSSLA GAEADVALVN LQGRTLSWTG LGATIPLTGL
LDEWGLRDKI IPEALAEWTD GDGNLRAFPF TRTNWPVSYN TRLLEQAGVQ IPTTSDELIA
VAQALRAKGI GPVTVGGSDW SGQKMFLQVI QGFLTPDEAK EVFGSGKLSE SPAAIAGVEH
FVELRDAGVF VDDVQGYTSD SELTQFNTGK AAIVPAMSSA LAKVPAERAK EVVVGGWPKP
SRGGVLEHPS VIRSFNGHGI WISRRGAEKL DLIKPFVQDL YSDEVIDSMI LGSGRDMSRI
TDTVSEDFPL VAQASRLTDQ QVSPVMLPDL VIPQSAFEPM IQATAAAYGP IPAERIIEVF
ERAYATV