Gene Amir_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_1040 
Symbol 
ID8325212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp1154158 
End bp1155252 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content77% 
IMG OID644941584 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_003098842 
Protein GI256375182 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAGGC TCGGGAGGGC GCTCTGCGCC GGGCTGCCCG CGCTGCTGCT CGCGGCCTGC 
GGCGGCGCCG CCGGACCGCC GGAGCGCCCC ACCAGGCCGC TCGTCGGCGT GATCCTGCCC
GACACCGAGT CCTCCGCCCG CTGGGAGGAG CAGGACCGGC CGCAGCTCCA GCGCGCCCTG
GAGGCCGAGG GGCTCGAACC GGTCGTCGAG AACGCCCGCA ACGACGAGTT CCGGTTCGCC
AGCATCGCCG ACGACCTGAT CGCCAGGGGC GTGGCGGTCC TGCTGATCAC CCCGCTGACC
CCCGAGGGCG GGGCCACCGT CGAGCACAAG GCGCGCAAGG CAGGCATCCC CGTCATCGAC
TACGACCGGT TCAGCGTCGG CGGGGCCGCC GACTACCTCG TGTCCTTCGA CAACGAGGCC
GTCGGCGAGC TCCAGGCGCG CGGGCTCGTG GACTGCATGG GGGACCGGCG GGGCGCGCGG
GTGATCGAGC TGCAGGGCGC GCCGCAGGAC AACAACGCCA TGCAGTTCGC CGACGGGCAG
CGCCGCGTCC TCGGCCCCCG CTACGAGCGC GGCGACTACC GGCTCGTGGC CAGCACGAGC
GCCGACCGCT GGGACCCGCT GCTCGGGCGG GCCCGGTTCG AGCAGGCGCT CAACGACAGC
GGCGGGCGCG TCGACGGGGT CCTCGCGGCC AACGACCGGC TCGCCGCCGC CGCCATCCAG
GTGCTGCGCG CCAGGGGGCT GGCCGGGAAG GTGCCGGTGA CCGGGCAGGA CGCCACGGTG
GACGGGCTGC GCGCGGTGCT GCGCGGCGAG CAGTGCCTGA CGGTGCACAA GTCCATCCGG
GACGAGGCGG AGGCGGCGGC CCGGCTCGCC TCGGCGCTGG CGGACGGGGA CGTGGCGCGC
GCGGACGCGC TGGCGAGCGC GACCACCGAG GACCCGACGA ACGGGCGCCG GGTGAGGGCG
GTGCTGCTGG GGGCGGTCCC GGTGCACCGG GACGGCGTGC GGGTGCTGGT GGCGTCGGGG
GTGGTGCGCG CCGAGGAGCT GTGCGTGCCG GACCTGGAGC GGACCTGCGC CGAGCTGGGC
ATCGCGCCGA GGTGA
 
Protein sequence
MRRLGRALCA GLPALLLAAC GGAAGPPERP TRPLVGVILP DTESSARWEE QDRPQLQRAL 
EAEGLEPVVE NARNDEFRFA SIADDLIARG VAVLLITPLT PEGGATVEHK ARKAGIPVID
YDRFSVGGAA DYLVSFDNEA VGELQARGLV DCMGDRRGAR VIELQGAPQD NNAMQFADGQ
RRVLGPRYER GDYRLVASTS ADRWDPLLGR ARFEQALNDS GGRVDGVLAA NDRLAAAAIQ
VLRARGLAGK VPVTGQDATV DGLRAVLRGE QCLTVHKSIR DEAEAAARLA SALADGDVAR
ADALASATTE DPTNGRRVRA VLLGAVPVHR DGVRVLVASG VVRAEELCVP DLERTCAELG
IAPR