Gene Amir_4360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4360 
Symbol 
ID8328557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5152509 
End bp5153840 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content80% 
IMG OID644944824 
Productglycosyl transferase family 28 
Protein accessionYP_003102057 
Protein GI256378397 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000448703 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGTCC TGTTCACGCC GCTCCCGGTG GACCCGAGCC TGCGCGGCCT GGCCCCGCTG 
GCCTGGGCGC TGCGCGCCGC CGGTCACGAG GTGCGCGTCG CGGTCCGCCC CTCGGGCGTG
GAGGCGGTCA CCGGTGCGGG CCTCACCGCC GTCCCGGTCG GCGACCGCGG TGACGCGCCC
ACCCCGATCC CCGCGCAGGG CGGCAGGCCC GACGGCGGCG ACCGCCTGCT GCACCTGGAC
GACGCGCTGC TCGACGACCT GCTCGACTTC GCCGAGACCT GGCGACCGGA CCTGCTGGTG
TGGGACCAGG CCGTGCTCGC GGGCCCGGTG GTGGCGAAGC TCCTCGGCGT CCCGCACGTG
CGCGTCCCGC ACGCCGTCGA CGTCGTCGGC CTGCACCGCG CGGGCCTGGC CCCCGACCAC
CCGGACGGCC TGACCGCCAA GCTCGGCGCG GCCCTGGCCC GCAGGGGCGC GGTGTTCACC
GAGGACGTCG CGGTCGGCGA CCTCACCGCC GACCAGCTCC CACCGGGCAC CCGCCCACCC
GTCGACCTGG ACCACCTGCC ACTGCGCCCC ACCCCGCACG ACGGCCCGGC CGAACTCCCG
GACCACCTGC GCGAACAGCC GGCGGGCCCG CGCGCCCACC CCGCCCCGCG CGACGATCCG
GCCGAACCGC CCGCCGCCCC GCGCGCCCAC CCCGCCCCGC GCGCCCACCC CGCCCCGCAC
GACGGCCCAG CCGAACCGCC ACCCCGCCCC CGCCCCCGCG TCTGCCTGGC CCTCGCCCGC
CCCGACCGCG AAGGCGGCCT CACCCCCGCC GAGCGCGCCA CCGCCGAGGT CCTCGTCCGG
GGCGCCACGC GGCTGGACGT CGAGCTGATC GCCCTGCTGC CCACCGGATC GCCCCCGCTC
CCGCCACCCG CCACCACCCA CCCGCACGAC CACGAGCCCC CGCGGGAACT CCTCCGCGCC
TGCGCCGCGA TCGTCCACCG CGGCGACCCC GCCACCACCT CCGCCGCCAC CGCCGCGGGC
CTGCCCCAGC TCGTGGCCCC CGGCGGCGCC TGGGACGAGC CGAACCTCGC CGCCCTGCTC
GCCGACCGGG GCGTCGCCCT CGTCCTGGAC CGCACCCACC TCACCGAGGA CGCCGTGGCC
GAGCACCTCC TGCGCCTGCT CGACGAACCC GCGTTCACCG ACCGCGCCGA GGCCCTGCGC
GCGGACGTCC TCGCCCAGCC GACCCCGCAC GACGCCGTGC ACCGCCTGGA GGACCTGGTC
GCCGAGCGCA CCGGACGGGT GGTCCCGTGG GCGCGCGGCG CGAGGCCCGG TCGAACGGCG
GCCCGGCGCT GA
 
Protein sequence
MRVLFTPLPV DPSLRGLAPL AWALRAAGHE VRVAVRPSGV EAVTGAGLTA VPVGDRGDAP 
TPIPAQGGRP DGGDRLLHLD DALLDDLLDF AETWRPDLLV WDQAVLAGPV VAKLLGVPHV
RVPHAVDVVG LHRAGLAPDH PDGLTAKLGA ALARRGAVFT EDVAVGDLTA DQLPPGTRPP
VDLDHLPLRP TPHDGPAELP DHLREQPAGP RAHPAPRDDP AEPPAAPRAH PAPRAHPAPH
DGPAEPPPRP RPRVCLALAR PDREGGLTPA ERATAEVLVR GATRLDVELI ALLPTGSPPL
PPPATTHPHD HEPPRELLRA CAAIVHRGDP ATTSAATAAG LPQLVAPGGA WDEPNLAALL
ADRGVALVLD RTHLTEDAVA EHLLRLLDEP AFTDRAEALR ADVLAQPTPH DAVHRLEDLV
AERTGRVVPW ARGARPGRTA ARR