Gene Amir_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4090 
Symbol 
ID8328283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4805992 
End bp4807317 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content77% 
IMG OID644944555 
Productglycosyl transferase family 2 
Protein accessionYP_003101792 
Protein GI256378132 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.157958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGGCC ACGTGCCCGC GCAGCGGTCC CCCCAGGACG AGCCGCCCGA ACCACCCCCC 
GACGGCCCGT CCCGCCCGAT CCCGGTCGGC GCCCTGCCCG GTGCGGCGCT GATCGGCGGC
GCCGCGCTCG CGGACCCGGT GCTGCTCGGC GCCGCGCCCG CGGACCCGGT GCTGCTCGGC
GCCGCGCCCG CGATCCCGGC GCTGCTGGAC GCGGTGCTGC CCGACGACCT GCTGCCCGAC
GACCTGCTGC CGGACGACCT GCTGCCCGCG CCCCTCCCAC TGGACGTGGT GCTGACCGAC
GTGGTGCTCT CGGAGGACCT GCTGCCCGAC GACCCCCTGC CCGACACCCT CCTGCCCGAC
ACCCTCCTGC TGCCCGCCGA ACCGCTCGCC GCCACGCCCC ACCCCGGCCG CCCGAGACCC
GCCCCGCACC CGCTGTCGAC GGTCACCGTC GACCTGGTCA TCCCGGTCTT CAACGAGGAG
CGCGCCCTCC CCGGCTGCGT CGCCACCCTG CACGACTACT GCACCCGGCG GCTGCCGTTC
GACTGGACCA TCACCATCGT CGACAACGCC AGCACCGACA CCACCCGCCA CGTCGCCCAG
GACCTGGCCG GGCACTGGCC GAGGGTGCGC GTCGTGTCGC TCGACCGGCG CGGCAAGGGC
AACGCCGTGC GCACCGCGTG GACCGGCAGC AGCGCGGGCG TGGTCGTCTA CATGGACGTC
GACCTGTCCA CCGGGCTGGA CGCGCTGGTC CCGCTCGTGG CCCCGCTCGC CGTCGGCCAC
TGCGACCTCG CCATCGGCTC GCGGCTCGCG CCGGGCGCCC GCACCGTGCG CGGCGCCCGG
CGCGAACTGC TGTCCAGGGG CTACAACGCC CTCATCAGGC TCACCCACGG CACCCGCTTC
CGGGACACCC AGTGCGGCTT CAAGGCCGCG CGGGCCGAGG TCGTCGGACC GCTGCTGCGC
CGGGTCAGGG ACGACTCCTG GTTCTTCGAC ACCGAGCTGC TGCTGCTCGC CGAGCACAAC
GGGCTGCGCG TGCTGGAGGT CCCGGTCGAC TGGGTGGAGG ACGTCGACAG CCGGGTCGAC
GTCACCGGCA CCATCGCGGG CAACGTGCGC GGCCTGGCCA GGGTCGCCCT GGCCAAGCTC
TCCGGCGCCG CCGCCGTGAC CGACCTGCCG ACCCGACCGG CCCCCGGACC GACCCACCCC
GACGCCGTGC TGCGCGACCG GCCCCGGTCC CGCCGCCCGT GGCTGCGCTG CCCACGACCG
GGCGCCCGGC GGCGGCGCGC GCTGCCCCCG CCCGGCCACC ACCCCGCCAC CCCCGCCTCC
GGCTGA
 
Protein sequence
MDGHVPAQRS PQDEPPEPPP DGPSRPIPVG ALPGAALIGG AALADPVLLG AAPADPVLLG 
AAPAIPALLD AVLPDDLLPD DLLPDDLLPA PLPLDVVLTD VVLSEDLLPD DPLPDTLLPD
TLLLPAEPLA ATPHPGRPRP APHPLSTVTV DLVIPVFNEE RALPGCVATL HDYCTRRLPF
DWTITIVDNA STDTTRHVAQ DLAGHWPRVR VVSLDRRGKG NAVRTAWTGS SAGVVVYMDV
DLSTGLDALV PLVAPLAVGH CDLAIGSRLA PGARTVRGAR RELLSRGYNA LIRLTHGTRF
RDTQCGFKAA RAEVVGPLLR RVRDDSWFFD TELLLLAEHN GLRVLEVPVD WVEDVDSRVD
VTGTIAGNVR GLARVALAKL SGAAAVTDLP TRPAPGPTHP DAVLRDRPRS RRPWLRCPRP
GARRRRALPP PGHHPATPAS G