Gene Amir_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3864 
Symbol 
ID8328056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4529109 
End bp4530809 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content75% 
IMG OID644944351 
Productprotein of unknown function DUF181 
Protein accessionYP_003101589 
Protein GI256377929 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000134117 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACG TCGAGGTCGT GGAGCTGACC GGGTACGCGC TGGGCGAGCT GGAGCGGCTC 
GGGGTGGACG GGCCGGTGCT GCCGGTGCGG TTCGACGGCG CGCTGGTGCT CGTGGGGCCG
GTGCTGGACG GGAGCGGGGT GTGCCTGCGG TGCGCCGAGG ACGCCCGGCT CGCGGCGCTG
GGGGCGGTGG TGCCGCGTGC GGACCCGGCG ATGCGGGTGG GTGGTCTGGT CGTCCCCGCG
CTGCGGCCGG TGCTCGACGC GCTGGTGGAG CGCGTGCTGG CCGATCCCGG CGCCCATCGT
GATCGGGTGC TGGCCCTGCG GTCGGACCTG GGCGCGGTTG GCGAGCACCG GGTCCGGCCG
AGGCCCGAGG GGTGTGCGAG GTGCGGGCCG CTGCCCGAGG ACTCGGCGGA GGCCGCGAGC
GTGGTGCGCG CGCCCGTGCC GGTGGAGCCG GGGTCGTTGC GCGGTGAGAA CGCTGCGACC
GCAGGGGATT CGGTGCGCCG CGAGCTGTTC GACCTGCGGC ACGGGCCGGT GGGTGGGCTG
CACCGGATCG GGGACCTGGT CGTGGCGGCG GTGAGCGCCG AGCTGGTGGG CGGGCAGGCC
GGGTTCGGGC GCACGGGCGA TTACGAGCAC GCCGAGCGGG TGGCGCTGTT CGAGGCGGTG
GAGCGGCACG CCGGTCTGCG GCCGAGGCGG GTCACCACGG TGGTGGAGGC CTCGTTCGCC
GAGCTGGGGC CCGACCGGGC GCTCGACCCG GTGCGGTTGG GGTTGCCCGA CCTGGAGTCG
CCGCACGTGA CGCCGTACCG GCCGGACGTG CGGATCCGCT GGGTGCACGG GTGGTCCTAC
ACGCGGGGGC GGGCCGTGGC GGTGCCCGAG CACGTCGCCT ACTGGGGGCG GGCGATCGGG
CCGAGGTTCG TGGACGAGAC GTCCAACGGG TGCGGCACGG GCAACAGCCT CACGGAGGCG
GTGCTGCACG GGCTGTTCGA GGTCGCCGAG CGGGACGCGT TCCTGACCGC CTGGTACGGG
CGGGTGCCGT TGCCGTCGCT GCGCTCGGAC GACGGGCTGA CCGCGCACGT GGCGGACCGG
TTGGAGCAGG TGGGGTATCG GCTGGAGCTG TACGACGCGA CGAACGACCT CGGGGTGCCG
TCGGTGCTGT CGTTGGCCCG GCGGGTGGAG GGGCGGGGTG GGTTCCCGTG CGCGTTCTAC
GCGGCGGGGG CGGGGTTGGA CGTGGAGGCG GCGGTGCGGG CCGCGGCGGC CGAGGTGGTG
ATGGACGTGG AGGCGGGGGC CAAGCGGTAC CGGAGCGAGC CTGGGGACTA CGAGCTGGAG
CGGTTGCGGC GGATGCTCCG CGAGCCCCGG CTGGTGCGGA CGATGGACGA CCACGTGAAC
GTCAACGCGT TGCCGGAAGC GTTGGGGCGG CACGACTTCC TGGTTCCGGG GCCGGGGCGG
GAGCTGGTGG CGCCGGACGT GCCGAGCGGT GATCTCGACG CGCTGCTGGA GCACTACGTG
CGGCGGTGGG AGGCGTTGGG GCTGGAGGTG ATCGCGGTGG ACCAGAGCGA TCCGGTGGTG
CGGGAGCGGT TGGGGTTGTG CTCGGCGAAG GTGATCGTGC CGGGGGCGGT GCCGATGACG
TTCGGGGAGG TGAACCGGCG CACGGGTGGC ATACCCCGGC TGCGCCTGTC CGGTCCCCCG
CTGCCGCACC CGTTCCCGTG A
 
Protein sequence
MSDVEVVELT GYALGELERL GVDGPVLPVR FDGALVLVGP VLDGSGVCLR CAEDARLAAL 
GAVVPRADPA MRVGGLVVPA LRPVLDALVE RVLADPGAHR DRVLALRSDL GAVGEHRVRP
RPEGCARCGP LPEDSAEAAS VVRAPVPVEP GSLRGENAAT AGDSVRRELF DLRHGPVGGL
HRIGDLVVAA VSAELVGGQA GFGRTGDYEH AERVALFEAV ERHAGLRPRR VTTVVEASFA
ELGPDRALDP VRLGLPDLES PHVTPYRPDV RIRWVHGWSY TRGRAVAVPE HVAYWGRAIG
PRFVDETSNG CGTGNSLTEA VLHGLFEVAE RDAFLTAWYG RVPLPSLRSD DGLTAHVADR
LEQVGYRLEL YDATNDLGVP SVLSLARRVE GRGGFPCAFY AAGAGLDVEA AVRAAAAEVV
MDVEAGAKRY RSEPGDYELE RLRRMLREPR LVRTMDDHVN VNALPEALGR HDFLVPGPGR
ELVAPDVPSG DLDALLEHYV RRWEALGLEV IAVDQSDPVV RERLGLCSAK VIVPGAVPMT
FGEVNRRTGG IPRLRLSGPP LPHPFP