Gene Amir_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4012 
Symbol 
ID8328205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4693917 
End bp4695188 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content77% 
IMG OID644944484 
Productheat domain containing protein 
Protein accessionYP_003101721 
Protein GI256378061 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTCCG TGCCCACAGC CGATGAGCTC CTGAGCCCCA CGACCGTCCT CGACCTGGCC 
GAGCGCCTGC GCCTGGCCGG GACGCCCTGC CCGCGCGTGG CCGGGGTCGC GGGCGCGCTG
GACGGCGTCG CGCTGGCGGG GCGGACCAGG CTGGTCGCGG ACGCGGTGCT GGCCGACCTG
CCGGAGGACT GGTCGGCGTT CGAGGCGGTG CTGCTGGCGG CGCTGACCGA TCCGGGCTTC
GGCGGCTGGG CGGTGTGGCC GCTGTCCGAG GCGCTGGCGG CGCGCGCCGC GTCGACCGGC
CGCGTCCGGG AGGGCTTGGC CGTGCTGGCG GCCCTGACCG GGCGGCTGAC CGGCGAGTTC
GCGCTGCGCA CGTTCCTGCT GGCCGACCTG GGCACGACCC TGGAGGTGGC GCTGGCGTGG
ACGGCGTCGC CGGACGAGCA CGTGCGGCGG CTGGCCAGCG AGGGCACCCG GCCGTTCCTG
CCGTGGGGCA GACGGGTGCC GGGGCTGACG GCGGAGCCGG GGCGGGCGCT GCCGGTCCTG
GAGGCGCTGC GCGCGGACGA GTCGGAGTAC GTGCGCCGCT CGGTGGCCAA CCACCTGAAC
GACGTGAGCA GGCTGGACCC GGCGCTGGTG GTCGACGTGG CCGGGCGCTG GCTGGCCGCC
CCGGCCCCGA CGACGCCCCG GCTGGTGCGG CACGCGCTGC GCACCCTGGT CAAGCGCGGT
GATCCGGGGG CGCTGGGGCT GCTGGGGTAC GGGGCGGCGG AGGTCGAGGT GGGCGGCCCG
GTGCTGACCA GGGCGGAGGT GCGGTTCGGG GGCGAGTTGG AGTTCACGGC GGAGGTGGTG
AACCGGGGCC GGGAGGCGGC GCGGCTGGCG ATCGACTACG CGGTGCACTA CGTGAAGGCG
GACGGTTCGA GGACGCCGAA GGTGTTCAAG CTGACCACGC GCGTGCTGGA GCCGGGCGAA
CGCGCGCTGC TGACCAAGCG CCACCCGTTC CGCGAGATCA CCACCCGACG GCACCACGCG
GGCACGCACG CGGTGGAGCT CCAGGTCAAC GGCGTCAGGC ACGGGCTGAC CGAGTTCACC
CTGACGGGGC TGCCCGGACC ACGCGCGGTG GTGAGGGGTG CGGCGCCGGG CGCAACGGCG
GAGGCAGCGG CGCCGAGCGC CACACCGGAC ACGACCGGAC CAGGAACGGG ACCGGGTGCG
GGTGCGGGTG CGGGAACGGG CGCGGGCGCG GGTGTGGGTG CGGGGCTGGC TGCGGAGGTG
ACGCGCACCT GA
 
Protein sequence
MGSVPTADEL LSPTTVLDLA ERLRLAGTPC PRVAGVAGAL DGVALAGRTR LVADAVLADL 
PEDWSAFEAV LLAALTDPGF GGWAVWPLSE ALAARAASTG RVREGLAVLA ALTGRLTGEF
ALRTFLLADL GTTLEVALAW TASPDEHVRR LASEGTRPFL PWGRRVPGLT AEPGRALPVL
EALRADESEY VRRSVANHLN DVSRLDPALV VDVAGRWLAA PAPTTPRLVR HALRTLVKRG
DPGALGLLGY GAAEVEVGGP VLTRAEVRFG GELEFTAEVV NRGREAARLA IDYAVHYVKA
DGSRTPKVFK LTTRVLEPGE RALLTKRHPF REITTRRHHA GTHAVELQVN GVRHGLTEFT
LTGLPGPRAV VRGAAPGATA EAAAPSATPD TTGPGTGPGA GAGAGTGAGA GVGAGLAAEV
TRT