Gene Amir_5051 details

Gene Information

Locus tag: Amir_5051
Symbol:
ID: 8329249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 6010206
End bp: 6011885
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 71%
IMG OID: 644945487
Product: transposase IS4 family protein
Protein accession: YP_003102719
Protein GI: 256379059
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
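
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(6010206, 6011885)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both results agree with the Gene Length and Protein Length fields of the record.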


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGGCCG TCGTCGTATT CAAGGGTGGG TTCGTGTCGT TGCGTGGGGT GGAGTTGGCG 
GAGATCCCGG AGGAGACGGC GCGGGTGGCG CGTGCGGTGT TCCCGAGGGG CTGTCTGGCG
ATGCGGATGT GTGACGTGCT CGGGCCGGTC TTCGCCGACG CGGACTTCGC GGAGTTGTTC
GCGGTGCGGG GGCGTCCGGC GGTGTCGCCA GCGCGGTTGG CGCTGGTGTC GGTGTTGCAG
TTCGCGGAGG GCCTGACCGA CCGGCAGGCC GCGCACGCGG TCCGCTCGCG CCTGGACTGG
AAGTACGCCC TGTCGCTGGA GTCGACCGAT ACCGGTTTCG ATTTCTCGGT GCTCAGCGAG
TTCCGTGCCC GGCTCGCCGA GGCGGACGCG GGCCGCGCGG TGTTCGACGC GGTGCTGCGG
GCTGCGGGGG AGGCCGGGCT GGTCAAGCCG GGCGGACGGC GGCGAACGGA CGCCACCCGC
GTGCTGGCCC CGACCAGGGA CCTGAACAGG CTTGAGTTCG TGGTCGAGAC ACTGCGCACG
GCACTGGACC AGGTTGCCGA GGTGGCCGGG GACTGGCTGG TGACGGTGGC CGCGCCGGAG
TGGTTCGACC GCTACTCGGC CCGGCCGGAG GACAGCCGTT TCCCGTCCCG GTGGGCCGCG
CGCGTCGAGC ACGGCGACCA GTGCGGAGCC GACGGCATGA CGCTGCTCGA AGCCGCCTGC
TCGACGCAAG CGCCTCCCGG CCTGTGGAAT CTGCCCGCGG TGGAGTTACT GCGCCGGACA
TGGGTGCAGC AGTTCCAGCA CGTCGAAGGC GTCGTGCTCT GGAGGCACCC GAAGGACGCC
CCGCCCGGCC TGATCCGCTT ACGCACCCCG CACGAACCCG AGGCCAGGAC CGGGGCGAAG
CGGGATCTGG CCTGGTCGGG CTACAAGGTC CACCTCGGCG AGACCTGCGA GCCCGACGCC
CCGCACCTGA TCACCCACAT CCACACCACC CCGGCACCGG TCAACGACAA CGCCGTCCTG
GAAGACGTCC ACACCGCCCT GGCCGAGCGT GAACTCCTAC CGGACGAGCA CCTGGTGGAC
GCCGGATACA TCGACGCCGA GCAGATCCAC CACGCCCGCC GCGACCACGA CATCGACCTT
GTCGGGCCGG TCGGGCAGAA CACCAACCGG GAACAGATGA CCGACCACTT CTTCGACAAC
ACGCACTTCG CCGTCGACTG GAACCGGCGT CAGGCGGTCC GCCCCGGCGG CCACACCAGC
GTCCAGTGGC GGGATGCCCA CGGCAACCGC GGCACGCCGG TCACCCGCGT CCGCTTCGCG
CGACGACACT GCGGCCCCTG CGAACTGCGC ACCTCCTGCA CCAATGCCAG GACCGGCCGC
AACCTGACCC TGCGGCCCAG AGCCGAGCAC GACATCCTCC AGCAGGCCCG CGTCGAGCAG
GACACCGACC ACTGGCGTCG CCGCTACGGA CACCGTGCCG GCGTCGAGGG CGCCATCTCG
CAGGGCGTCC AGGCGTTCGG CCTGCGCAGA TCCCGCTACC GCGGTCTCGC CAAGACCCGC
CTGCAACACC ACCTCACCGG TGCCGCGATC AACCTCGCCC GCCTCGACGC CTGGCACACC
GGCAGACCAC TCGCCCGCAC CCGCGTCTCC CCCTTCGCAG CACTCTGCCC CGCTGGATGA
 
Protein sequence
MQAVVVFKGG FVSLRGVELA EIPEETARVA RAVFPRGCLA MRMCDVLGPV FADADFAELF 
AVRGRPAVSP ARLALVSVLQ FAEGLTDRQA AHAVRSRLDW KYALSLESTD TGFDFSVLSE
FRARLAEADA GRAVFDAVLR AAGEAGLVKP GGRRRTDATR VLAPTRDLNR LEFVVETLRT
ALDQVAEVAG DWLVTVAAPE WFDRYSARPE DSRFPSRWAA RVEHGDQCGA DGMTLLEAAC
STQAPPGLWN LPAVELLRRT WVQQFQHVEG VVLWRHPKDA PPGLIRLRTP HEPEARTGAK
RDLAWSGYKV HLGETCEPDA PHLITHIHTT PAPVNDNAVL EDVHTALAER ELLPDEHLVD
AGYIDAEQIH HARRDHDIDL VGPVGQNTNR EQMTDHFFDN THFAVDWNRR QAVRPGGHTS
VQWRDAHGNR GTPVTRVRFA RRHCGPCELR TSCTNARTGR NLTLRPRAEH DILQQARVEQ
DTDHWRRRYG HRAGVEGAIS QGVQAFGLRR SRYRGLAKTR LQHHLTGAAI NLARLDAWHT
GRPLARTRVS PFAALCPAG
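
The relationship between the gene sequence and the protein sequence can be reproduced in a few lines. The sketch below is illustrative only: it assumes the sequence blocks are pasted as whitespace-grouped text, uses the standard codon assignments shared by translation table 11, and treats an ATG, GTG, or TTG first codon as methionine (a simplification of the table 11 start-codon set). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS, reading the initiator codon as M and stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGCAGGCCG TCGTCGTATT CAAGGGTGGG TTCGTGTCGT TGCGTGGGGT GGAGTTGGCG"
print(translate(first_line))  # MQAVVVFKGGFVSLRGVELA
```

The gene begins with GTG rather than ATG, which is why the start-codon handling is needed for the translated output to begin with M, as the protein sequence in this record does.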