Gene Amir_2012 details

Gene Information

Locus tag: Amir_2012
Symbol:
ID: 8326197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 2230171
End bp: 2231715
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 73%
IMG OID: 644942561
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003099806
Protein GI: 256376146
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
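
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both assumptions are consistent with the reported values but are not stated in the record). The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 2230171
END_BP = 2231715


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1545 514"
```

Both results agree with the Gene Length (1545 bp) and Protein Length (514 aa) fields.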


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGACGTG CAGGGATCGG CGTGAGAGCG GTAGCGGGTT CGGTGCTCCT GGCACTGGCC 
CCGCTGGGGG CACCGGTGGC GCAGGCCCAG GAGGTGACCA TCACCCAGGC CGCACCGGGG
ACCGCCATCG AGGGCAGCTA CCTGGTCGTG CTGGAGGACG GCGCGTCCGA GACCGCGCAG
GCCCTCACCG AGGAGCACGG CGGCCAGGTC ACCTCGACCT GGAAGTCCGC GCTGAAGGGC
TTCGCCGTCA AGGCGTCCGA GGACGAGGCC AAGCGCCTGG CCGCGGACCC GGCCGTGCGC
GCCGTCGCGC AGAACGCCGC GTTCACCGCC GCCGACGTGC GGTACAACCC GCCGTCGTGG
GGCATCGACC GGATCGACCA GCCCGGTCTG CCGCTGGACC AGGCGGTCCA CACCGACAGC
AGCGCGGCGA CCGTGCACGC CTACGTCATC GACAGCGGCA TCCGCACCAC CCACCGCACC
TTCGAGGGCC GCGCCAGCTG GGGCGTCGAC CTGGTCGACG GCAGCAGGCA GGACTGCGCC
GGTCACGGCA CGCACGTCGC GGGCACCATC GGCGGCAGGG AGTTCGGCGT CGCGAAGGAC
GTCAAGCTGG TCGCCGTCCG CGTCCTGGAC TGCGACGGCG GCGGCACCCT GGAGGGCGTC
GTCGGCGGCG TCGACTGGGT GACCGCGAAC GCCGTGAAGC CCGCCGTGGT CAACATGAGC
CTGTCGTCGC AGGCCCCCGG CGGCCCCACC CTCGTGGACG ACGCGATCCG CAACTCCATC
AACTCCGGCC TGACCTACGC GCTGTCGGCG GGCAACCGCA ACGCCGACTC CTGCACCTAC
AGCCCGGCCA GGGTCACCGA GGCGATCACG GTCGGCGCCT CCACCAGGAC CGACGCGCGC
GCCGGCTTCT CCAACCACGG CCCCTGCGTC GACCTGTTCG CGCCGGGCGA GGGCATCAAC
TCGGCCGACG CCGCCAACGA CACCGGCTCG TTCGACGCCG ACGGCACGTC CATGGCGGCC
CCGCACGTGG CGGGCGCCGC AGCCCTGGTC CTCGGCCGCA CCCCGAACGC CACCCCGGCG
CAGGTCCAGG ACGCCCTGAA GACCTCCGGC GTCGCCAACG TCATCACCAA CCCCGGCACG
GGCTCCCCGC GCACCCTCCT GCAGACCCGC CCGGCCCGGC AGGACGTCAA GCCGCTGATC
CGCTACTGGC GCTCCCCGGA CCACTACAGC TCCGCCACGG GCATCGGCCG CGCCGGCTAC
ACCGCCGAGG GCTCCCTGGG CGGCCTGCGC ACCACCCCGT TCGACGGCGG TCGCGCCGTC
TACCGCTGCA CCTACGCGGG CTGGGACAGC TTCACCTCGA TCCAGCCGGA CTGCGAGGGC
CACGCGAACG AGGGCGTGCA GGGCTACGCC CACACCACCG CCCAGCCGAA CACCCACCCG
CTGTACCGCT GCTTCGTCCC GGCCACCGGC GACCACATGG ACAGCACGGA CCCGAACTGC
GAGGGCCAGG TGACGGAGGG CCTGACCGGC CACGTCCTGA ACTGA
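
The record reports a GC content of 73% for this gene. A minimal sketch of how such a figure can be computed from a sequence printed in blocks of ten is shown below; it assumes the spaces and line breaks are display formatting only, and `gc_percent` is a hypothetical helper name. The usage line runs on just the first ten bases of the gene, not on the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First ten bases of the gene sequence: 6 G + 1 C out of 10.
print(gc_percent("GTGGGACGTG"))  # prints 70.0
```

Pasting the whole gene sequence block into `gc_percent` gives the value for the full gene.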
 
Protein sequence
MGRAGIGVRA VAGSVLLALA PLGAPVAQAQ EVTITQAAPG TAIEGSYLVV LEDGASETAQ 
ALTEEHGGQV TSTWKSALKG FAVKASEDEA KRLAADPAVR AVAQNAAFTA ADVRYNPPSW
GIDRIDQPGL PLDQAVHTDS SAATVHAYVI DSGIRTTHRT FEGRASWGVD LVDGSRQDCA
GHGTHVAGTI GGREFGVAKD VKLVAVRVLD CDGGGTLEGV VGGVDWVTAN AVKPAVVNMS
LSSQAPGGPT LVDDAIRNSI NSGLTYALSA GNRNADSCTY SPARVTEAIT VGASTRTDAR
AGFSNHGPCV DLFAPGEGIN SADAANDTGS FDADGTSMAA PHVAGAAALV LGRTPNATPA
QVQDALKTSG VANVITNPGT GSPRTLLQTR PARQDVKPLI RYWRSPDHYS SATGIGRAGY
TAEGSLGGLR TTPFDGGRAV YRCTYAGWDS FTSIQPDCEG HANEGVQGYA HTTAQPNTHP
LYRCFVPATG DHMDSTDPNC EGQVTEGLTG HVLN
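
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below is a small stand-alone translator written for illustration (it is not the tool that produced this record). It builds the codon table from the usual TCAG ordering, and the start codon set is the one listed for NCBI table 11. `translate_cds` is a hypothetical helper name.

```python
# Codon table in TCAG order; table 11 uses the same codon-to-residue
# assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, first_is_start: bool = True) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence (60 bases, 20 codons).
FIRST_LINE = "GTGGGACGTG CAGGGATCGG CGTGAGAGCG GTAGCGGGTT CGGTGCTCCT GGCACTGGCC"
print(translate_cds(FIRST_LINE))  # prints "MGRAGIGVRAVAGSVLLALA"
```

The output matches the first twenty residues of the protein sequence above. Without the start codon rule, the leading GTG would be read as V rather than M.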