Gene Sare_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0543 
Symbol 
ID5705643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp614364 
End bp615866 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content68% 
IMG OID641270069 
Productpeptidase M24 
Protein accessionYP_001535463 
Protein GI159036210 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00271711 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAGCG AGAAGCAGGC GAAGGCCAGC CACCGCGGCA TTCCCAGCAG CAACGAGTTC 
CGTGAATTCA TCGCTTCGGG TTGGGCCACC ACCGACCGGG AGCCCGCGCC CCGCGCCGAG
GTGGCCGATC ACGCCGCCCG TCGCCGGGCG CTGCTGTCGC AGCGGTTTCC TGGCGAGCGG
CTTGTCATCC CGAGTGGTGG CCTCAAGGTG CGTAGCAACG ACACCGACTA CACCTTCCGC
CCGCACACCG CCTTCGCGCA CCTGTCGGGT CTCGGCGCGC ACGCCGACCC GGACAGCGTC
CTGGTGCTCG ATCCAAAGCC GGGCGGCGGT CACCACGCCA TCCTCTACTT CCGCCCCCTG
GCTCCGCGCG ACAGCCAGGA GTTCTACGCC GACTCCCGGC ATGGCGAGTT CTGGGTTGGG
CCGCGTCCGA CCATCGCCGA TCGGGAGTCG GAGTTGGGGA TCACCTCCCG TCATGTCGAC
TCGCTCGCCG ACGACCTTGC CAAGGATGAC GCGGCTGCTC GGATGCGGGT GGTTCGTGCG
GCCGACCCGC AGCTGACCGA GCTGGTCGAC GGCGTTCGCA GGGAGGCGGG TGCGACCGGG
TCGGCATCGG AGCTGCAGGA GGCGGACGCG GAGTTCGCCC GTCACCTGTC CGCGATGCGG
CTGGTCAAGG ATGAGTGGGA GGTGGCTGAG ATGCGTAGGG CCGTCGCCGC CACCCACAAG
GGCTTCGACG CGATGATCGC CTCGCTGCCT GAGGCGGTCC GCAAGGGTCG CGGTGAGCGC
TGGGTGGAGG GCGTCTTCGG CCTGTACTCC CGGCATGAGG GCAACGGTGT CGGCTATGAG
TCGATCTGTG CCTCCGGTGA CCACGCCAAC ACGATCCACT GGACGAAGAA CACCGGCGAG
GTACGCGAGG GCGATCTGAT CCTGATCGAC TGCGGAATCG AGATCGACTC CCTGTTCACC
GCCGACATCA CCCGTACCCT GCCGGTCACC GGTCGCTTCA CCGACGTGCA GCGCCGCATC
TACGACGCGG TGTACGAGGC GCAGCAGGCC GGCTTGGCCG CCGTGAAGCC GGGAAACAGG
TTCAGCGACA TTCATGCCGC GGCGAACGCC GTGATCGCCC GGATCCTGCA CGAGTGGGGA
CTGCTGCCCG AGGGCGTCAC CCTGGAGCAG ACCCTGGACC GGGAGAACGG GGGCTGGCAT
CGTCGCTGGA TGGTTCACGG CACCTCGCAC CACCTGGGCA TGGACGTGCA CGACTGTCAG
CTGATGCTGC GTGAGGACTA CCTGGACACT GAGTTGCGGC CGGGCATGGT TCTCACCGTC
GAGCCGGGGC TCTACTTCAA GTCCGACGAC CTGCTGGTGC CGGAGGAGTT CCGCGGCATC
GGCGTGCGTA TCGAGGACGA CGTGCTGGTG ACCGAGGATG GTTGCGAAAA CCTGTCGGCG
GCGATGCCGC GTACCTCCCA CGAGGTGGAG GAGTGGATCG CGCGGGTGTG GGCGAACGCC
TGA
 
Protein sequence
MTSEKQAKAS HRGIPSSNEF REFIASGWAT TDREPAPRAE VADHAARRRA LLSQRFPGER 
LVIPSGGLKV RSNDTDYTFR PHTAFAHLSG LGAHADPDSV LVLDPKPGGG HHAILYFRPL
APRDSQEFYA DSRHGEFWVG PRPTIADRES ELGITSRHVD SLADDLAKDD AAARMRVVRA
ADPQLTELVD GVRREAGATG SASELQEADA EFARHLSAMR LVKDEWEVAE MRRAVAATHK
GFDAMIASLP EAVRKGRGER WVEGVFGLYS RHEGNGVGYE SICASGDHAN TIHWTKNTGE
VREGDLILID CGIEIDSLFT ADITRTLPVT GRFTDVQRRI YDAVYEAQQA GLAAVKPGNR
FSDIHAAANA VIARILHEWG LLPEGVTLEQ TLDRENGGWH RRWMVHGTSH HLGMDVHDCQ
LMLREDYLDT ELRPGMVLTV EPGLYFKSDD LLVPEEFRGI GVRIEDDVLV TEDGCENLSA
AMPRTSHEVE EWIARVWANA