Gene Sare_3284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3284 
Symbol 
ID5707790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3791944 
End bp3793422 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content70% 
IMG OID641272711 
ProductXaa-Pro aminopeptidase 
Protein accessionYP_001538078 
Protein GI159038825 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.331819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0162357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGG AACGCGTCAA CCGCAGTCCG GAGGGCACGG AGTCACATGA TCCGGCCTTT 
CCGCAGGCGC TCCTCGCGTT CATGCGGCAG GGCTGGCGGG ACACTGCGCT GCCGGTCAGC
CCGCGTCCGG AGGTGCCGAA CTACGCCAAG CGTCGGGCGG CGCTCTCCGC GGCCTTCCCA
GGCGAGACCC TGGTGATCCC CACCGGCGGC GAGAAGACAC GCGCCAACGA CACCGCGTTC
CGGTTCCGTC CGGGCACCGA CTTCGCGTAC CTGACCGGCG ACCACGCCCC AGACAGCGTG
CTCGTGCTAC ACCCGACCGC CTCCGGGCAC GAGCCCGTCC TCTACCTGCG TCCCCACTCG
TCCCGACAGA CCGACGAGTT CTTCCGCAGT CGCCACGGCG AGCTGTGGCT CGGCCGGCGG
CACACCCTGG CGGAGAAGGC GACCGAGTTG GGCCTGGCCA CCGCCGACCT CGACGAACTG
GACGCAGCGT TGGCCGGCCT CGCCCCGGCC CGGACGCGGG TGCTGCGCGG CTTCGATGCC
CAGGTCGACG CCGCGATCCG TTCGTACGAC GAGCAGCGGG AGCGGGGGCA GCCGGCCCGC
GACCGGGAGT TGGCCATCGC GATCTCGGAG CTGAGGCTGG TCAAGGACGA GTGGGAGATC
GCCCAGCTTC AGGACGCAGT GGATGCCACC GTCCGGGGTT TCGAGGACGT GGCCCGGGCG
CTGCCGGCCG ACCGGGGGGT CTCCGAGCGG CTACTCGAGG GCATCTTCGC GCTACGGGCC
CGGCACGACG GCAACGACGT CGGCTACGGC TCGATCCTCG GTGCCGGCGA GCACGCCACG
ATTCTGCACT GGGTGCACAA CCACGGCTCC ACCCGCCCGG GTGAGCTGCT GCTGATGGAC
ATGGGAGTGG AGAACCACCA CCTCTACACG GCCGATGTGA CCCGGGTGCT CCCGGTGAGC
GGCCGATTCA CCGCGCTACA GCGCCAGGTC TACGACATCG TGTACGCCTC GCAGCAAGCC
GGCATCGAGT TCATCCGGCC CGGCGTGGCG TTCAAGGATG TGCACCTGAC CTGTATGCGA
GTGCTCGCTG AGGGCCTGGC CGACCTGGGT CTGCTACCGG TCAGCGTCGA CGAGGCGATG
GACGAGAAGT CGGCGGTGTA CCGGCGGTGG ACGCTGCACG GTTTCGGGCA CATGCTCGGC
ATCGACGTGC ACGACTGTAC GAACGCTCGC GCGGAGATGT ACCGCGACGG AACGCTCGGC
GAGGGCTACG TGCTCACCGT GGAGCCGGGC CTGTACTTCC AGCCCGAGGA CGAGTTGGTT
CCGGAGGAAC TGCGCGGCAT CGGCGTCCGG ATCGAGGACG ATGTCCTGGT CACAGCGACC
GGCACGGTGA ACCTCTCCGC CGGGCTGCCC CGTACGGCCG GTGACGTGGA GACCTGGCTG
GCCGAGCAAC GGGAGGCTGG CCCGCGCCTG CCGGGCTGA
 
Protein sequence
MTEERVNRSP EGTESHDPAF PQALLAFMRQ GWRDTALPVS PRPEVPNYAK RRAALSAAFP 
GETLVIPTGG EKTRANDTAF RFRPGTDFAY LTGDHAPDSV LVLHPTASGH EPVLYLRPHS
SRQTDEFFRS RHGELWLGRR HTLAEKATEL GLATADLDEL DAALAGLAPA RTRVLRGFDA
QVDAAIRSYD EQRERGQPAR DRELAIAISE LRLVKDEWEI AQLQDAVDAT VRGFEDVARA
LPADRGVSER LLEGIFALRA RHDGNDVGYG SILGAGEHAT ILHWVHNHGS TRPGELLLMD
MGVENHHLYT ADVTRVLPVS GRFTALQRQV YDIVYASQQA GIEFIRPGVA FKDVHLTCMR
VLAEGLADLG LLPVSVDEAM DEKSAVYRRW TLHGFGHMLG IDVHDCTNAR AEMYRDGTLG
EGYVLTVEPG LYFQPEDELV PEELRGIGVR IEDDVLVTAT GTVNLSAGLP RTAGDVETWL
AEQREAGPRL PG