Gene Sare_4225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4225 
Symbol 
ID5704396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4796302 
End bp4797717 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content72% 
IMG OID641273644 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_001538997 
Protein GI159039744 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.150681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCCTA TCTCCGCCCG GGTCCGACCC GGCCTCGCCC TCGTCGTGGC GGCGGTCCTG 
CTGGCCGGCT GCCCGGCTGC CGACCCGGAT GGCGACTTCC GCCCGGGTGC CGCCGACGTC
GGCGATCCGT ATGTTCCCGG CGCGGGTAAC GGCGGGTACG ACGTGGAGCA CTACCGGCTC
GGCGTCGACT ACGACCCGCC CAGCGACCGG CTCTCCGGGC GGGCGGTGGT CACCGCCGTC
GCCACCCAGC CGCTGTCCCG GTTCAACCTG GACCTGCACG GCCTGGAGGT CACGGCGGTA
GGGGTGGACG GCGACCGGGC CCGGCACCGT CGCGACGGCG ACGAACTGGT GGTGACACCG
GCACGGGGGC TGGCCCAGGG CAGCCGGTTC AGCGTCGAGA TCGAGTACGC CGGTCGCCCC
GGCACTCAGG CGAACAGCCC GCTGGGCAGC GGTGGGTTCC TGCACACCGA GGACGGTGCT
ATCGCGCTCG GGCAACCCTA CTCGGCCGCC ACCTGGTTTC CGGTGAACGA CCACCCGAGT
GACAAGGCGA CGTACGACAT CGAGGTCACC GTCCCGGACG GGCTCGCCGC GCTCAGCAAC
GGGGTGCCCG GCGAACGAAG CAGCGCCGGC GGACGAACCA CCTGGCGTTG GTCGGAGCGG
GCCCCGATGG CGAGCTACCT GACCACGTTG GTGATCGGCG CCTACCGCGT GGAGACGGGC
GTCCATGCCG GGAAGCCGAT CGTCACCGCC GTGCCGGAGA GGCTGCCGGC AACCGGTCCG
GAGGCCGTCT CGTTGGCCCG TACCGGCGAG ATCGCCGACT TCCTGGCCGA GCGCTTCGGG
CCGTACCCGT TCGACTCCTA CGGCGGGGTG GCGGTGTCCG ACCCGCGGGT GGGGTACGCG
TTGGAGACGC AGTCACGCCC GGTGTACGGC CCCGGCTTCT TCCGGAGCGG GCAGCCCAAC
TTTGGTGTGG TTGTGCACGA GCTGGCACAC CAGTGGTTCG GCGACAGCGT GGCGGTGACC
CGATGGCGGG ACATCTGGCT GAACGAGGGC TTCGCCACCT ACGCCGAATG GCTCTGGGAG
GAGCATCAGG GCGGTCGGCC GGCGCAGGGC ACCTTCGAGT TGCACTACGC GATGACGGAC
TGGTCGGCCC CGAGTCTGGA CCCAGGGCCT GAGCACATGT TCGGCAGCGC CGTCTACCAA
CGGGGGGCGC TGACCGTGCA CGCGCTGCGG CGTGCGGTCG GCGACCAGGC CTTCTTCGCC
ATCCTGCGAG CCTGGACGGC CGAGCGGCGC GGTGGCAACG GCACCACCGG CGACTTCGTC
GAACTGGCCG AGCGGGTCTC CGGGAAGCAG CTCGACGGGC TCTTCGACGC CTGGCTGTAC
GGCACGACGA AGCCCGCCGT ACCGCAGCCG CGGTGA
 
Protein sequence
MRPISARVRP GLALVVAAVL LAGCPAADPD GDFRPGAADV GDPYVPGAGN GGYDVEHYRL 
GVDYDPPSDR LSGRAVVTAV ATQPLSRFNL DLHGLEVTAV GVDGDRARHR RDGDELVVTP
ARGLAQGSRF SVEIEYAGRP GTQANSPLGS GGFLHTEDGA IALGQPYSAA TWFPVNDHPS
DKATYDIEVT VPDGLAALSN GVPGERSSAG GRTTWRWSER APMASYLTTL VIGAYRVETG
VHAGKPIVTA VPERLPATGP EAVSLARTGE IADFLAERFG PYPFDSYGGV AVSDPRVGYA
LETQSRPVYG PGFFRSGQPN FGVVVHELAH QWFGDSVAVT RWRDIWLNEG FATYAEWLWE
EHQGGRPAQG TFELHYAMTD WSAPSLDPGP EHMFGSAVYQ RGALTVHALR RAVGDQAFFA
ILRAWTAERR GGNGTTGDFV ELAERVSGKQ LDGLFDAWLY GTTKPAVPQP R