Gene Sare_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2138 
Symbol 
ID5707264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2459843 
End bp2462347 
Gene Length2505 bp 
Protein Length834 aa 
Translation table11 
GC content70% 
IMG OID641271623 
Productaminopeptidase N 
Protein accessionYP_001536994 
Protein GI159037741 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02412] aminopeptidase N, Streptomyces lividans type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.010649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGTC TGACCCGCGC CGAGGCCGTC GGCCGCAGCA GCGGCCTACA CGTCCGTTCG 
TACCACGTTG ACCTGGATCT GACCACCGGG GCCGACACCT TCCGATCTCG AACGCGGATC
ACGTTCGATG CCACCGGGCT GCCGACCTTT CTCGACCTCA AGCCGCACCG GGTGCACACG
ATCCGGCTAA ACGGCGAGGC GATCGACGCC GGGACACTGC GTGAGGGCCG GCTGCCGATC
CAGCCCCGCA CCGGCCGCAA CGTGATCGAC GTCGACGCCG ACATGAGGTA CTCCCGCGAG
TGCGAGGGTC TCCACCGCTA CGTCGACCCG GCGGACGGAA AGGTGTACGT CTACGCGTTC
GTGTACGTGG ACAACGCTCC GCGGGTGTTC GCCTGCTTCG ACCAGCCGGA TCTGAAGGCG
CCCTACACCT TCGCGCTGCG CACGCCTACC GATTGGCAGG TGATGGGCAC CAGCAGTGCC
GTGCGCACCG GCCCCGAACG GTGGTTGTTG ACCGAAAGCG CGCCGCAGGC CACCTACCTG
ACCACCGTGG TCGCCGGACC CTACCGATCG TTTCGGGTCG AGCATGGCGG CGTACGCCTG
GGCTTGCACT GCCGGGACTC ACTGGGTGAT GCGCTCACGC ACGATCTCGA CGAGCTGGTC
GACGTCACTC GGCAGTGCCT GGACGCCTAC CGTGACCTGT TCGGTGTCCC CTATCCCTAT
GCCAAGCTCG ACCAGGTCTT CGCACCCGAG TTCAGCGTGC TGTCGCTGGA CCATCCCGCC
TGCGTCCTGC TGCGGGAGCA GTATCTGTTC CGGTCGACCG CCACCGACAG CGAACGGGAG
ACCCGGGCGG TGGTCCTGGC ACACGGCATC TCCCTGATGT GGATGGCCGG ACTCGTCACC
AGCGGCTGGT GGGACGACCT GTGGCTGGGC CAGGCATTCG CCGACTACAT GGCGCATCGG
ATCACCAGCG AGGCCACCCG CTTCCCGGGG CCACCCACCA CCTTCGCGGC CCGGCGCAAG
GGACAGGCGT ACGTCGCCGA CCAGCGCCCC TCCACCCACC CGGTGGCGAT CGACGGTGCC
GATGTGCAGA GCGCCCTGCT GGACCTGGAC CGGATCTCCT ACTTCAAGGG GCACTCGGTG
ATTCGCCAAC TGGCCACCCG GCTCGGCGAC GACGCCCTGC GCGCCGGGTT GCGGAGGTAC
TTCGCCGACC ACGCCCACGG CACGGCCGGC TTCGCCGACT TCCTCTCGGC CCTGACCGCG
GCGACCGGAC AGGACCTCGA GGACTGGGCG CAGCGCTGGT TTCGGACCGC GAACGTCACC
ACGCTGGAGC CGGAGATCGA GGTCGTCGAC GGCCGGATCG TGCGGGCGGC GGTCCGGCAG
TCCGCGCCGA GGAGTCACCC GACGCTACGC CCGCACACCC TCGACATCGG GCTGTACGAC
GGTGCCGGAG GTGGTCACCG GGTGCGCGTC CAGGTCGACG GGCCGCTGAC CGCACTTCCA
CAGCTTGTCG GCGAGGCCGC GCCCCGATTC CTGCTGCTCA ACGACGGCGA CCTCAGCTAT
GCCAAGATCC GGTTCGATCC GACCTCGCTG GCGAGTTTGC CCGGGATCCT GCCCACGCTC
GAACCGATCA ACCGGGCGAT GGTCTGGTGC AACCTGCTAC TCGCCGTGCA GGACGGCGCG
TTTCCGGCCG ACGCTCATCT TGACCTGGTC ACGCGGATGG TGGCGGTCGA GACCGAGCTG
TCGATCCTCG CCGAGGTGCT GGAGCAGGCC CGCAACGACG TGGCGGACCG CTTCCTCGAC
CCGGTCCGGC GACCCGCGGC CATGCGGGCC GTCGCGGCGG CACTGCGCCA CCGGTTGGCC
ACGCTGGCAC CGGGTGACGA GCGACAGGTC ACCCTCTTCC GGGCGCTGGT GGACTTCAGC
GCCGACCCGA CGGAGCTTCA GGGCTGGCTC GATGGCACAG ACACGCCCGT CGGGCTCCCG
GTCGAGGCCG ATCTCGCGTG GCGGATCCGA TACCGGCTGG CTGTGCTCGG CCGGCTCGAC
ACCAACGATA TCGACCACGC GCTACGTACC GATCCGCGCG GGGACAGCGC TGCCGCCGCC
GCGCGCTGCC ACGCCGCTCG CCCCGACTCC GCGGCGAAGG CGTCGGCGTG GAGGACGATC
ACCCGGGACA GTTCGGTGTC CAGCTACCGA CTGTGGGCAC TTGCCGAGGG CTTCTGGCAG
CCGGAGCAGG CCGACCTGAC CGCGCCGTAC GTGCCACGGT TCTTCGTCGA GGCGCCCGGA
CTCGCCCGGT TGCGCGGCGA CCTCGTGCTG GACCTGCTGC TGCGCTTCCT CTATCCCCGT
CACGCCGCCC GCACCGAGAC ACTGACGGCA GCAGCGGACC TGTTGGCACG GGAGGAGCTT
CCGGTGCCGT TGCGCCGGCG GATGGCGGAC TTCACCGACG ATTTGCGCCG GGCGGCGCGG
GCCCGGACGG TGAGCGCCAA CTCGACGGTG GCTTGGGGGC GGTGA
 
Protein sequence
MPSLTRAEAV GRSSGLHVRS YHVDLDLTTG ADTFRSRTRI TFDATGLPTF LDLKPHRVHT 
IRLNGEAIDA GTLREGRLPI QPRTGRNVID VDADMRYSRE CEGLHRYVDP ADGKVYVYAF
VYVDNAPRVF ACFDQPDLKA PYTFALRTPT DWQVMGTSSA VRTGPERWLL TESAPQATYL
TTVVAGPYRS FRVEHGGVRL GLHCRDSLGD ALTHDLDELV DVTRQCLDAY RDLFGVPYPY
AKLDQVFAPE FSVLSLDHPA CVLLREQYLF RSTATDSERE TRAVVLAHGI SLMWMAGLVT
SGWWDDLWLG QAFADYMAHR ITSEATRFPG PPTTFAARRK GQAYVADQRP STHPVAIDGA
DVQSALLDLD RISYFKGHSV IRQLATRLGD DALRAGLRRY FADHAHGTAG FADFLSALTA
ATGQDLEDWA QRWFRTANVT TLEPEIEVVD GRIVRAAVRQ SAPRSHPTLR PHTLDIGLYD
GAGGGHRVRV QVDGPLTALP QLVGEAAPRF LLLNDGDLSY AKIRFDPTSL ASLPGILPTL
EPINRAMVWC NLLLAVQDGA FPADAHLDLV TRMVAVETEL SILAEVLEQA RNDVADRFLD
PVRRPAAMRA VAAALRHRLA TLAPGDERQV TLFRALVDFS ADPTELQGWL DGTDTPVGLP
VEADLAWRIR YRLAVLGRLD TNDIDHALRT DPRGDSAAAA ARCHAARPDS AAKASAWRTI
TRDSSVSSYR LWALAEGFWQ PEQADLTAPY VPRFFVEAPG LARLRGDLVL DLLLRFLYPR
HAARTETLTA AADLLAREEL PVPLRRRMAD FTDDLRRAAR ARTVSANSTV AWGR