Gene Sare_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3892 
Symbol 
ID5705830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4432775 
End bp4435321 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content69% 
IMG OID641273317 
Productaminopeptidase N 
Protein accessionYP_001538674 
Protein GI159039421 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02412] aminopeptidase N, Streptomyces lividans type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.213595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAACC TGACGCAAGT GGAGGCCACC GAACGAAGCC GCCTACTCGA CGTGACCGGG 
TACGACATCA ACCTGGACCT GTCGAGCGCC GTGCAGGCCG AGGGCCACAC CTTCCGGTCC
ACGACCGAGG TCCGGTTCCA GTGCAGTGAG CCGGGTGCGA GCACCTTCAT CGAGGTCGCC
GCACACTCGG TTCGGTCGGC CACCCTGAAC GGCACGCCAC TGGACCTGAG TGGCTGGTCG
GCGGAGAACG GCCTCACCCT GCCCGGCCTG GCCGCCGAGA ACACCCTCGT GGTGGACGCG
GACTTCGCGT ACTCCACCAG TGGGCAGGGG CTACACCGCA CCGTGGACCC GGTGGACGGC
GAGACCTACC TGTACAGCCA GTTCGAGACG GCGGACGCGC AACGGGTCTT CGCCGCCTTC
GACCAACCTG ACCTGAAAAG CCTCTACACC TGGCATGCCA CCGTGCCGGA ACACTGGAAG
GTCGTCTCCA ACATGCCGGT GGAGCGGGCG GAGCCCGCCG GCGAGGGCCT GACGAAAATC
CACTTCGCCG AGTCGGTGCG GATGAGCACC TACATCACCG CGCTCTGCGC CGGGCCGTAC
CACGAGGTTC GGGACAGCCA CGATGGCATC GACCTGGGCG TCTTCGTTCG CGCGTCGATG
GCACAGTACC TGGACGCCGA CGACCTGTTC CTGCTCACGA AGCAGGGCTT CGACTTCTTC
CACGCCCAGT TCGGGATTCG GTATCCACTA CCCAAGTACG ACCAGCTCTG GGTGCCGGAC
TTCAACGCGG GCGCGATGGA GAACTTCGGC TGTGTCACGC ACGCCGAGTC GCACTACATC
TTCCGGTCGC AGGTCACCGA CTTCGAGTAC GAGCAGCGCG CCAACACGAT CCTGCACGAG
ATGGCGCACA TGTGGTTCGG CGACCTGGTC ACCATGCGCT GGTGGAACGA CCTCTGGCTG
AACGAGTCGT TCGCCGAGTG GGCCAGCCAC TGGTGCAACA CCAACGCGAC CCGCTTCACC
GAGGCGTGGA CCACCTTCCT GTCCGTCCGA AAGAACTGGG GCTACCGGCA GGACCAACTC
TCCTCCACCC ACCCGGTCTA CTGCGAGATG CCGGACCTGG AGGCGGTCGA GGTCAACTTC
GACGGGATCA CGTACGCCAA GGGCGCAAGC GTACTCAAGC AACTCGTCGC GTACGTGGGC
GAGGAACCGT TCCTGGCTGG CCTGCGGTCG TACTTCCGCA AGCACGCCTG GGGCAACGCC
ACCTTCGACG ATCTGTTGTC CGAACTGGAG GCGGCCTCCG GGCGGGAGCT GCGCAAGTTC
GCCGCCCAGT GGCTGGAGAC CGCCCAGGTC AACACGCTGC GTCCGGACGT GACGATCGGC
GCCGACGGCA GCTACCAGCG GGTCGTGGTC CGCCAGGAGG CGCCGGCCGG TCACCCGACT
CTGCGCACCC ACCGGATCGG TGTCGGTCTG TACGATCGCA CCGACGGCCG TTTGGTCCGC
CGGGAACAGT TCGAGGTGGA CATCGTTGGC GAATCCACGG AGCTGACCGA GCTGGCCGGT
GTTCGCGCGG CGGACGTGCT GCTGCTCAAC GACGACGATC TGACCTACGC CAAGCTGCGG
CTCGACGAGC GGTCGATGGC CACCGTGGTG CAGCACATCA GCGGCTTCGA ATCCTCGCTG
GCGCGAGCCC TGTGCTGGAC GGCGGCGTGG GACATGACCC GCGACGCCGA GCTGGCCGCG
CGGGACTACG TGGCGCTGGT GCTTGCCGGC CTACCCGCGG AGGCCGACAT CAACCTGGTC
ACCGCCACCC TGCGACAGGC CAGCACCGCG CTCACCTTCT ACGCCGACCC GGAGTGGGCG
CCGATTGGCT GGGCCGACCT GGCACGGACC GCGAAGGCCG CCCTCACCGC CGCCGAACCA
GGCAGCGGAT TCCAGCTCGC CTGGGCCCGC GCGTACGCCT CGGCCTGTCG GTCGTCCGAG
GACCTGGCGA CGCTGCGCGG CTGGCTGGAC GGCAACGACG TGCCTCCCGG CCTGAGCATG
GATACCGAGC TACGGTGGAC GGTACTCACC GCCCTGGTGA CCAACGGTGC GGCCGGCCCC
GCCGACATCG AGGCAGAGCT GGCAACCGAC CGCACCGCCA GCGGCGAGCG GGAGGCCGCG
TTCGCGCACG CACGCGTGCC GACGCCGGAG AACAAGGCAG CCGTCTGGGC CCGGTTGACC
GGTCCGGATC CACTGCCGAA CTGGCGGAAC CGGGCGCTGT TGCAGGGCTT CGCCCACCCG
ACACAGGCCG AACTGGTCCG CCCCTACCGG GAGCGCTACT TCGCCACCAT CGCGCAGGTC
TGGGCCAGCC GGGACAGTGA GCCAGCACAG GAATTCGCCC TACTGGCGTA CCCGGCGTAC
CTGGTCGACG AGGACACCGT GGCGGCGACC GACGGCTGGC TGGCCGGCGA GGGCCAACCG
GCACCGCTGC GGCGGCTCGT CGCCGAGGGC CGCGACGGCG TCGTCCGGGC ACTCAAGGCC
CGCGTCCGGG ACGCCCGCAG CGGCTGA
 
Protein sequence
MRNLTQVEAT ERSRLLDVTG YDINLDLSSA VQAEGHTFRS TTEVRFQCSE PGASTFIEVA 
AHSVRSATLN GTPLDLSGWS AENGLTLPGL AAENTLVVDA DFAYSTSGQG LHRTVDPVDG
ETYLYSQFET ADAQRVFAAF DQPDLKSLYT WHATVPEHWK VVSNMPVERA EPAGEGLTKI
HFAESVRMST YITALCAGPY HEVRDSHDGI DLGVFVRASM AQYLDADDLF LLTKQGFDFF
HAQFGIRYPL PKYDQLWVPD FNAGAMENFG CVTHAESHYI FRSQVTDFEY EQRANTILHE
MAHMWFGDLV TMRWWNDLWL NESFAEWASH WCNTNATRFT EAWTTFLSVR KNWGYRQDQL
SSTHPVYCEM PDLEAVEVNF DGITYAKGAS VLKQLVAYVG EEPFLAGLRS YFRKHAWGNA
TFDDLLSELE AASGRELRKF AAQWLETAQV NTLRPDVTIG ADGSYQRVVV RQEAPAGHPT
LRTHRIGVGL YDRTDGRLVR REQFEVDIVG ESTELTELAG VRAADVLLLN DDDLTYAKLR
LDERSMATVV QHISGFESSL ARALCWTAAW DMTRDAELAA RDYVALVLAG LPAEADINLV
TATLRQASTA LTFYADPEWA PIGWADLART AKAALTAAEP GSGFQLAWAR AYASACRSSE
DLATLRGWLD GNDVPPGLSM DTELRWTVLT ALVTNGAAGP ADIEAELATD RTASGEREAA
FAHARVPTPE NKAAVWARLT GPDPLPNWRN RALLQGFAHP TQAELVRPYR ERYFATIAQV
WASRDSEPAQ EFALLAYPAY LVDEDTVAAT DGWLAGEGQP APLRRLVAEG RDGVVRALKA
RVRDARSG