Gene Sare_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3549 
Symbol 
ID5703930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4093755 
End bp4095320 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content73% 
IMG OID641272976 
Productleucyl aminopeptidase 
Protein accessionYP_001538342 
Protein GI159039089 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.984624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0353646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATCGT CCACCATCAC TCTCAGCCTC GTCGACACCG ACCCCGCCGA ACTCGCCGTC 
GACGCGATCG TCATCGGCGT GCACAGCCAG CCCGGTGAGC GGGCCGGCGA CCTCGTCGGC
ACCCTGCTGC TGGCCAGCGG CGCGGAGAGC ATCGCCGCGG CGTTCGATGG AAAATTGACC
GAAACGCTGG CGTTGCTCGG CGCAACCGGC GGACCGGGCG AGGTGATCAA GCTCGCCACG
CTCGGCACGG TAACCGCTCC GGTGGTTGCT GCGGTGGGCC TCGGACCGGA GCCGACCGGC
GCCGCCCCCG CCCCTGAGAT CCTGCGCCGT GCGGCCGGCG CGGCCGTGCG TGCGCTGGCC
GGCACGGCCC GGGTCGCGCT GACCCTGCCG CTGCCGGACG ACGCCGACGC GCCGGCGGCG
CTGCGCGCGG TCGCTGAGGG TGCGTTGCTG GGCGGGTACC GGTTCACCGG CTACAAGACC
CGTCCGCAGC CGGCCCGGCG GGAGCCGGTC GCGGAGGTGC TGGTGGCGGT CCCGGACGCG
GGTGACGCGG TCGCCACCGC TGAGGTCGCC CGGGCGCAGG CGGTGGCCAC CGCGGTCCGC
CGCTCCCGGG ACTGGGTCAA CGCCGCCCCC AACGAGCTAC GCCCGCCGGC CTTCGCCGAC
GCCGTGGCCG ACGCCGCCCG CGCAGCCGGG CTGGAGGTGG AGGTCCTCGA CGAGGTCGCC
CTGCGCGAGG GTGGCTACGG CGGCATCACC GCCGTCGGGC AGGGGTCGGA GGCACCGCCA
CGGCTGGTGC GAATCAGCTA CATCCCGGCT GGCGGGGGCA CCGGCAAGCG GGTCGCCCTG
GTCGGCAAGG GCATCACCTT CGACACCGGC GGCGTCTCGA TCAAGCCGTC TCAGGGCATG
TGGGAGATGA AGTCCGACAT GGCCGGCGCC GCCGCCGTCG CCGCCGCGAT GCTGGCGGTC
GCGGAGCTTG CGCCCGCCGT GCCGGTGACC GCGTATGTGC CGATGGCGGA GAACATGCCC
TCCGGCACCG CGTACCGGCC GGGCGACGTC ATCACGATGT TCGACGGTAA GCGTGTCGAG
GTGCTCAACA CCGACGCCGA GGGGCGGATG ATCCTCGCCG ACGCGATCGC CCGCGCCTGC
ACGGACGGCT GCGACTACCT GCTGGAGACC TCCACCCTGA CCGGCGGCCA GGTGGTCGCG
CTGGGCAAGC GGGTGGCCGG TGTGATGGGC ACGCCGGAGT TGTGTGAGCG GGTACGGACT
GCCGGCGAGG CGGTCGGCGA GCCGACCTGG CCGATGCCGC TGCCGGAGGA CGTGCGCAAG
GGCATGGACT CCGAGGTCGC CGACATCTCC CAGGTCAACG CCGGGATGGA TCGAGCAGGT
CACATGCTTC AGGGCGGCGT GTTCCTGCGC GAGTTCGTCG CTGACGAGGT GTCCTGGGCG
CACATCGACA TCGCCGGGCC CAGCTACCAC TCCGGCGAGC CGACCGGCTA CCTGACCAAG
GGCGGCACCG GCGTCCCCGT CCGCACCCTG CTGCACCTGA TCGAGGACAT CGCCACCCAG
GGCTGA
 
Protein sequence
MTSSTITLSL VDTDPAELAV DAIVIGVHSQ PGERAGDLVG TLLLASGAES IAAAFDGKLT 
ETLALLGATG GPGEVIKLAT LGTVTAPVVA AVGLGPEPTG AAPAPEILRR AAGAAVRALA
GTARVALTLP LPDDADAPAA LRAVAEGALL GGYRFTGYKT RPQPARREPV AEVLVAVPDA
GDAVATAEVA RAQAVATAVR RSRDWVNAAP NELRPPAFAD AVADAARAAG LEVEVLDEVA
LREGGYGGIT AVGQGSEAPP RLVRISYIPA GGGTGKRVAL VGKGITFDTG GVSIKPSQGM
WEMKSDMAGA AAVAAAMLAV AELAPAVPVT AYVPMAENMP SGTAYRPGDV ITMFDGKRVE
VLNTDAEGRM ILADAIARAC TDGCDYLLET STLTGGQVVA LGKRVAGVMG TPELCERVRT
AGEAVGEPTW PMPLPEDVRK GMDSEVADIS QVNAGMDRAG HMLQGGVFLR EFVADEVSWA
HIDIAGPSYH SGEPTGYLTK GGTGVPVRTL LHLIEDIATQ G