Gene Sare_1367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1367 
Symbol 
ID5707286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1581088 
End bp1582647 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID641270878 
Productaminopeptidase Y 
Protein accessionYP_001536259 
Protein GI159037006 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.227193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000286251 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGATCCC GTACCCCTCG ACCCGTCTGG CCGGCGGTGC TGGCCGTCGT GGCCGCGACG 
ACGCTGACCG CCACCGCGGC CGCCGCCGCG CCGGTGCACC CCCACCTCGC GCCGTCCTCG
ACCGCCGTCG ACGCCCCGGA CATCCCGCTG GCCAACGTGA AAACCCACCT GACCCAGTTC
CAGTCGATCG CCAACACCAA CGGCGGAAAC CGGGCGCACG GCCGACCCGG CTACCTGGCC
TCGGTGAACT ACCTGCGGTC GCAGCTCGAC GCGGTCGGCT ACACCACCAC CGTGCAGTCG
TTCACCTACG CCGGTGCGAC CGGCTACAAC CTGCTCGCCG AATGGCCGGC GGGTGACCCG
GACGCCGTGG TCCTGACCGG AGCGCACCTG GACAGCGTCA CCAGTGGACC GGGCATCAAC
GACAACGGAT CCGGCTCGGC GGCGATCCTC GAGGTGGCAC TCGCCGTGCC GCGTAGCGGC
TTCACCCCGG ACAAGCGCCT ACGGTTCGCC TGGTGGGGCG CGGAGGAGCT GGGTCTGCGC
GGTTCCCGTC ACTACGTGAA CAGCCTGTCG GGCGCGGAGC GCGACCGGAT CCAGCAGTAT
CTCAACTTCG ACATGGTGGG TTCGCCGAAC GCCGGCTACT TCGTCTATGA CGGCGACGAC
TCCGACGGGG TGGGTGCCGG CCCCGGGCCC GAGGGTTCCG CCGAGATCGA GCAGACCATC
CAGGCGTACT ACACCTCGAT CGGCGTGACG ACCCAGGGCA CCGACTTCGA CGGCCGCAGC
GACTACGGGC CGTTCATCGC GGTCGGCATC CCGGCCGGTG GCACGTTCAC CGGCGCGGAG
GGCATCAAGT CCAGCGCCCA GGCGGCGCTC TGGGGCGGGA CGGCGGGACA GGCTTTCGAC
TCCTGCTACC ACCGTTCGTG CGACACCACC GCCAACGTCA ACGACACGGC GCTGGACCGC
AACGCCGACG CGATCGCGTA CACGGTGTGG GAGCTGGCCC AGACGTCTCC GCCGCCGGGT
GACACGGTCT GGAGCGACAC CTTCGAGACC GCCACCGGCT GGGTCGTGGA CCCGGCCGGC
ACCGACACCG CCACGACCGG GGCGTGGGAA CGCGGCGACC CCGCCACCAC CAGCAGCTCC
GGGACCACCC TCCAACTCGG CACCACGGTG AGCGGTAGCT TCGACCTGGT CACCGGCGCG
GCTGCGGGCA GCAGTGCGGG TAGTCACGAC GTCGACGGTG GGGTCACCTC GATCCAGTCC
CCGGCGGTCA GTCTGCCCTC GACCGGCGCG CTGACCCTCA GCTTTTCCTG GTACCTCGCC
CACCTGAGCA ACGCCACCAG TGCCGACTAC CTGCGGGTCC GGGTGGTGGG CAGCAGCACC
GTGACGGCGT TGAGCGTCAC CGGCACGGCG AGCAACCGGG CTGGGGCCTG GCAGACAATC
AGCACGGATA TATCATCCCT AAGCGGTCAA ACCGTACATA TTTTGATCGA CGTGGCGGAT
GCCAGCAACC CGAGTCTGGT GGAGGCCGGC GTCGACGACG TGCGGATCGC CGAGGGCTGA
 
Protein sequence
MRSRTPRPVW PAVLAVVAAT TLTATAAAAA PVHPHLAPSS TAVDAPDIPL ANVKTHLTQF 
QSIANTNGGN RAHGRPGYLA SVNYLRSQLD AVGYTTTVQS FTYAGATGYN LLAEWPAGDP
DAVVLTGAHL DSVTSGPGIN DNGSGSAAIL EVALAVPRSG FTPDKRLRFA WWGAEELGLR
GSRHYVNSLS GAERDRIQQY LNFDMVGSPN AGYFVYDGDD SDGVGAGPGP EGSAEIEQTI
QAYYTSIGVT TQGTDFDGRS DYGPFIAVGI PAGGTFTGAE GIKSSAQAAL WGGTAGQAFD
SCYHRSCDTT ANVNDTALDR NADAIAYTVW ELAQTSPPPG DTVWSDTFET ATGWVVDPAG
TDTATTGAWE RGDPATTSSS GTTLQLGTTV SGSFDLVTGA AAGSSAGSHD VDGGVTSIQS
PAVSLPSTGA LTLSFSWYLA HLSNATSADY LRVRVVGSST VTALSVTGTA SNRAGAWQTI
STDISSLSGQ TVHILIDVAD ASNPSLVEAG VDDVRIAEG