Gene Information |
Locus tag | Sare_1367 |
Symbol | |
ID | 5707286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1581088 |
End bp | 1582647 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270878 |
Product | aminopeptidase Y |
Protein accession | YP_001536259 |
Protein GI | 159037006 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.227193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000286251 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGATCCC GTACCCCTCG ACCCGTCTGG CCGGCGGTGC TGGCCGTCGT GGCCGCGACG ACGCTGACCG CCACCGCGGC CGCCGCCGCG CCGGTGCACC CCCACCTCGC GCCGTCCTCG ACCGCCGTCG ACGCCCCGGA CATCCCGCTG GCCAACGTGA AAACCCACCT GACCCAGTTC CAGTCGATCG CCAACACCAA CGGCGGAAAC CGGGCGCACG GCCGACCCGG CTACCTGGCC TCGGTGAACT ACCTGCGGTC GCAGCTCGAC GCGGTCGGCT ACACCACCAC CGTGCAGTCG TTCACCTACG CCGGTGCGAC CGGCTACAAC CTGCTCGCCG AATGGCCGGC GGGTGACCCG GACGCCGTGG TCCTGACCGG AGCGCACCTG GACAGCGTCA CCAGTGGACC GGGCATCAAC GACAACGGAT CCGGCTCGGC GGCGATCCTC GAGGTGGCAC TCGCCGTGCC GCGTAGCGGC TTCACCCCGG ACAAGCGCCT ACGGTTCGCC TGGTGGGGCG CGGAGGAGCT GGGTCTGCGC GGTTCCCGTC ACTACGTGAA CAGCCTGTCG GGCGCGGAGC GCGACCGGAT CCAGCAGTAT CTCAACTTCG ACATGGTGGG TTCGCCGAAC GCCGGCTACT TCGTCTATGA CGGCGACGAC TCCGACGGGG TGGGTGCCGG CCCCGGGCCC GAGGGTTCCG CCGAGATCGA GCAGACCATC CAGGCGTACT ACACCTCGAT CGGCGTGACG ACCCAGGGCA CCGACTTCGA CGGCCGCAGC GACTACGGGC CGTTCATCGC GGTCGGCATC CCGGCCGGTG GCACGTTCAC CGGCGCGGAG GGCATCAAGT CCAGCGCCCA GGCGGCGCTC TGGGGCGGGA CGGCGGGACA GGCTTTCGAC TCCTGCTACC ACCGTTCGTG CGACACCACC GCCAACGTCA ACGACACGGC GCTGGACCGC AACGCCGACG CGATCGCGTA CACGGTGTGG GAGCTGGCCC AGACGTCTCC GCCGCCGGGT GACACGGTCT GGAGCGACAC CTTCGAGACC GCCACCGGCT GGGTCGTGGA CCCGGCCGGC ACCGACACCG CCACGACCGG GGCGTGGGAA CGCGGCGACC CCGCCACCAC CAGCAGCTCC GGGACCACCC TCCAACTCGG CACCACGGTG AGCGGTAGCT TCGACCTGGT CACCGGCGCG GCTGCGGGCA GCAGTGCGGG TAGTCACGAC GTCGACGGTG GGGTCACCTC GATCCAGTCC CCGGCGGTCA GTCTGCCCTC GACCGGCGCG CTGACCCTCA GCTTTTCCTG GTACCTCGCC CACCTGAGCA ACGCCACCAG TGCCGACTAC CTGCGGGTCC GGGTGGTGGG CAGCAGCACC GTGACGGCGT TGAGCGTCAC CGGCACGGCG AGCAACCGGG CTGGGGCCTG GCAGACAATC AGCACGGATA TATCATCCCT AAGCGGTCAA ACCGTACATA TTTTGATCGA CGTGGCGGAT GCCAGCAACC CGAGTCTGGT GGAGGCCGGC GTCGACGACG TGCGGATCGC CGAGGGCTGA
|
Protein sequence | MRSRTPRPVW PAVLAVVAAT TLTATAAAAA PVHPHLAPSS TAVDAPDIPL ANVKTHLTQF QSIANTNGGN RAHGRPGYLA SVNYLRSQLD AVGYTTTVQS FTYAGATGYN LLAEWPAGDP DAVVLTGAHL DSVTSGPGIN DNGSGSAAIL EVALAVPRSG FTPDKRLRFA WWGAEELGLR GSRHYVNSLS GAERDRIQQY LNFDMVGSPN AGYFVYDGDD SDGVGAGPGP EGSAEIEQTI QAYYTSIGVT TQGTDFDGRS DYGPFIAVGI PAGGTFTGAE GIKSSAQAAL WGGTAGQAFD SCYHRSCDTT ANVNDTALDR NADAIAYTVW ELAQTSPPPG DTVWSDTFET ATGWVVDPAG TDTATTGAWE RGDPATTSSS GTTLQLGTTV SGSFDLVTGA AAGSSAGSHD VDGGVTSIQS PAVSLPSTGA LTLSFSWYLA HLSNATSADY LRVRVVGSST VTALSVTGTA SNRAGAWQTI STDISSLSGQ TVHILIDVAD ASNPSLVEAG VDDVRIAEG
|
| |
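The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names gene_length, protein_length, and gc_fraction are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and any other characters (such as the 10-base grouping
    used in the record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length_bp = gene_length(1581088, 1582647)   # 1560, matches "Gene Length"
length_aa = protein_length(length_bp)       # 519, matches "Protein Length"
```

Passing the full Gene sequence field to gc_fraction gives a value that can be compared with the rounded GC content reported in the record.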