Gene Information |
Locus tag | Sare_0255 |
Symbol | |
ID | 5705405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 281946 |
End bp | 283610 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641269783 |
Product | extracellular solute-binding protein |
Protein accession | YP_001535178 |
Protein GI | 159035925 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
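
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Sare_0255.
length = gene_length(281946, 283610)
print(length)                  # 1665, matching "Gene Length"
print(protein_length(length))  # 554, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields reported in the record.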
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.192685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00254463 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGCAT CCAGGCCGAA GGTCGCTGTC GCGGCTGTCG CGGTCGCGGC CCTCGCGGTA TCCGGCTGCG CCGAGAGCGA CCGCGACGAT TCTTCCGGTG ATAGCAACAA CGACACCCTG GTCTTCGGCG TCGCCGGAGA TCCGAAGGTG CTCGACCCGA GCTTCGCCAG CGACGGCGAA TCGCTGCGTG TGTCCCGTCA GATCTTCGAG ACTCTGGTCC GTCCCGAGGA GGGCGGCACC AAGGTGAGCC CCGGCCTGGC CGAGTCCTGG ACCCCGGACG CAGCCGGCAC GACCTGGACC TTCAAGCTTC GCTCGGGCGT GAAGTTCCAC GATGGTACCG ACTTCGACGC CGAGGCCGTC TGCGTCAACT TCAATCGCTG GTACAACGCC AAGGGCCTCA TGCAGAGCCC GGACGTGACC ACGTACTGGC AGGACGTGAT GAACGGCTTC GCGCAGAACG AAAACGACAC GCTCTCGGAG AGCCTGTTCA AGTCCTGCAC CGCCACGGAC GCCACCACGG TCGACCTGGC CTTCACCCGG GTGTCCAGCA AGATCCCGGC CGCCCTGATG CTGCCGTCGT TCTCCATCCA CAGCCCGAAG GCGCTGGAGC AGTACGACGC GAGCAACGTC GGCGGCACGG CGACGGACGT CAAGTACCCC GAGTACGCGA CCGGGCACCC GACCGGTACC GGACCGTTCA AGTTCAAGGC CTGGGACATC GCCAACAAGA CGCTCACTAT CGAGCGTAAC GACGACTACT GGGGCGAGAA GGCCAAGCTG AAGACCCTTA TCTTCAAGAC CATCTCCGAT GAGAACGCCC GCAAGCAGGC GCTGCGGTCT GGTGACATCC AGGGCTACGA CCTGGTCGGG CCGGCTGACG TCGAGCCGCT GAAGGCGGAG GGCTTCAACG TCCTGACCCG GCCGGCGTTC AACATCCTCT ACCTGGGGAT GAACCAGAAG GGGAACCCGA AGCTGGCCGA CCTCAAGGTG CGGCAGGCGA TCGCCCACGC GATCAACCGG CAGGCCTTGG TCGACTCGAA GCTCCCCCCG GGAGCGAAGG TCGCGATGAA CTTCTTCCCG GACACCGTCG AGGGTTGGAA CGGTGACGTC ACCACGTACG ACTACGACGT CGACAAGGCC AAGCGGTTGC TGGCCGAGGC CGACGCGGCC GACCTGACGC TGCGGTTCCA CTACCCGACC GAGGTCACCC GCCCGTACAT GCCGAACCCG AAGGACCTCT TCGAACTGGT GTCGGCGGAC CTGCAGGCGG TCGGCATCAC GGTCGAGCCG ATCCCGCTGA AGTGGAGCCC GGACTACCTG AACGCCACCA CGTCCGGCAG CGAACACGAC CTGCACCTGC TCGGATGGAC CGGCGACTAC GGCGACGGCT ACAACTTCAT CGGCATCATG TTCGACCGGC AGAAGGACGA GTGGGGTTTC GACAACCCTG CCCTCTTCGC TCAGTTCACG GATGCTGACA CCACCGCCGA CCGGGCGAGC CGGGTGGAGA AGTACAAGGG CCTGAACAAG ACCATCATGG ACTTCCTGCC AGGCGTGCCG ATCTCGCACT CGCCGCCGGC GATCGTCTTC GGCAAGGACG TGATCGGTGT CAAGGCCAGC CCGCTCACCG ACGAGCGGTA CGCCAACGCC GAGTTCAAGT CCTGA
|
Protein sequence | MRASRPKVAV AAVAVAALAV SGCAESDRDD SSGDSNNDTL VFGVAGDPKV LDPSFASDGE SLRVSRQIFE TLVRPEEGGT KVSPGLAESW TPDAAGTTWT FKLRSGVKFH DGTDFDAEAV CVNFNRWYNA KGLMQSPDVT TYWQDVMNGF AQNENDTLSE SLFKSCTATD ATTVDLAFTR VSSKIPAALM LPSFSIHSPK ALEQYDASNV GGTATDVKYP EYATGHPTGT GPFKFKAWDI ANKTLTIERN DDYWGEKAKL KTLIFKTISD ENARKQALRS GDIQGYDLVG PADVEPLKAE GFNVLTRPAF NILYLGMNQK GNPKLADLKV RQAIAHAINR QALVDSKLPP GAKVAMNFFP DTVEGWNGDV TTYDYDVDKA KRLLAEADAA DLTLRFHYPT EVTRPYMPNP KDLFELVSAD LQAVGITVEP IPLKWSPDYL NATTSGSEHD LHLLGWTGDY GDGYNFIGIM FDRQKDEWGF DNPALFAQFT DADTTADRAS RVEKYKGLNK TIMDFLPGVP ISHSPPAIVF GKDVIGVKAS PLTDERYANA EFKS