Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2038 |
Symbol | |
ID | 5705692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2333213 |
End bp | 2334886 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641271528 |
Product | extracellular solute-binding protein |
Protein accession | YP_001536899 |
Protein GI | 159037646 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000868803 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCCAGG CACGCAGCAC AGCGATCAGC GCACCCGGCA CCCGGCAGCG GAACATCCTG CGGTTGCTCG GTACCGCCGC CGTCCACCAG GCCGACCCGG CCGCCGCCTG GTCCCCGGCC GAACGCCAAC TGCTGCGCCT GACCACGCGG CAACTGGTCA GCTACCCAGG GGCGGCTGAC CCGACCGACT GGCGGGCCCT CGGTCCGGTC GGTGACCTGG CCGTCGACGT GCCGTCCACC TACAACGCCG GGCTCGGCGC CAGCCACCGC TCGTACGTGG CACACCTGCG TCCGGACGTG TGGTGGGACA GCCCGCAGCC GAGGCGGATC ACCGCACACG ACGTCGTACG CGGATTCAAG CGACTCGCCA ACCCCGTGAC CCGTCATCCC GCCCTGCCCT ACTTCCGCAG CACCATACGG GGCATGGACC GGTACTGCGA CGAGTACGCC GCCGTCGTCG CCGGCCGGCC GGTCACCGCG GCACTGCTGG CCGCCTTCGC CAACGCGCAC GACCTGCCCG GCGTGTTCGC CCTCGACGAC GAGACGGTCG TCATCGAACT CCTCCGCCCG GCGTTGGACT TTCCCAACAT GCTCGCCCTG AGCTGCGCCT CCCCGGCCCC CGCCGAGTAC GACGCGTACC TGCCGGGCAG CACCGAACTG CACGCGCACC TCGTCGCCAG CGGTCCGTAC CGGGTCGCCA CGTGGCAGCC CGGGGACACC ATCCGGTTGG AACCCAACCC CACCTGGCGC TCGGAGAGCG ACCCGGTGCG GCACCAGCGT TTCGACGCCG TGGAGTTCCG GGTGTCCGGC GACGGTCCGC GCCGGCTGGC CGACCAGATC TCCGCCGACG TGGCCGACCT GCCGTGGGGG GTTCCCGTCG GCGAGGTGAG CGGGTACCGG GCCGACCCGT TCCTGGTGTT CAACCTGCGC GACCCAGCCA ACCCGGCGAT GACCACGGCA GCCGTGCGAC AGGTGATCGA CGAGGCGATC GACCGGTCCG CGTTGGCCCG GATCGCCCGC GTCGGTGACC CGTGGTCGGC GGTTCGCGAG GCGTACACCG TGGTGCCGCC GGGCAACGAC GGACACCTGC CTTCGGACCC AGCGGCCGAC CCACCGGCGC ACGGCGCCCC CCGCGAGCGG CTCACCGCCG CCGGCCATCC GAACGGGCTC CTCCTGACCG CGGTGTGCCC CGACCGGACC GAGGAACTGG CCCTGGCCCG CGCCTGGGCC GCCGACCTGG CTACGGCCGG CATCGAGGTA CGGCTGGTGG CGTTGGACGA GGCGACGCAC CGGGCGCTGC TCACCGGCGC CGCAGGCGCA CCGGCCCAAC GCTGGGACGT CAGCACCACG TCGTGGACGG CGCCATGGGG GTACGGCAAC GCGCGGGTGT TCCTCCAGCC GCTGGTGGAT GGCGCACGGC CGAGCGGCCA CCGCGACGAG GAGATCGACC GGATGGTCGA GCAGGCGGTC GACGCCGCCG ATCCCCGGGA GGCCGTGGCG TGCTGGCAGC AGGTGCAGCG ACGGCTGCTG GCCGACGCGG CGGTCGTACC CCTGCTGTTC CGACGCCCCA CCGACGCGGC ACCGCGCGGG CCGCGAGTGC GCCGCGCCGA CGCGCTGCCC TCGCTCGGTG GCCTGGCCGA CCTCGGCGAC GTGCGCCTGG GGAACGAGCG GTGA
|
Protein sequence | MTQARSTAIS APGTRQRNIL RLLGTAAVHQ ADPAAAWSPA ERQLLRLTTR QLVSYPGAAD PTDWRALGPV GDLAVDVPST YNAGLGASHR SYVAHLRPDV WWDSPQPRRI TAHDVVRGFK RLANPVTRHP ALPYFRSTIR GMDRYCDEYA AVVAGRPVTA ALLAAFANAH DLPGVFALDD ETVVIELLRP ALDFPNMLAL SCASPAPAEY DAYLPGSTEL HAHLVASGPY RVATWQPGDT IRLEPNPTWR SESDPVRHQR FDAVEFRVSG DGPRRLADQI SADVADLPWG VPVGEVSGYR ADPFLVFNLR DPANPAMTTA AVRQVIDEAI DRSALARIAR VGDPWSAVRE AYTVVPPGND GHLPSDPAAD PPAHGAPRER LTAAGHPNGL LLTAVCPDRT EELALARAWA ADLATAGIEV RLVALDEATH RALLTGAAGA PAQRWDVSTT SWTAPWGYGN ARVFLQPLVD GARPSGHRDE EIDRMVEQAV DAADPREAVA CWQQVQRRLL ADAAVVPLLF RRPTDAAPRG PRVRRADALP SLGGLADLGD VRLGNER
|
| |