Gene Information |
Locus tag | Strop_2983 |
Symbol | |
ID | 5059447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 3411952 |
End bp | 3413355 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640475234 |
Product | extracellular solute-binding protein |
Protein accession | YP_001159799 |
Protein GI | 145595502 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.105062 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTA CCCCCGATCT CAACCGCCGG AGCCTGCTGC GTCGCGCCGC CGCCGCGGGT CTGCTGACCC TCCCGGCCGC CGGCCTGCTC AGCGCCTGCG CCGGCAGCGA GCCAGCCCAG GACGACAACT CCGGTGCCGC GAAGACCAAG GACAACCCGT TCGGTGTCCA GGACGGCAGC TCCGTCAAGG TGGTCATCTT CAACGGCGGG CTGGGCGACC AGTGGGCCAA GGAGGACGAG GCCGTCTTCA AGGCCAGGCA TCCGAGCGTC ACGGTCAACA TGTCCTCGAC CCAGAAGATC AAGACCGAAG AGCAGCCGAA GATGGCGACC CAGCCCAGCG ACGTCGTCAT GAACTCCGGC GCCGACAGTA TGGACATCAG CACCCTGGTC AACGAGGGCG CGATCGAGCC GCTGGAGGAC CTGCTCGCCG CCCCGGCGTG GGACAGCGAG GGCACGGTGG CGGACACCCT GCTGCCGGGG ACCGTCAACG ACGGCACCTT CCAGGGCAAG TTCTACGTGG TGAACATCGC GTACACGGTC TGGGGTAACT GGTACAACGC CGCCCTCTTC GACAAGGAGG GCTGGCAGCC GCCGAAGACC TTCGACGAGT TCTTCGCCCT CGCGCCGAAG ATCAAGGCGA AGGGCATGGC CCCGTACGTC TACGACGCGG TGCACGGCTA CTACCCGCGG TGGGCGCTGA TGGCGACGAT CTGGAAGTCC GCCGGCAAGC AGGCCGTGAT CGACATCGAC AACCTCAAGG AGAACGCCTG GAAGGCTGAC GGGGTGCTAC CGGCCCTCGA GGCGTGGGAG AAGCTGGTCA AGGACAAGCT GCTGCTCCCC GGCCAGCTGG ACCACACCCA GTCGCAGCAG GCGTGGCTCG ACGGCAAGGC CGCTTTCATC CAGGTCGGCA CCTGGCTCAA GAACGAGATG GCGGAGACCA TCCCCCCGGG CTTCGAGATG ACGCTGTCGG ACTACTGGAG CCTGGGGGCG GGCGACAAGG CACCGAACGA CGTCTACGCC GGTGCGGGCG AGAACATCGT CGTGCCCTCG AAGGCGCCGA ACAAGGCCGC TGCCAAGGAG TTCCTGCGGG CGGTGCTCTC CAAGGAGGGC TCGGCGAAGT TCGCTGAGCT GACCAAATCC CTCGCCTCCA CCAAGGGCTC CGGGGACAAC GTCGAGGACT CGGCGCTGGC CAGCGCGAAC GAGCTGATGC GCAACGCACC ACAGGATCTC GTCTCGTTCA AGTTCTGGAA CTTCTACGCC GACCTGGACA AGGAGAGCCA GAACCTCTCC GCCGAGCTGA TGGCCGGCCG GATGACCGCC CAGCAGTTCG TCGACGGTAT GCAGGGCGCC GCTGACAAGG TCGCCAAGGA CTCGTCGATC AAGAAGCAGA CCCGCTCCGC CTGA
|
Protein sequence | MSATPDLNRR SLLRRAAAAG LLTLPAAGLL SACAGSEPAQ DDNSGAAKTK DNPFGVQDGS SVKVVIFNGG LGDQWAKEDE AVFKARHPSV TVNMSSTQKI KTEEQPKMAT QPSDVVMNSG ADSMDISTLV NEGAIEPLED LLAAPAWDSE GTVADTLLPG TVNDGTFQGK FYVVNIAYTV WGNWYNAALF DKEGWQPPKT FDEFFALAPK IKAKGMAPYV YDAVHGYYPR WALMATIWKS AGKQAVIDID NLKENAWKAD GVLPALEAWE KLVKDKLLLP GQLDHTQSQQ AWLDGKAAFI QVGTWLKNEM AETIPPGFEM TLSDYWSLGA GDKAPNDVYA GAGENIVVPS KAPNKAAAKE FLRAVLSKEG SAKFAELTKS LASTKGSGDN VEDSALASAN ELMRNAPQDL VSFKFWNFYA DLDKESQNLS AELMAGRMTA QQFVDGMQGA ADKVAKDSSI KKQTRSA
|
| |
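The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates (the record does not state the convention, although the listed numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 3411952
END_BP = 3413355
GENE_LENGTH_BP = 1404
PROTEIN_LENGTH_AA = 467


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence string from the Sequence table to `gc_fraction` gives a value that can be compared with the GC content field; the space-separated 10-base blocks can be pasted in unchanged.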