Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1431 |
Symbol | |
ID | 5708054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1658031 |
End bp | 1659098 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270940 |
Product | extracellular solute-binding protein |
Protein accession | YP_001536321 |
Protein GI | 159037068 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.452816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000555172 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGTTCG TCGTGTCTGC ACTCTTCCGC CGGTCCGCCG CGACGTTCGT CGCGACCGGA CTAGTCGCCG TCGGCCTGGT CGCGTGCGGG TCCGGTCAGG ATGACACCGA CCCCGGTGAA CCCAACGACA AGATCATCAC TGTCTACAGC GGGCGCAACG AGAAGCTCGT CAAGCCGCTG CTGGAGAGGT TCACCGAGCA GACCGGCATC GAGATCCGAC CCCGGTACGC CACCACCGCC CAGTTGGCGG CGCAGCTCGT CGAGGAGGGC GACCGGTCCC CGGCCGACAT CTTCTTCGCC CAGGACGCCG GCGCCCTCGG CACCGTGGCC AAGCAGGGCA TGTTCGCCAC GCTGCCGGAG ACCAGCACCG GCAAGGTCAC CGAGACCTAC CGGGCGCGCA GTGGCCAGTG GGTCGGAGTC AGCGCCCGCT CCCGGGTGCT GGTCTACAAC GCCGACCAGG TCCCCGCCGA CCAACTCCCG ACGTCCGTGT TCGACCTGAC CGGCCCGGAC TGGAAGGGCA AGGTCGCGCT CGCTCCGACC AACGCCTCCT TCCAGGCGTT CGTCACCGCG ATCCGGGTAC AGCACGGCGA CCAGCGGGCC AAGGACTTCC TGTCCGGCCT CAAGGCCAAC GAGGCACAGA TCCGCGACAA CAACATCCGG ATCGTCGAGG CCGTCGACGC CGGTGAGGTC CCCATGGGAC TGGTCAACCA CTACTACCTC GGCGCGATTG CCGAAGAGCA GGGCAGCACG CCGGAGGCGT TGAAGGCCAA GCTGCACTTC TTCCCCGACG GTGACACCGG TGCCCTGGTG AACATCGCCG GTGTCGGGGT GCTCAACCGG GCCGCCGAGG ACGCCGACGC CCAGGCGTTC GTCGACTTCC TGCTCGGCGC GGAGGCCCAG CGGTACTTCG CCGAGGAAAC CTTCGAGTAC CCGGTGGTGA CGGGCGTGCC CGGGCCGACG TACGTGCCGC CACTGGCCGA CCTGAAGGTG CCCGCCATCG ACCTCAACGA CCTGGACACA CTCGAGGCCA CCGTCGCCAT GATCACTGAC TCGGGGCTGG TGCCCTGA
|
Protein sequence | MGFVVSALFR RSAATFVATG LVAVGLVACG SGQDDTDPGE PNDKIITVYS GRNEKLVKPL LERFTEQTGI EIRPRYATTA QLAAQLVEEG DRSPADIFFA QDAGALGTVA KQGMFATLPE TSTGKVTETY RARSGQWVGV SARSRVLVYN ADQVPADQLP TSVFDLTGPD WKGKVALAPT NASFQAFVTA IRVQHGDQRA KDFLSGLKAN EAQIRDNNIR IVEAVDAGEV PMGLVNHYYL GAIAEEQGST PEALKAKLHF FPDGDTGALV NIAGVGVLNR AAEDADAQAF VDFLLGAEAQ RYFAEETFEY PVVTGVPGPT YVPPLADLKV PAIDLNDLDT LEATVAMITD SGLVP
|
| |