Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3207 |
Symbol | |
ID | 5705538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3697273 |
End bp | 3698676 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272638 |
Product | extracellular solute-binding protein |
Protein accession | YP_001538005 |
Protein GI | 159038752 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCTA CCCCCGATCT CAACCGTCGG ACCCTGTTGC GTCGCGCCGC GGCCGCGGGT CTGCTGACCC TCCCGGCCGC CGGCGTGCTC AGTGCCTGCG CCGGCAGTGA GCCGGCCCAG GACGACAGCT CCGGTGCCGC GAAGAGCAAG GACAACCCGT TCGGCGTCAA GGACGACAGC TCTGTCAAGG TGGTCATCTT CAACGGCGGG CTGGGCGACC AGTGGGCCAA GGAGGACGAG GCCGTCTTCA AGGCCAAGTA CCCGAACATC ACGGTCAACA TGTCGTCGAC CCAGAAGATC AAGACCGAAG AACAGCCGAA GATGGCGACC CGGCCCAGCG ACGTCGTCAT GAACTCCGGC GCCGACATCA TGGACATCAG CACCCTGATC AACGAGAGCG CGATCGAGCC GCTGGATGAC CTGCTCGACG CCCCGGCCTG GGACAGCGAG GGCACGGTGG CGGACACCCT GCTGCCGGGG ACCGTCAGCG ACGGCACCTT CCAGGGCAAG TTCTACGTGG TGAACATCGC GTACACGGTG TGGGGTAACT GGTACAACGC CGCCCTGTTC GACAAGGAGG GCTGGCAGCC GCCGAAGACC TTCGACGAGT TCTTCGCCCT CGCGCCGAAG ATCAAGGCGA AGGGCATGGC CCCGTACGTC TACGACGCGG TGCACGGCTA CTACCCGCGC TGGGCGCTGA TGGCGACGAT CTGGAAGTCC GCCGGTAAGC AGGCCGTGAT CGACATCGAC AATCTCAAGG AGAACGCCTG GAAGGCCGAT GGGGTGCTGC CGGCCCTTCA GGCGTGGGAG AAGCTGGTCA AGGACAAGCT GCTGCTCCCC GGCAAGCTCG ACCACACCCA GTCGCAGCAG GCGTGGCTCG ATGGCAAGGC CGCGTTCATC CAGCTCGGTA CCTGGCTCAA GAACGAGATG GCGGAGACCA TCCCGCCGGG CTTCGAGATG AAGCTGTCGG ACTACTGGAG CCTGGGGGCG AGCGACAAGG CGCCGAACGA CGTCTACGCC GGTGCGGGTG AGGGCATCGT CGTGCCGTCG AAGGCGCCGA ACAAGGCCGC TGCCAAGGAG TTCCTGCGGG CAATGCTCTC CAAGGAGGGC TCGGCGAAGT TCGCCGAGCT GACCAAGTCC CTCGCCTCCA CCAAGGGCTC CGGGGACAAC GTCCAGGATT CGGCGCTGGC CAGCGCGAAC GAGCTGATGA GCAACGCCCC CCAGGATCTG GTCTCGTTCA AGTTCTGGAA CTTCTACGCC GACCTGGACA AGGCGAGCCA GAACTTCTGT GCGGAGTTGA TGGCCGGCCG GCTGACCGCT CAGGAGTTCG TCGACGGCAT GCAGGCGGCC GCCGACAAGG TCGCCAAGGA CTCGTCCGTC AAGAAGCAGA CCCGCTCCGC CTGA
|
Protein sequence | MSATPDLNRR TLLRRAAAAG LLTLPAAGVL SACAGSEPAQ DDSSGAAKSK DNPFGVKDDS SVKVVIFNGG LGDQWAKEDE AVFKAKYPNI TVNMSSTQKI KTEEQPKMAT RPSDVVMNSG ADIMDISTLI NESAIEPLDD LLDAPAWDSE GTVADTLLPG TVSDGTFQGK FYVVNIAYTV WGNWYNAALF DKEGWQPPKT FDEFFALAPK IKAKGMAPYV YDAVHGYYPR WALMATIWKS AGKQAVIDID NLKENAWKAD GVLPALQAWE KLVKDKLLLP GKLDHTQSQQ AWLDGKAAFI QLGTWLKNEM AETIPPGFEM KLSDYWSLGA SDKAPNDVYA GAGEGIVVPS KAPNKAAAKE FLRAMLSKEG SAKFAELTKS LASTKGSGDN VQDSALASAN ELMSNAPQDL VSFKFWNFYA DLDKASQNFC AELMAGRLTA QEFVDGMQAA ADKVAKDSSV KKQTRSA
|
| |