Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3644 |
Symbol | |
ID | 5703338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4204729 |
End bp | 4205529 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273069 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001538433 |
Protein GI | 159039180 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.471814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00574343 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACACAG GCATTCGCCG GCCAGCGCTC ACGGTAGGGA ACATCTCGCT GTCGTTTCAC CGGGCCGTGG CCGCGGTCAC CCGCCGCGTC CTGGAGGACT CCGGGCACGA AGTGCGTACC GTCGAGGCAC CGCACCAGCA GCTCTTCGAG GTGCAGGCCG CGGGAGAGCT CGATGTGCTC GTCTCGGCCT GGCTGCCCTC CAGCCACGGG AAATACCTGT CCCCGTACCG CGACCACGTG CAGGTCCTGC CCGCGCACTA CGAGCCGTAC TGCGTCTGGG CGGTGCCCCC GTATGTCCCG GCCGACGCGG TCGGCGAGGT CGCCGACCTG GTCCGCACCG ACGTGGCCGG CCGGATGACC GGAACCATCG ACGGCATCAA CCCCGGAGCC GGTATCAGCC GCTTCTCCGC CCAAATGGTC CGTGAATACG GTCTTGACCG GCACGGTTAC GCGTTCCGGC CGGGGACCGA GCAGTCCTTC GTCAGCCGGG TGGAGCGCGG AATCGCCGAG CGCGAATGGT TCGTGATTCC GCTCTGGCGG CCACAGTATC TGAATCTGCT CCACGGCCTG CGGCCGCTGG CCGAGCCCAA GGGCCTGCTG GGCGGAGTGG ACTCGGCGAG CCCGGTCGTC ACCAACCGCG CCATGGACGT CATCGCCCCC GAGGCGCTGG AGCGTCTACA CAAACTCCAT CTCGGCAACG AGGGGGTGGA GGCGATCGAC AAGCTCATCA ACGTCGACGG ACTGGCACCG CTGGACGCCG CCGACCGCTA CCTGGGCCGC GCGGGCGCCG CCACCGGATG A
|
Protein sequence | MDTGIRRPAL TVGNISLSFH RAVAAVTRRV LEDSGHEVRT VEAPHQQLFE VQAAGELDVL VSAWLPSSHG KYLSPYRDHV QVLPAHYEPY CVWAVPPYVP ADAVGEVADL VRTDVAGRMT GTIDGINPGA GISRFSAQMV REYGLDRHGY AFRPGTEQSF VSRVERGIAE REWFVIPLWR PQYLNLLHGL RPLAEPKGLL GGVDSASPVV TNRAMDVIAP EALERLHKLH LGNEGVEAID KLINVDGLAP LDAADRYLGR AGAATG
|
| |