Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1622 |
Symbol | |
ID | 5703403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1856241 |
End bp | 1857437 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271130 |
Product | glycine betaine/L-proline ABC transporter, ATPase subunit |
Protein accession | YP_001536505 |
Protein GI | 159037252 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1125] ABC-type proline/glycine betaine transport systems, ATPase components |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCG ACGTCATGAT TGATCTTCAA CAGGTCAGCA AGTTCTACCG CGGGCAGAAG GCCCCCGTGG TGGAGAACAT GTCGATGACG ATCCACCGGG GTGAGATCGT CGTCCTGGTC GGCCCCTCTG GCTGCGGCAA GACCACGACC ATGAAAATGA TCAACCGTTT GATCGAGCCC AGCAGCGGCC GGATCCTCAT CGACGGAACC GACGTGACCG CACTGGACGG CAACGACCTA CGCCGACAGA TCGGCTACGT CATTCAGCAG GTTGGGCTCT TCCCACACAT GAGCGTCGCC ACCAATGTCG GATTGGTGCC GAAGATGCTC GGCTGGGACC GCAAGCGTAT CGAGGCCCGG GTCGACGAAC TCCTGCACCT CGTCGGCCTC GAACCGGCCA CCTACCGCAA CCGGTTGCCC CGCCAGCTCT CCGGCGGGCA GCAGCAGCGC GTCGGGGTGG CCCGGGCCCT CGCCGCGGAT CCGCCGGTGA TGCTGATGGA CGAGCCGTTC GGCGCCACCG ACCCAATGAC CCGGGACAGG CTGCAGAACG AGTTCCTCCG CCTCCAAGAT CAGTTGCGCA AGACGATCGT CTTCGTGACC CACGACTTCG ACGAGGCGAT CAAGATGGGC ACCCGGATCG CCGTCCTCGG GGAGAGGTCC AGGATTCGGC AGTTCGACAC CCCCGAAGTC CTGCTGGCGC ACCCCGCCGA CAGCACCGTG GCCCAGTTCA TCGGCGGCGG CGCCCAGCTG AAACAGCTCG ACCTTCGCCG GGTCGACGCG ATCCAGTGGG ACGACGTCCC CCTGATCCGG GTGGACAAGG CCGCCACCGG TACGCGACGC CACCCCGACG GGGCCGACGG CTCCACGGCG CTCACCGTGG ACGACGACAA CCGCCCGCTG GGCTGGATCA GCGCCCACGA TCGGGCGATC CGGCAGGGCG CTCCCGAGGG TGCCAGCCAG GCGGTGACGA CGGTGGAGCC ACAGGCGACG TTGCGTGATG CCCTCGACGC GATGCTCGCC TCACGCCACG GCACCGCCGT GGTGGTCGAC GAGCAGGGCC GCTACGCCGG CGCGGTGACA CTCGACGGCC TGATGCGGGT GATTCACCCC ACGCGAGAGC AGGACCGGCT GGACTCCGCT GTTCCCGGTC AGACAACGGG TGAGGTCCGC TCCGCGCTGA CGCCGGAGAA ACGATGA
|
Protein sequence | MSRDVMIDLQ QVSKFYRGQK APVVENMSMT IHRGEIVVLV GPSGCGKTTT MKMINRLIEP SSGRILIDGT DVTALDGNDL RRQIGYVIQQ VGLFPHMSVA TNVGLVPKML GWDRKRIEAR VDELLHLVGL EPATYRNRLP RQLSGGQQQR VGVARALAAD PPVMLMDEPF GATDPMTRDR LQNEFLRLQD QLRKTIVFVT HDFDEAIKMG TRIAVLGERS RIRQFDTPEV LLAHPADSTV AQFIGGGAQL KQLDLRRVDA IQWDDVPLIR VDKAATGTRR HPDGADGSTA LTVDDDNRPL GWISAHDRAI RQGAPEGASQ AVTTVEPQAT LRDALDAMLA SRHGTAVVVD EQGRYAGAVT LDGLMRVIHP TREQDRLDSA VPGQTTGEVR SALTPEKR
|
| |