Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1628 |
Symbol | |
ID | 5703472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1865799 |
End bp | 1866773 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641271136 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001536511 |
Protein GI | 159037258 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.634582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATCCA TTGTTAACAG GAGGGCCCGT CTGGCGGGCA TCTCACTGTC CACGGTGGCG GCACTCGCGC TCACCGCGTG CGGTGGGGCC AAGGTCGAGT CGTCGGATGC CGCCGATTCG GGCGACTGCG GTACGTTCAC CATCGCGATC AACCCCTGGG TCGGGTACGA GGCGAACGCG GCCGTCATCG CCCACGTCGC CGAGACCGAA CTCGGCTGCA CGGTCGTCAA GAAGGACCTC AAGGAGGAGA TCGCCTGGCA GGGCTTCGGC ACCGGTCAGG TGGACGCGAT CGTGGAGAAC TGGGGCCACG ACGACCTCAA GAAGAAGTAC ATCGAGGACC AGAAGACCGC GGTCGACGCC GGTTCGACCG GCGTGGAGGG AGTCATCGGC TGGTACGTGC CGCCATGGAT GGCCGAGGAG TACCCCGACA TCACCGACTG GAACAACCTG AACAAGTACG CGTCCCTGTT CGAAACCACG GAGTCCGGTG GCAAGGGGCA GCTGCTCGAC GGCGACCCGT CCTTCGTCAC CAACGACGAA GCCCTGGTCA AGAACCTGGA GCTGGACTAC AAGGTGGTCT ACGCGGGCAG TGAGCCGGCG TTGATCCAGG CGTTCCGCCA GGCGGAGAAG GAGAAGAAGC CGGTGCTCGG CTACTTCTAC GACCCGCAGT GGTTCCTCTC CGAGGTCGAA CTGGTGAAGG TGAACCTGCC CGAGTACCAG GAGGGCTGCG ACGCCGACCC GGAGAAGGTC GCCTGCGACT ACCCGGTGTA CGACCTCGAC AAGATCGTCA GCAAGTCGTT CGCCGACGCC AACGGGCCGG CGTACCAGCT GGTCAACAAC TTCACCTGGA CCAACGAGGA CCAGAACCTG GTGGCCCGGT ACATCGCCCA GGACAACATG TCGCCGGAAG AGGCGGCCAA GAAGTGGGTC GAGGCCAACA AGGACAAGGT CGAGGCCTGG CTGCCGCAGA GCTGA
|
Protein sequence | MRSIVNRRAR LAGISLSTVA ALALTACGGA KVESSDAADS GDCGTFTIAI NPWVGYEANA AVIAHVAETE LGCTVVKKDL KEEIAWQGFG TGQVDAIVEN WGHDDLKKKY IEDQKTAVDA GSTGVEGVIG WYVPPWMAEE YPDITDWNNL NKYASLFETT ESGGKGQLLD GDPSFVTNDE ALVKNLELDY KVVYAGSEPA LIQAFRQAEK EKKPVLGYFY DPQWFLSEVE LVKVNLPEYQ EGCDADPEKV ACDYPVYDLD KIVSKSFADA NGPAYQLVNN FTWTNEDQNL VARYIAQDNM SPEEAAKKWV EANKDKVEAW LPQS
|
| |