Gene Information |
Locus tag | Sare_3224 |
Symbol | |
ID | 5705447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3714919 |
End bp | 3715890 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641272655 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001538022 |
Protein GI | 159038769 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATAC CAGCCTGGAT CAGGGCAACG TCCCTCGTTG GCTCGGCCGT ACTGGCACTG ACGGCGTGCA CGGCCGACAT CCAGCCCCGG GTCGCGGCAC CCGGCCCGCC CGCGGCGATC GAGTGCGGCA CCCTGACCCT GGCGGTCAAC CCGTGGGTCG GGTACGAGGC GAACGTCGCC GTCATCAGCT ACCTGGCCAA GAACCAGCTC AACTGCACCG TGGTCGAGAA GGACCTCAGC GAGGAGGAGT CCTGGAAGCT GCTCGCAGCC GGCGAGATCG ACGCGATCCT GGAGAACTGG GGCCACGACG ACCTGAAAAA GCAGTACATC GACGACGAGC GGGTTGCCGT GGAACACGGT CTCACCGGTA ACAAGGGCAT CATCGGCTGG TATGTCCCGC CATGGCTGGC CGAGAGATAC CCGGGCATCA CCGACTGGCG GAAGCTGAAC GACTACACCT TTCTGTTCCG CACTCCCCGC TCCGGTGGTA GGGGGGAACT GCTCGGCGGC GACCCCACCT ACGTCACCAA CGACAAGGCG CTGATCCGCA ACCTGAAGCT GAACTACACG GTCACCTTCA CCGGAAGTGA GGACAAGCTG ATCGAGGCGT TCCGCACGGC GGAGGAGGAG CGTCGGGCCG TCATCGGATA CTTCTACGCC CCCCAGTGGT TTCTCTCCGA GGTCGATCTG GTGCACATCA GGCTCCCTGA GTACACACCC GGCTGTGACG CGGATCCGGC GAAGGTGGCC TGTGACTACC AGCCGTATGA TCTCGACAAG ATTGCCAACC GGGAGTTCGC CGAATCCGGT AGTCCGGCCG CGGATTTGAT CAAGAACTTC CAGTGGACCA ACGCCGATCA GAACACGGTG GCCCGTTACA TCCGGCAGGA CAAGATGTCC CGCGACGAGG CGGCCAAGAA GTGGCTGGAC GCGAACCCCG ACGTCTGGCG GTCCTGGCTG CCTGCCACCT GA
|
Protein sequence | MRIPAWIRAT SLVGSAVLAL TACTADIQPR VAAPGPPAAI ECGTLTLAVN PWVGYEANVA VISYLAKNQL NCTVVEKDLS EEESWKLLAA GEIDAILENW GHDDLKKQYI DDERVAVEHG LTGNKGIIGW YVPPWLAERY PGITDWRKLN DYTFLFRTPR SGGRGELLGG DPTYVTNDKA LIRNLKLNYT VTFTGSEDKL IEAFRTAEEE RRAVIGYFYA PQWFLSEVDL VHIRLPEYTP GCDADPAKVA CDYQPYDLDK IANREFAESG SPAADLIKNF QWTNADQNTV ARYIRQDKMS RDEAAKKWLD ANPDVWRSWL PAT
|
| |
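The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 3714919
END_BP = 3715890
GENE_LENGTH_BP = 972
PROTEIN_LENGTH_AA = 323


def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of the interval [start_bp, end_bp].

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumption: the CDS length includes one terminal stop codon,
    which does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def record_is_consistent() -> bool:
    """True if the coordinates, gene length, and protein length agree."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )


if __name__ == "__main__":
    print(record_is_consistent())
```

Under these assumptions, the span 3714919 to 3715890 covers 972 bp, and 972 / 3 = 324 codons, one of which is the stop codon, leaving 323 residues.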