Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5038 |
Symbol | |
ID | 5707309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5705320 |
End bp | 5706516 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641274431 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001539772 |
Protein GI | 159040519 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00742101 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCACAGT TGAACCGCCG GCACGCACTA CAGCTCCTGG CCGCACTCGG CACCGCCGGC CTCGTGGCCG GATGTGGCGA CAACGGCGAG TCCGAATCCG GCACCCGCCC GAACCCTATC AAGATCGGCA TGCTCGTGCC GCAGAACGGC GATCTCAAGG AGGTCGGGGT CGAGGTCGTC AACGGCTTCC AGCTCTTCCT GGATCTCAAC GAGGGACGAC TCGGCGGACA TCCCACCGAG TTGATCATCG CCGACGAGGG TGCCGACGCG CAGTCCGGCC AGGCCGCGGT CGAAGGTCTA CTCGAGCAGG GAGTGCTGTC TCTCACCGGC GTGGTCGGCT CGGCCGTCAT GCTCGGCATC CGGGACATGG TGGAGCAGGC CCGGGTGCCG TTGGTCGGCT CCAACGCGTC ACCCACCAGC CTGCAGAGCG TCGTCTACAT CTGGCGCACG TCGTACGTGC TGGACGAGGC CGGTCGAGCC CTCGGCCTGT ACCTGAAGGC GACGCTCGAT CCGTCGGAGC GGCTGGCGAT CATCATGCCG CAGTCCCCGG CCAGCCAGGA CGTGCTGCGG GGTTTCCAGC AGGAATTCGG TGAATCCGAT CCACGAATCG GACGCGTCAC CTGGACGGAG GATATCTCCG GCACCCCGAG CAAGTCCGCC TACCGCCGCG ACATCAACAC CGCCCTCCAG CGTGACCCGG ACGGTGTCCT CTGCTTCTTC ACCGGCGCCG CCGCCGTGGA GTTCCTCAAA CAGCTCCGCG CCGCGGGATA CACCGGCCCC GTCTACGCCC CGGGCTTCCT CACCGAAGGG AACTTCCTGG CGAGTTTCAA AGACGAGGCG GATGTTCTCG GCATCCAGAC CGCCCTGAAC TACTCCCCCG ACCTCAACAA CGCGGCCAAC AGGCACTTCG CCTCGGCGTA CCGCAAGAAG CACGGCACCT CGCCGACCGC GTACGCGACG GCGTCGCACG ACGCGGCGCG GGTCCTCAAC CAGGCGATTC GGCGCGCTGG CGGGTCACCC ACCCCACAGG AGGTGAACCT CGCCCTCGGA AAGATCGGCC GGATCGACAG CCCACGTGGG GTGTGGCAGT TCAACCAGAC GCGCACCCCG CAGCAGCGGT GGTATCTCCG GGAGGTCCAG TTGGACGGTC AGGTGCTGTC CAACGTGCTG TTGACTGAGC TGGCCACGCT CGGTTGA
|
Protein sequence | MPQLNRRHAL QLLAALGTAG LVAGCGDNGE SESGTRPNPI KIGMLVPQNG DLKEVGVEVV NGFQLFLDLN EGRLGGHPTE LIIADEGADA QSGQAAVEGL LEQGVLSLTG VVGSAVMLGI RDMVEQARVP LVGSNASPTS LQSVVYIWRT SYVLDEAGRA LGLYLKATLD PSERLAIIMP QSPASQDVLR GFQQEFGESD PRIGRVTWTE DISGTPSKSA YRRDINTALQ RDPDGVLCFF TGAAAVEFLK QLRAAGYTGP VYAPGFLTEG NFLASFKDEA DVLGIQTALN YSPDLNNAAN RHFASAYRKK HGTSPTAYAT ASHDAARVLN QAIRRAGGSP TPQEVNLALG KIGRIDSPRG VWQFNQTRTP QQRWYLREVQ LDGQVLSNVL LTELATLG
|
| |