Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1793 |
Symbol | |
ID | 5708376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2065073 |
End bp | 2066308 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641271295 |
Product | hypothetical protein |
Protein accession | YP_001536670 |
Protein GI | 159037417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000632283 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGCCG CGTCGACGGC CACGGTGCTC GTGGTCTCCG GATGCGGTGG CGGAGGTCCC CAATCGGGCG GGGACGAGAA GTTTACCGAC GGCGCGATCG TGCTGGGCGT GCTCAACGAC CAGTCCGGCG TGTACTCCGA GCTGTCCGGC CGGAACTCGG TCACGGCCGT GGAACTGGCC GTCGCCGATT TCACGGCGAA ATACGGCGAC CAGGCGGTCA CCACGGACAT CACCGTGCAA ACCGCCGATC ACCAGAACAA GCCGGATGTG GCCAACAGCA AGGCCCAGGA GATGTACGAC CGTCAGGGGG TCGACCTGAT CCTGGACGTG CCCACCTCGT CGGCGGCGCT GCGGGTGGCC GACGTGGCGA AGGAGAAGCA GAAGCTCTAC TTCAACATCG GTGCGGCGAC CACCGACCTC ACCGGCAAGA GCTGCAACAA ATACACCTTC CACTACGCGT ACGACACGTA CATGCTCGCC AACGGCACCG GTCGGACCAC CACCGAGCAG ATCGGCCGGA ACTGGTACAT CCTCTATCCG AACTACGCGT TCGGTCAGGA CATGGAGAAG AGCTTCTCCA CGGCCATCGC CGACGCCGGC GGACGGGTCG TCGGCAAGGA CGGGGCACCG TTCCCGAACA CCAGCGGCGA CTTCTCCACC TACCTGCTGA AGGCGCCGAC ACTGGACCCG AAGCCAGACG TGCTCGGCAC CATGCAGGCC GGCGCGGAAC TGGTCAACGT GGTGAAGCAG TACAACGAGT TCAAGCTGCG CGACAAGGGT GTCGGGCTGG CCGTCGGACT GATGTTCATC ACCGACATCC ACTCACTCAC CCCAGCCGCG CTGGCCGGCA CCACCTACAC CGACGCCTGG TACTGGAACT TCGACGAACA GAACCGTGAG TTCGCCGACC GGTTCCAGCA GGAGACGGGC ACCCGGCCGT CCTTCGCGCA CGCGGCGAAC TACTCCGCCG CCACGCAGTA CCTGGAGGCG GTGCAGGCGG CCGGCACCGA CGATGCCGAC ACCATCGTCG AGGAACTGGA GGGCAAGGAG ATCAACGACG TCTTCCTGCG CAACGGCAAG ATCCGCGCGG AGGACCACCG GGTGGTCCAC GACGCCTACC TGGCCCAGGT GAAGCCGCAG TCCGAGGTCA CCGAGCCGTG GGACTACGTG CGGATCCTCG AGACCATCCC GGCCGGGGAG GCGTTCCGGG CCCCGTCCCC GGACTGCAGC CTGTGA
|
Protein sequence | MVAASTATVL VVSGCGGGGP QSGGDEKFTD GAIVLGVLND QSGVYSELSG RNSVTAVELA VADFTAKYGD QAVTTDITVQ TADHQNKPDV ANSKAQEMYD RQGVDLILDV PTSSAALRVA DVAKEKQKLY FNIGAATTDL TGKSCNKYTF HYAYDTYMLA NGTGRTTTEQ IGRNWYILYP NYAFGQDMEK SFSTAIADAG GRVVGKDGAP FPNTSGDFST YLLKAPTLDP KPDVLGTMQA GAELVNVVKQ YNEFKLRDKG VGLAVGLMFI TDIHSLTPAA LAGTTYTDAW YWNFDEQNRE FADRFQQETG TRPSFAHAAN YSAATQYLEA VQAAGTDDAD TIVEELEGKE INDVFLRNGK IRAEDHRVVH DAYLAQVKPQ SEVTEPWDYV RILETIPAGE AFRAPSPDCS L
|
| |