Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1401 |
Symbol | |
ID | 5704087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1618533 |
End bp | 1619741 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270911 |
Product | hypothetical protein |
Protein accession | YP_001536292 |
Protein GI | 159037039 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000317251 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGACT GGACTGCCTT CGGACGGGTG GACGCGGACG GCACCGTGTA CGTCAAAACC GCTGAGGGCG AGCGGGTTGT CGGTTCCTGG CAGGCGGGTG CGCCGGAGGA GGGGTTGGCC CACTTCGCCC GCCGTTTCGC GGACCTGGTC ACCGAGGTTG AGTTGACCGA GGCCCGACTC AACTCGGGCG CGGCGGACGC GAACCACTCG CTCAGCACGA TCCGCCGGCT GCGCGCCTCA CTGGCCGAGG CGCACGTCGT CGGCGACATC GACGGGCTGG CCACCCGGCT GGACCGGCTC GCCGGCCTGG CCGAGGAGAA GGCTGGGGAG GCTCGCGCCG CCCGCGACGC CGCCCGCACC GAGGCCCTCG CCCGGAAGAC CATGCTGGTC GAGGAGGCCG AGAAGCTGGC TGCCGAGTCG ACCGGCTGGA AGACGGCCGG CGACCGGCTC AAGGAGATCC TCGGGGAGTG GAAGACCATC CGCGGGGTCG ACCGGAAGAC CGACGGTGAG CTGTGGAAGC GGTTCGCGGC GGCCCGGGAC GGCTTCACCC GCCGGCGCGG CGCCCACTTT GCCTCTCTGG ACGCGCAGCG TAAGCAGGCG CAGTCGGTCA AGGAGGAGCT GGTCGTCGAG GCCGAGAAGC TCAAGGATTC GACCGAGTGG GCGAACACCG CCAGCCAGCT CAAGGAGCTG ATGAACCAGT GGCGTGCCGC GCCGCGGGCG TCGAAGGAGG CCGAGCAGCG GCTCTGGGAG AGGTTCCGGG CCGCGCAGGA CGCCTTCTTC ACCCGGCGCA GCGAGGTCTT CTCGGCCCGG GACAACGAGC AGCGCGCCAA CCTGGAGCGC AAGCAGGCGT TGTTGGCGGA GGCCGAGGCG CTGGACATCG ACGGCGACCC GAAGGGGGCG CAGGCGAGGC TCCGGGGGAT TCAGGCGCAG TGGCACGAGG CCGGGCGCGT GCCCCGGGAG GCGGCGGGGG GCTTGGAGCG TCGGCTACGC GCGGTCGACG ACAAGGTTCG CGAGGTGATG GATTCGGCGT GGCGGCGCAC CTCCCCGCAG GACAATCCGC TGCTCGCGCA GATGCGGGCG CAGGTCGCCG AGGCCGAGGA GCGGCTCGCC CGAGCCAAGG CCGCCGGGGA TGCCCGGCGG GTCCGCGACG CGGAGCAGGC TCTGAGTTCC AAGCGTCAGT TCCTCAACCT GGCCGAGCAG TCCAACTGA
|
Protein sequence | MSDWTAFGRV DADGTVYVKT AEGERVVGSW QAGAPEEGLA HFARRFADLV TEVELTEARL NSGAADANHS LSTIRRLRAS LAEAHVVGDI DGLATRLDRL AGLAEEKAGE ARAARDAART EALARKTMLV EEAEKLAAES TGWKTAGDRL KEILGEWKTI RGVDRKTDGE LWKRFAAARD GFTRRRGAHF ASLDAQRKQA QSVKEELVVE AEKLKDSTEW ANTASQLKEL MNQWRAAPRA SKEAEQRLWE RFRAAQDAFF TRRSEVFSAR DNEQRANLER KQALLAEAEA LDIDGDPKGA QARLRGIQAQ WHEAGRVPRE AAGGLERRLR AVDDKVREVM DSAWRRTSPQ DNPLLAQMRA QVAEAEERLA RAKAAGDARR VRDAEQALSS KRQFLNLAEQ SN
|
| |