Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3501 |
Symbol | |
ID | 5703310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4040415 |
End bp | 4041833 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272928 |
Product | Alpha,alpha-trehalose-phosphate synthase (UDP-forming) |
Protein accession | YP_001538294 |
Protein GI | 159039041 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0380] Trehalose-6-phosphate synthase |
TIGRFAM ID | [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.519906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00222704 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACAGA GTTCCCTGGT GGTGGTCGCC AACCGCCTCC CCATCGATGA CAGTACGGCG CCGGACGGTG CCTGCGAATG GCGCCGCAGG CCCGGCGGCC TGGTGACCGC CCTACACTCG CTGCTACGGC AGGCACCCGC CACCTGGGTG GGCTGGGCGG GTGGCACCGG GCCGGCCCCG ACACTGCCCG ACGTCGACGG CGTCCGCATG CACACGGTGC CGCTCACCGT CGACGACCTC CGCGACCACT ACGAGGGCTT CGCCAACGCC ACCCTCTGGC CGCTCTACCA CGACGCCGTG GAGCAGCCGG AGCACCACCG CCGGTGGTGG GAGGCGTACC AACGGGTCAA CCAACGGTTC GCGGCGGCGA CCGCCGACGT GGCCGAGACC GGCGCGGTGG TCTGGGTGCA GGACTACCAC CTGCAGCTCG TACCCGGCCT GCTCCGGGCA CTGCGCCCGG ACCTGCGGAT CGGCTTCTTC CTCCACGTGG CGTTCCCACC ACCCGAGCTG TTCATGCAGC TTCCCCGGCG GGCCGAGTTG CTCCGCGGGA TACTCGGCGC GGACCTCGTC GGCTTCCAGC GGGCCCAGGC GGCGCACAAC TTCGCCCAAC TCGCCGTCCG GGTGCTCGGG CTGCCGGCCA CCGACCGCCA GATCGTCGTG GACGACCGAG TGGTCCGCAT CGGCTCGTTC CCCGTCTCCA TCGACAGCGC CGAAATGGCG GCCCTGGCCA ACCGAGCCGA TGTCGCCGAC CGAGCCAACC GACTCCGCCG TGACCTGGGC AGCCCGGAAC AGGTGATCCT CAGCGTCGAC CGGATGGACT ACACCAAGGG CATCGAGCAG CGGCTGAAGG CGTACAGCGA GCTGATCTCC GACGGCCACG TCAAGGTACG AGACACCGTC CTGGTCCAGG TGGCGGTGCC CAGCCGCGAG CGGGTCGGGC AATACCAGAT CCTCCGCGAA CGGGTCGAAC GTGAGGTTGG CCGCATCAAC GGCGAATTCG GTCGCGTCGG CGAACCGGCC ATCCACTACC TGACCCGACC CTTCGACCGC GCCGAACTGG CCGCGCTCTA CCGGGTCGCC GACGTGATGG CGGTGACCCC ACTGCGGGAC GGCATGAACC TGGTGGCCAA GGAATACGTA GCCGCTCGGG TCGACGACAC CGGTGCGCTG CTGCTCAGCG AGTTCGCCGG CGCCGGGGCG GAGCTGTCCC AGGCGTATCT GGTGAACCCG CATGATCTGG AAGGTCTCAA GCAGGGTCTT CTCGCGGCGC TGCGGGCCCG GCCGGACCAC GTCCGCAAAC GGATGCGGGC GATGCGGGCG CACCTGCGCA AGCACGACAT CCACGCATGG GCGCGCTCCT ACCTTGCCGC CCTCGACGAC AACGGCTCGC TGCTCAGCCG ACTCGGTACG ACCCGCTGA
|
Protein sequence | MRQSSLVVVA NRLPIDDSTA PDGACEWRRR PGGLVTALHS LLRQAPATWV GWAGGTGPAP TLPDVDGVRM HTVPLTVDDL RDHYEGFANA TLWPLYHDAV EQPEHHRRWW EAYQRVNQRF AAATADVAET GAVVWVQDYH LQLVPGLLRA LRPDLRIGFF LHVAFPPPEL FMQLPRRAEL LRGILGADLV GFQRAQAAHN FAQLAVRVLG LPATDRQIVV DDRVVRIGSF PVSIDSAEMA ALANRADVAD RANRLRRDLG SPEQVILSVD RMDYTKGIEQ RLKAYSELIS DGHVKVRDTV LVQVAVPSRE RVGQYQILRE RVEREVGRIN GEFGRVGEPA IHYLTRPFDR AELAALYRVA DVMAVTPLRD GMNLVAKEYV AARVDDTGAL LLSEFAGAGA ELSQAYLVNP HDLEGLKQGL LAALRARPDH VRKRMRAMRA HLRKHDIHAW ARSYLAALDD NGSLLSRLGT TR
|
| |