Gene Information |
Locus tag | PCC8801_1396 |
Symbol | |
ID | 7103083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1459551 |
End bp | 1461095 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643474475 |
Product | Alpha,alpha-trehalase |
Protein accession | YP_002371612 |
Protein GI | 218246241 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
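The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1459551
END_BP = 1461095


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1545 514
```

Both values match the Gene Length (1545 bp) and Protein Length (514 aa) fields reported in the record.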
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAATT CTCTTCCCCT GTATGATGAA TTACGGCTAA CCAGCCAAGA TCTTGCACCC ATCAGACACT ACATCAAACA CACCTGGAAA ACCCTCACTC GTTCCCCACG TCATATCGTT AAAGCTGCCA GAGATCCTAA ACTGGAGTAT CAAGGAGATC AACCCTTACC CGTTTATCTT TCAGCGAGAG AAGATTATGT ACAAGTAGAA AACAAGTTGC GTCAACTACT CAGTCCAGAA GAATTAGCGC AAATTGAACT GCAAGTTTTA CCCCCAGAAA TGAACCAAAT TGAACACCAT GGACTGCTTT ACCTCCCAGG GGAATACGTT GTGCCTGGAG GACGCTTTAA TGAACTCTAT GGATGGGATA GCTACTTTAT TCAATTGGGA TTACTCCAGG ATGGAGAAAT TGCTTTAGCC CAAAGCATGA TAGACCAATT ACTCTACGAA ATCGAACACT ATGGGACAGT TTTAAACGGC AATCGCACCT ATATGCTCAA CCGTTCTCAA CCTCCCTTTC TCACCCGAAT GATTTTAGAC CTGTATCACC GTACTCACGA TCTCAACTGG TTGCGTTCAG TGTTACCAAC GGTACAAAGC TATTATTTTT ACTGGACGGT TCCCCCGCAC CTCAATCAAG CCACAGGATT GTCTCACTAT AACGCCTTTG GGGTGGGACC AGCCCCAGAG GTGATCAGTT CCGAAATTGA TGAAAACGGC AAAAATCACT ACGAACGCAT CTTAGAATAC TACCGCACTC ATGAAATTGA AGACTATGAC GTGAGTCTGT ACTATGATCA AGAAACAGAC AGTTTAACCG ACCTTTTTTA TCAGGGCGAT CGCTCCATGC GAGAATCGGG ATTTGACCCC ACTAATCGCT TTGGACCGTT TAGCGTAGAT ATCATCCATT ATGCCCCCGT CTGTCTCAAT TCCCTACTGT ATCAGATGGA ACTGGATTTA GCCGAAATGC AACGCATCTT AGGCTATGGT CACGCGGCCT CCTATTGGCT CAACCACGCC GAAAACCGCC GTCATTTGAT GAATCAATAC CTCTGGGATG ACGAAGTAGG GTTGTATTTT GACTACAATT TTCGGACTGG TTGCTGTCGT CGCTATGAAT TTGTGACCAC CTTCTTCCCC TTGTGGGTTG GGTTAGCCTC TCCCGAACAA GCGCAACGGG TTGCCCTGAA TTTATCCACC TTTGAAACCC CTGGCGGCCT CGTCACCAGT ACCCACTTTT CCGGTAATCA ATGGGATGAG CCTTTTGGTT GGGCTCCCTT ACACCTGATT GCTGTTGATG GGTTGCGGCG TTATGGTTAT ATTGAGGAAG CCCACCGCAT TGCCTGTAAA TTTGTCAATT TAGTCCTTCA AGAGTTTAAC AAAACCGGAA CCATTGTCGA GAAATACGAT GTCAAAAAAT GCTCGGCTGA TGTCTCTGAT GAAATTTTCT TCGGTTATAG TTCCAATGAA ATTGGCTTTG GCTGGACGAA TGGGGTCGTT TTAGAGTTAT TGGCGATGTT GGAACGCGAT GGGGTCATCG TGTAG
Protein sequence | MLNSLPLYDE LRLTSQDLAP IRHYIKHTWK TLTRSPRHIV KAARDPKLEY QGDQPLPVYL SAREDYVQVE NKLRQLLSPE ELAQIELQVL PPEMNQIEHH GLLYLPGEYV VPGGRFNELY GWDSYFIQLG LLQDGEIALA QSMIDQLLYE IEHYGTVLNG NRTYMLNRSQ PPFLTRMILD LYHRTHDLNW LRSVLPTVQS YYFYWTVPPH LNQATGLSHY NAFGVGPAPE VISSEIDENG KNHYERILEY YRTHEIEDYD VSLYYDQETD SLTDLFYQGD RSMRESGFDP TNRFGPFSVD IIHYAPVCLN SLLYQMELDL AEMQRILGYG HAASYWLNHA ENRRHLMNQY LWDDEVGLYF DYNFRTGCCR RYEFVTTFFP LWVGLASPEQ AQRVALNLST FETPGGLVTS THFSGNQWDE PFGWAPLHLI AVDGLRRYGY IEEAHRIACK FVNLVLQEFN KTGTIVEKYD VKKCSADVSD EIFFGYSSNE IGFGWTNGVV LELLAMLERD GVIV
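The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence could be turned into the protein sequence and how a GC fraction could be computed. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helpers `clean`, `gc_fraction`, and `translate` are hypothetical names used only for illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino acid
# assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly spaced) nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGCTCAATT CTCTTCCCCT GTATGATGAA"
print(translate(prefix))  # MLNSLPLYDE
```

The printed residues agree with the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.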