Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sama_0300 |
Symbol | |
ID | 4602556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | + |
Start bp | 362780 |
End bp | 364558 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639779629 |
Product | glycoside hydrolase family protein |
Protein accession | YP_926181 |
Protein GI | 119773441 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4724] Endo-beta-N-acetylglucosaminidase D |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT CCGCTCTCTC TCTCGCTTTG CTGCTTGCTG GCCTTTGCTT GCCCGCTGTG ACCTTTGCTG ATCCCCAGCC GGGAACAGAA AAGGTGGCCG GAGGCATTCT CGACAGTCAG CCTCCCTTTG CCCTGACCAT AGCCCAGGCC AAGGCCTGGT CTCCAAAAGG CGCCACGGCA GACAGCCGCA ATGTGTCTCA GGTGCCCTTG GCATCGCGCT TCAGCGCACC CCTTGCAGGG CAGAAGGTGT CTGAGGCCAA GGTGCTCTAC GCCCCCGATG GCATGAATAA CTTTGCCAAT TACCTGGATA CTCAGGGCCA GTTCAACCTG TACAACTTTA CCCACTGGTC GCAAATCGAT GTGCTCAATT GGTTTGCTGG CACTGCCGAC CTTGGGGTGC AAATTCCCGC CCGCCCCTGG GTGGAAACCG CCCATAAAAA CGGGGTGAAG GTGATAGGCT CTGTCTTTCT CGGGGTGGCC CAATGGGGCG GCAGCGCCGA CAAGGTGGAG GCCTTGCTTG AGCAGGACGC CACAGGAAAC TTTATTGTTG CCGACAAACT TATCCAGATG GCCCGGTTCT ATGGCTTTGA TGGCTGGTTG ATTAATCAGG AAACCGACCT GACCGCCGTG AAGGATGCCG ACAACAACCT GCTGGCGGAT AAAAGCGACC CGGTACGTGG CAGGGAATTG GCTGCGCGTA TGCTCGCCTT CGTGCAATAC CTTACCGCCA ACGCCCCTCA GGGCATGGAA ATCCATTGGT ACGATGCCAT GATTGCCAGC GGCGAGGTGC GTTGGCAAAA CCAGCTTAAT GAACATAACC GGGTTTACCT GCAAGACAGT GCGCAGGACG GCTCGCAGGG CGGTTTACAA AATAAGGTGC GTTCATCCGA TGCCATCTTT CTGAATTACT GGTGGAACAA AGACATGGTC CGTGCGTCGG TGCAAGAGGC CAAAGCCTTG GGCCGTTCTC CCTATGATGT GTATGTGGGA GCCGACCTCT GGCCGAGCCG CAACGCTCAG CGCGCCTTCA GCCGCCATCA GTGGCTCGAT TGGTTATTCG ATGACGGTGA GGCGCTGACC TCCATTGCCC TGTTTGCCCC CAATGTGAAC TTCAATTTTG ACGGCGAGCC CCATACGCCG CCGTTCAGCA ACTTCCGAAA TGACCCAGGC GATGCGGCGC GCTTTTATGC CACAGAAGTG CGGCTGTTCG CCGGGGACGA TATGAATCTC GCCACTGCGG ATGAAGCCGG TTGGAAGGGC GTGGGTGCTT ATCTGCCTGC CAAGTCCACC CTGAACAGCC TGCCGTTTCG CACCAGCTTT AATACCGGAC AGGGTAAGCA GTGGGTGGAA AAGGGTGAGG TTAAGGGCGG CGCCTGGACC GACATGGGCC GTCAGGATTT CTTGCCCACA TGGCAATTTG CCGGTGAAGG GGCGCTGAAG CTCAGTTTTG ATTTTGATAC CGTCTACCAG GGCGGCAGCA GTCTGGCGGT AACAGCCAAA GGAGCTGCCA CAGCGCCGCT TTATGCACTT GATGTGATGC TGAGCGAATC AAGCCAGCTA ACCCTGATAA GCCAGGGGCA GGCCAAGGGC CTGAGTCTCT ATGTTGAAAC CGCCGACGGT GAAAAGCTCT CTTTAGTCCT TGGCGACCAT GCTGACTGGA CGAGTCAGTC ACTCGGGCTT TCGGGTCTTA AAGGGAAAAA GCTGGTGGTT ATCGCACTGC AAGCCGATGG CAGCGCGAGC ATCAATGCCC ATCTTGGCCA TTTGGAGCTG TTGCCATGA
|
Protein sequence | MKKSALSLAL LLAGLCLPAV TFADPQPGTE KVAGGILDSQ PPFALTIAQA KAWSPKGATA DSRNVSQVPL ASRFSAPLAG QKVSEAKVLY APDGMNNFAN YLDTQGQFNL YNFTHWSQID VLNWFAGTAD LGVQIPARPW VETAHKNGVK VIGSVFLGVA QWGGSADKVE ALLEQDATGN FIVADKLIQM ARFYGFDGWL INQETDLTAV KDADNNLLAD KSDPVRGREL AARMLAFVQY LTANAPQGME IHWYDAMIAS GEVRWQNQLN EHNRVYLQDS AQDGSQGGLQ NKVRSSDAIF LNYWWNKDMV RASVQEAKAL GRSPYDVYVG ADLWPSRNAQ RAFSRHQWLD WLFDDGEALT SIALFAPNVN FNFDGEPHTP PFSNFRNDPG DAARFYATEV RLFAGDDMNL ATADEAGWKG VGAYLPAKST LNSLPFRTSF NTGQGKQWVE KGEVKGGAWT DMGRQDFLPT WQFAGEGALK LSFDFDTVYQ GGSSLAVTAK GAATAPLYAL DVMLSESSQL TLISQGQAKG LSLYVETADG EKLSLVLGDH ADWTSQSLGL SGLKGKKLVV IALQADGSAS INAHLGHLEL LP
|
| |