Gene Sama_0300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0300 
Symbol 
ID4602556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp362780 
End bp364558 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content57% 
IMG OID639779629 
Productglycoside hydrolase family protein 
Protein accessionYP_926181 
Protein GI119773441 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4724] Endo-beta-N-acetylglucosaminidase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT CCGCTCTCTC TCTCGCTTTG CTGCTTGCTG GCCTTTGCTT GCCCGCTGTG 
ACCTTTGCTG ATCCCCAGCC GGGAACAGAA AAGGTGGCCG GAGGCATTCT CGACAGTCAG
CCTCCCTTTG CCCTGACCAT AGCCCAGGCC AAGGCCTGGT CTCCAAAAGG CGCCACGGCA
GACAGCCGCA ATGTGTCTCA GGTGCCCTTG GCATCGCGCT TCAGCGCACC CCTTGCAGGG
CAGAAGGTGT CTGAGGCCAA GGTGCTCTAC GCCCCCGATG GCATGAATAA CTTTGCCAAT
TACCTGGATA CTCAGGGCCA GTTCAACCTG TACAACTTTA CCCACTGGTC GCAAATCGAT
GTGCTCAATT GGTTTGCTGG CACTGCCGAC CTTGGGGTGC AAATTCCCGC CCGCCCCTGG
GTGGAAACCG CCCATAAAAA CGGGGTGAAG GTGATAGGCT CTGTCTTTCT CGGGGTGGCC
CAATGGGGCG GCAGCGCCGA CAAGGTGGAG GCCTTGCTTG AGCAGGACGC CACAGGAAAC
TTTATTGTTG CCGACAAACT TATCCAGATG GCCCGGTTCT ATGGCTTTGA TGGCTGGTTG
ATTAATCAGG AAACCGACCT GACCGCCGTG AAGGATGCCG ACAACAACCT GCTGGCGGAT
AAAAGCGACC CGGTACGTGG CAGGGAATTG GCTGCGCGTA TGCTCGCCTT CGTGCAATAC
CTTACCGCCA ACGCCCCTCA GGGCATGGAA ATCCATTGGT ACGATGCCAT GATTGCCAGC
GGCGAGGTGC GTTGGCAAAA CCAGCTTAAT GAACATAACC GGGTTTACCT GCAAGACAGT
GCGCAGGACG GCTCGCAGGG CGGTTTACAA AATAAGGTGC GTTCATCCGA TGCCATCTTT
CTGAATTACT GGTGGAACAA AGACATGGTC CGTGCGTCGG TGCAAGAGGC CAAAGCCTTG
GGCCGTTCTC CCTATGATGT GTATGTGGGA GCCGACCTCT GGCCGAGCCG CAACGCTCAG
CGCGCCTTCA GCCGCCATCA GTGGCTCGAT TGGTTATTCG ATGACGGTGA GGCGCTGACC
TCCATTGCCC TGTTTGCCCC CAATGTGAAC TTCAATTTTG ACGGCGAGCC CCATACGCCG
CCGTTCAGCA ACTTCCGAAA TGACCCAGGC GATGCGGCGC GCTTTTATGC CACAGAAGTG
CGGCTGTTCG CCGGGGACGA TATGAATCTC GCCACTGCGG ATGAAGCCGG TTGGAAGGGC
GTGGGTGCTT ATCTGCCTGC CAAGTCCACC CTGAACAGCC TGCCGTTTCG CACCAGCTTT
AATACCGGAC AGGGTAAGCA GTGGGTGGAA AAGGGTGAGG TTAAGGGCGG CGCCTGGACC
GACATGGGCC GTCAGGATTT CTTGCCCACA TGGCAATTTG CCGGTGAAGG GGCGCTGAAG
CTCAGTTTTG ATTTTGATAC CGTCTACCAG GGCGGCAGCA GTCTGGCGGT AACAGCCAAA
GGAGCTGCCA CAGCGCCGCT TTATGCACTT GATGTGATGC TGAGCGAATC AAGCCAGCTA
ACCCTGATAA GCCAGGGGCA GGCCAAGGGC CTGAGTCTCT ATGTTGAAAC CGCCGACGGT
GAAAAGCTCT CTTTAGTCCT TGGCGACCAT GCTGACTGGA CGAGTCAGTC ACTCGGGCTT
TCGGGTCTTA AAGGGAAAAA GCTGGTGGTT ATCGCACTGC AAGCCGATGG CAGCGCGAGC
ATCAATGCCC ATCTTGGCCA TTTGGAGCTG TTGCCATGA
 
Protein sequence
MKKSALSLAL LLAGLCLPAV TFADPQPGTE KVAGGILDSQ PPFALTIAQA KAWSPKGATA 
DSRNVSQVPL ASRFSAPLAG QKVSEAKVLY APDGMNNFAN YLDTQGQFNL YNFTHWSQID
VLNWFAGTAD LGVQIPARPW VETAHKNGVK VIGSVFLGVA QWGGSADKVE ALLEQDATGN
FIVADKLIQM ARFYGFDGWL INQETDLTAV KDADNNLLAD KSDPVRGREL AARMLAFVQY
LTANAPQGME IHWYDAMIAS GEVRWQNQLN EHNRVYLQDS AQDGSQGGLQ NKVRSSDAIF
LNYWWNKDMV RASVQEAKAL GRSPYDVYVG ADLWPSRNAQ RAFSRHQWLD WLFDDGEALT
SIALFAPNVN FNFDGEPHTP PFSNFRNDPG DAARFYATEV RLFAGDDMNL ATADEAGWKG
VGAYLPAKST LNSLPFRTSF NTGQGKQWVE KGEVKGGAWT DMGRQDFLPT WQFAGEGALK
LSFDFDTVYQ GGSSLAVTAK GAATAPLYAL DVMLSESSQL TLISQGQAKG LSLYVETADG
EKLSLVLGDH ADWTSQSLGL SGLKGKKLVV IALQADGSAS INAHLGHLEL LP