Gene Information |
Locus tag | Shewmr4_1995 |
Symbol | |
ID | 4252568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2373911 |
End bp | 2374978 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638118608 |
Product | arabinan endo-1,5-alpha-L-arabinosidase |
Protein accession | YP_734125 |
Protein GI | 113970332 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
| |
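The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (the reported gene length implies this) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(2373911, 2374978)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Both values match the Gene Length (1068 bp) and Protein Length (355 aa) fields in the record.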
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000026265 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000968585 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAGCTA TGAGAGTCAC ACCGAAAACA GCCCTAAAAC GTCACCTTAA GATGCTCAAT TGTGCGCTGC TAGGCGTATT GGGAACCCTA GGCCAAGCGA GTGCCAAACA GGTGAGTATT CACGATCCTG TGATGGCAAA AGAGGCCGGA CAGTATTATC TCTTCAGCAC TGGCCCCGGC ATTACCTATT ATTCCTCAAA GGATAAAATC CATTGGGAAT TAGCTGGGCG GGTATTCGAA ACCGAGCCTA GCTGGGCAAA GGACGTTGCG CCAGAGTTTA ACGGCCATTT ATGGGCGCCG GATATCATTG AGCATAACGG CTTGTTTTAT CTGTATTACT CAGTATCAGC CTTTGGTAAA AACACCTCGG CCATTGGCGT GACAGTCAAT AAAACCCTCG ACAAAAACTC AAAGGAATAT CAGTGGACAG ATAAGGGGAT CGTTATTCAA TCTGTGCCAA ATCGCGATGC ATGGAACGCG ATTGATCCCA ATATTATTGT CGATGAGCAG GGTACACCTT GGATGAGTTT TGGCTCCTTT TGGCAAGGGT TAAAACTGGT CAAACTCAAT CCAGACTTTA TCTCCATCTC CAAACCAGAG GAGTGGCATA CGCTAGCCAA GTTAGAACGC CCCGCACTGC TTGGTGAAAC CGAGCCAGGC CCTGCTGAAA TCGAAGCGCC GTTTATCTAT AAAAAAGATG ACTATTACTA TCTTTTTGTC TCCTATGGCC TTTGCTGTCG TGGGGACGAC AGCACTTACC ACTTAGCCGT GGGCCGCACC AAAACGGTGA CTGGGCCTTA CCTCGATAAA GAGGGCAAAG ACATGGCGCA AGGCGGCGGT TCGGTTTTAC TGCATGGCAC TAAGGCTTGG CCGGGATTGG GCCATAACAG TGTTTATGCC TTCGATGGCA AAGATTACTT AGTCTTTCAT GCCTACGAAT CGGCTGATAA TGGCTTACAA AAACTCAAGA TGGCGGAACT GAGCTGGCGC CAAGGTTGGC CAGAGGTCGA TCCTAAGGCG CTCAATCAAT ACCAGAGCGT ATTAGTTGCA CCCGAAGGAA CAAAATAA
|
Protein sequence | MPAMRVTPKT ALKRHLKMLN CALLGVLGTL GQASAKQVSI HDPVMAKEAG QYYLFSTGPG ITYYSSKDKI HWELAGRVFE TEPSWAKDVA PEFNGHLWAP DIIEHNGLFY LYYSVSAFGK NTSAIGVTVN KTLDKNSKEY QWTDKGIVIQ SVPNRDAWNA IDPNIIVDEQ GTPWMSFGSF WQGLKLVKLN PDFISISKPE EWHTLAKLER PALLGETEPG PAEIEAPFIY KKDDYYYLFV SYGLCCRGDD STYHLAVGRT KTVTGPYLDK EGKDMAQGGG SVLLHGTKAW PGLGHNSVYA FDGKDYLVFH AYESADNGLQ KLKMAELSWR QGWPEVDPKA LNQYQSVLVA PEGTK
|
| |
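The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup and a GC-content calculation. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
demo = clean_sequence("ATGCCAGCTA TGAGAGTCAC")
print(len(demo))         # 20
print(gc_percent(demo))  # 50.0
```

Passing the full Gene sequence field through the same helpers gives a length and GC percentage that can be compared against the reported Gene Length and GC content fields.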