Gene Information |
Locus tag | Shewmr7_2552 |
Symbol | |
ID | 4257451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | + |
Start bp | 3012431 |
End bp | 3014056 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638123227 |
Product | extracellular solute-binding protein |
Protein accession | YP_738594 |
Protein GI | 114048044 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
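The coordinate and length fields in this record are mutually consistent, which can be verified with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3012431, End bp 3014056
length_bp = gene_length(3012431, 3014056)
print(length_bp, protein_length(length_bp))
```

With the values listed above, this reproduces the reported gene length of 1626 bp and protein length of 541 aa.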
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0329561 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGC TAATAAGACG CCTGTGCCTA TCAACTGCAG TTTTCTGCAT GAGTGGGTTG TTGGTTGCAT GTGGCCCCCA ACGACTCCCT TCCGGTTTGG TCTATTGTTC CGAAGGGAAT CCCGAGTCTT TTAATCCGCA GTTGGTGACC TCTGGCACCA CCATCGACGC CACCTCACAC CAAATTTATA GCCGCTTAGT GGACTATGAT GCGATTTCAG GCCAGCTCGT GCCTGCGCTG GCCACCAGTT GGGCCGAGAG TGACGATGGT TTAAGTTACC GTTTTACCCT CAGGGAAAAT GTTAAGTTTC AACATTCATC CCGTTTTACA CCAAGCCGCG ACTTTAACGC CGACGATGTG CTGTTTTCCT TCAATCGGAT TATCGATAAG CACCACCCTT ACCATGGCGT ATCACGTACC GGTTATCCTT TTTTCCAAAG TATTGGATTT TCAGAACAAG TTAAAAGTGT TGAGAAAATC AATGACCATG AGGTCATCTT TCGTTTAGCA CGCAAAGATG CGTCGTTTTT ATCAAATCTA GCGACAGACT TTGCCGTCAT CCTCTCTAGC GAATATGCCG ATCAGCTACT TGCGCAGGGA CACCCTGAAA ATCTTGATCA TTTCGCCATA GGTACAGGGC CTTTTACCTT AGTGCATTAC GCCAAAAACG AATATGTTCG CTATCGCCGC AATCCCGACT TTTGGGGAGA GCCCGCCAAG GTCGAGATGC TGATCTACGA TATCACCCCT AAAAGCACTG TGAGACTGGC CAAACTAATC GCTGGTGACT GCAGCGTATC GGCCCTGCCC AAAGCGGGAG AACTGCCCGT TATCAAGCAA CATGAACAGC TCAGTATTGA GTCACAACCT GGTCTCAACG TCGCCTTTTG GGCCTTCAAC ACACAAAAGC CTCCTTTAGA TGATGTACGC GTTCGCCGCG CCCTCGCCTA TGCCGTAGAT AAGCAAAATA TCTTACGTGC CGTGTATCAA AACACCGCCA TAGAAGCCAC AGGTGTGTTG CCGCCAGCGT CTTGGGCCTA CGACAGCAAT AAAAAACTAT TAGATTACAA TCCGCAAAAA GCGCGCGATC TGTTAAAAGA AGCGGGAATA AAACATCTCA GCATTGATAT CTGGGCCATG CCTGTCGCCC GCGCCTACAA CCCTAATGCA CTGAAAACTG CCGAATTAAT TCAATCAGAC TTAGCCAATA TTGGCGTAAA AGTGAATATC ATCAGCTACG ACTGGAGCGT TTTTAGCCAA AGATTGAGCC GTGATGAATA TGACTCTGTG CTCATTGGCT GGAACGCCGA TAACAGCGAC CCGGATAACT TCTTCACTCC ATTACTCAGC TGCTCGGCGA TGCAATCAAA CAACAACCGT TCTCGCTGGT GCAATAAAGA GTTTGATGCC ATTCTAGACA GAGCAAGGGA AGTCTCGACT CAAGCCGAAC GTAAAGCGAT TTACCAAGAG GCCGAAGCCT TCTTAGCCGA ACAAGTGCCA ATGTTAAGTT TGGCCCATGC TAAACGTGTC GCCCTAACCC GCAGCAACAT CCACGATATG CAACTGACAC CTTTTGGCGG CATCTCCTTT TCCCATACCA GTCAAGCTGA GCAGGAGACA CACTGA
|
Protein sequence | MSVLIRRLCL STAVFCMSGL LVACGPQRLP SGLVYCSEGN PESFNPQLVT SGTTIDATSH QIYSRLVDYD AISGQLVPAL ATSWAESDDG LSYRFTLREN VKFQHSSRFT PSRDFNADDV LFSFNRIIDK HHPYHGVSRT GYPFFQSIGF SEQVKSVEKI NDHEVIFRLA RKDASFLSNL ATDFAVILSS EYADQLLAQG HPENLDHFAI GTGPFTLVHY AKNEYVRYRR NPDFWGEPAK VEMLIYDITP KSTVRLAKLI AGDCSVSALP KAGELPVIKQ HEQLSIESQP GLNVAFWAFN TQKPPLDDVR VRRALAYAVD KQNILRAVYQ NTAIEATGVL PPASWAYDSN KKLLDYNPQK ARDLLKEAGI KHLSIDIWAM PVARAYNPNA LKTAELIQSD LANIGVKVNI ISYDWSVFSQ RLSRDEYDSV LIGWNADNSD PDNFFTPLLS CSAMQSNNNR SRWCNKEFDA ILDRAREVST QAERKAIYQE AEAFLAEQVP MLSLAHAKRV ALTRSNIHDM QLTPFGGISF SHTSQAEQET H
|
| |
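The gene and protein sequences are displayed in space-separated blocks of 10 characters. A minimal sketch for joining those blocks into a contiguous string and computing the GC fraction follows; the helper names are hypothetical, and the result for the full gene sequence can be compared with the GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two blocks of the gene sequence, used here only as a short sample
sample = clean_sequence("ATGAGTGTGC TAATAAGACG")
print(len(sample), gc_fraction(sample))
```

Passing the complete "Gene sequence" field through `clean_sequence` should yield a string whose length matches the gene length listed in the record.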