Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_2484 |
Symbol | |
ID | 4253055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2951784 |
End bp | 2953409 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638119116 |
Product | extracellular solute-binding protein |
Protein accession | YP_734612 |
Protein GI | 113970819 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0213722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTGC TAATAAGACG CCTGTGCCTA TCAACTGCAG TTTTCTGCAT GAGTGGGTTG TTGGTTGCAT GTGGCCCCCA ACGACTCCCT TCCGGTTTGG TCTATTGTTC CGAAGGGAAT CCCGAGTCTT TTAATCCGCA GTTGGTGACC TCTGGCACCA CCATCGACGC CACCTCACAC CAAATTTATA GCCGCTTAGT GGACTATGAT GCGATTTCAG GCCAGCTCGT GCCTGCGCTG GCCACCCGTT GGGCCGAGAG TGACGATGGT TTAAGCTATC GTTTCACTCT AAGGGAAAAT GTTAAGTTTC AACATTCATC CCGTTTTACA CCAAGCCGCG ACTTTAACGC CGACGATGTG CTGTTTTCCT TCAATCGGAT TATCGATAAG CATCACCCTT ACCATGGCGT ATCACGTACC GGTTATCCTT TTTTCCAAAG TATTGGATTT TCAGAACAAG TCAAAAGTGT CGAAAAAATC AATGACCATG AGGTCATCTT TCGTTTAGCA CGCAAAGATG CGTCGTTTTT ATCAAATCTA GCGACAGACT TTGCCGTCAT CCTCTCTAGC GAATATGCCG ATCAGCTACT TGCGCAGGGG CACCCTGAAA ATCTTGATCA TTTCGCCATA GGTACAGGGC CTTTTACCTT AGTGCATTAC GCCAAAAACG AATATGTTCG CTATCGCCGC AATCCCGACT TTTGGGGCGA GCCCGCCAAA GTCGATATGC TGGTGTACGA TATCACCCCA AAAAGCACTG TCAGACTGGC CAAACTTATC GCTGGTGATT GCAGCGTATC GGCCCTGCCC AAAGCGGGTG AGTTGCCCGT TATCAAGCAA CATGAACAGC TGAGCATTGA GTCTCAACCC GGTCTTAACG TAGCCTTTTG GGCCTTTAAC ACACAAAAGC CCCCTTTAGA TGATGTACGC GTGCGCCGCG CCCTCGCCTA TGCCGTAGAT AAGCAAAATA TCTTACGGGC CGTGTATCAA AATACCGCCA TAGAAGCCAT AGGGGTGTTG CCGCCAGCGT CTTGGGCCTA CGACAGCAAT AAAAAACTGT TAGATTACAA TCCACAAAAA GCGCGTGATT TGTTAAAAGA AGCGGGAATA AAACATCTCA GCATCGATAT TTGGGCCATG CCCGTTGCCC GCGCTTACAA CCCTAATGCG TTGAAAACCG CCGAGTTAAT TCAATCTGAC TTGGCCAATA TCGGTGTTAA GGTCAATATC ATCAGTTACG ATTGGAGCGT CTTTAGCCAA AGATTGAGCC GGGATGAATA TGACTCAGTC CTCATCGGCT GGAACGCCGA TAACAGCGAC CCCGATAACT TCTTCACGCC ATTGCTGAGT TGCTCGGCGA TGCAATCAAA CAACAACCGC TCTCGCTGGT GCAATAAAGA GTTTGATGCC ATTCTAGACA GAGCGAGGGA AGTCTCGACG CAGGCCGAGC GCAAGGAAAT TTACCAAGAG GCCGAGGCCT TCTTAGCCGA GCAAGTGCCA ATGTTGAGTC TGGCCCACGC CAAGCGAGTC GCACTCACTC GCAGCAATAT CCATGATATG CAGTTAACCC CCTTTGGCGG CATCTCATTT GCCCGAACCA GCCAAGCTGA GCAGGAGACA CACTGA
|
Protein sequence | MSVLIRRLCL STAVFCMSGL LVACGPQRLP SGLVYCSEGN PESFNPQLVT SGTTIDATSH QIYSRLVDYD AISGQLVPAL ATRWAESDDG LSYRFTLREN VKFQHSSRFT PSRDFNADDV LFSFNRIIDK HHPYHGVSRT GYPFFQSIGF SEQVKSVEKI NDHEVIFRLA RKDASFLSNL ATDFAVILSS EYADQLLAQG HPENLDHFAI GTGPFTLVHY AKNEYVRYRR NPDFWGEPAK VDMLVYDITP KSTVRLAKLI AGDCSVSALP KAGELPVIKQ HEQLSIESQP GLNVAFWAFN TQKPPLDDVR VRRALAYAVD KQNILRAVYQ NTAIEAIGVL PPASWAYDSN KKLLDYNPQK ARDLLKEAGI KHLSIDIWAM PVARAYNPNA LKTAELIQSD LANIGVKVNI ISYDWSVFSQ RLSRDEYDSV LIGWNADNSD PDNFFTPLLS CSAMQSNNNR SRWCNKEFDA ILDRAREVST QAERKEIYQE AEAFLAEQVP MLSLAHAKRV ALTRSNIHDM QLTPFGGISF ARTSQAEQET H
|
| |