Gene Shewmr4_2484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2484 
Symbol 
ID4253055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2951784 
End bp2953409 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content50% 
IMG OID638119116 
Productextracellular solute-binding protein 
Protein accessionYP_734612 
Protein GI113970819 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0213722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGC TAATAAGACG CCTGTGCCTA TCAACTGCAG TTTTCTGCAT GAGTGGGTTG 
TTGGTTGCAT GTGGCCCCCA ACGACTCCCT TCCGGTTTGG TCTATTGTTC CGAAGGGAAT
CCCGAGTCTT TTAATCCGCA GTTGGTGACC TCTGGCACCA CCATCGACGC CACCTCACAC
CAAATTTATA GCCGCTTAGT GGACTATGAT GCGATTTCAG GCCAGCTCGT GCCTGCGCTG
GCCACCCGTT GGGCCGAGAG TGACGATGGT TTAAGCTATC GTTTCACTCT AAGGGAAAAT
GTTAAGTTTC AACATTCATC CCGTTTTACA CCAAGCCGCG ACTTTAACGC CGACGATGTG
CTGTTTTCCT TCAATCGGAT TATCGATAAG CATCACCCTT ACCATGGCGT ATCACGTACC
GGTTATCCTT TTTTCCAAAG TATTGGATTT TCAGAACAAG TCAAAAGTGT CGAAAAAATC
AATGACCATG AGGTCATCTT TCGTTTAGCA CGCAAAGATG CGTCGTTTTT ATCAAATCTA
GCGACAGACT TTGCCGTCAT CCTCTCTAGC GAATATGCCG ATCAGCTACT TGCGCAGGGG
CACCCTGAAA ATCTTGATCA TTTCGCCATA GGTACAGGGC CTTTTACCTT AGTGCATTAC
GCCAAAAACG AATATGTTCG CTATCGCCGC AATCCCGACT TTTGGGGCGA GCCCGCCAAA
GTCGATATGC TGGTGTACGA TATCACCCCA AAAAGCACTG TCAGACTGGC CAAACTTATC
GCTGGTGATT GCAGCGTATC GGCCCTGCCC AAAGCGGGTG AGTTGCCCGT TATCAAGCAA
CATGAACAGC TGAGCATTGA GTCTCAACCC GGTCTTAACG TAGCCTTTTG GGCCTTTAAC
ACACAAAAGC CCCCTTTAGA TGATGTACGC GTGCGCCGCG CCCTCGCCTA TGCCGTAGAT
AAGCAAAATA TCTTACGGGC CGTGTATCAA AATACCGCCA TAGAAGCCAT AGGGGTGTTG
CCGCCAGCGT CTTGGGCCTA CGACAGCAAT AAAAAACTGT TAGATTACAA TCCACAAAAA
GCGCGTGATT TGTTAAAAGA AGCGGGAATA AAACATCTCA GCATCGATAT TTGGGCCATG
CCCGTTGCCC GCGCTTACAA CCCTAATGCG TTGAAAACCG CCGAGTTAAT TCAATCTGAC
TTGGCCAATA TCGGTGTTAA GGTCAATATC ATCAGTTACG ATTGGAGCGT CTTTAGCCAA
AGATTGAGCC GGGATGAATA TGACTCAGTC CTCATCGGCT GGAACGCCGA TAACAGCGAC
CCCGATAACT TCTTCACGCC ATTGCTGAGT TGCTCGGCGA TGCAATCAAA CAACAACCGC
TCTCGCTGGT GCAATAAAGA GTTTGATGCC ATTCTAGACA GAGCGAGGGA AGTCTCGACG
CAGGCCGAGC GCAAGGAAAT TTACCAAGAG GCCGAGGCCT TCTTAGCCGA GCAAGTGCCA
ATGTTGAGTC TGGCCCACGC CAAGCGAGTC GCACTCACTC GCAGCAATAT CCATGATATG
CAGTTAACCC CCTTTGGCGG CATCTCATTT GCCCGAACCA GCCAAGCTGA GCAGGAGACA
CACTGA
 
Protein sequence
MSVLIRRLCL STAVFCMSGL LVACGPQRLP SGLVYCSEGN PESFNPQLVT SGTTIDATSH 
QIYSRLVDYD AISGQLVPAL ATRWAESDDG LSYRFTLREN VKFQHSSRFT PSRDFNADDV
LFSFNRIIDK HHPYHGVSRT GYPFFQSIGF SEQVKSVEKI NDHEVIFRLA RKDASFLSNL
ATDFAVILSS EYADQLLAQG HPENLDHFAI GTGPFTLVHY AKNEYVRYRR NPDFWGEPAK
VDMLVYDITP KSTVRLAKLI AGDCSVSALP KAGELPVIKQ HEQLSIESQP GLNVAFWAFN
TQKPPLDDVR VRRALAYAVD KQNILRAVYQ NTAIEAIGVL PPASWAYDSN KKLLDYNPQK
ARDLLKEAGI KHLSIDIWAM PVARAYNPNA LKTAELIQSD LANIGVKVNI ISYDWSVFSQ
RLSRDEYDSV LIGWNADNSD PDNFFTPLLS CSAMQSNNNR SRWCNKEFDA ILDRAREVST
QAERKEIYQE AEAFLAEQVP MLSLAHAKRV ALTRSNIHDM QLTPFGGISF ARTSQAEQET
H