Gene Ssed_1997 details


Gene Information

Locus tag: Ssed_1997
Symbol:
ID: 5612847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 2414974
End bp: 2416587
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 45%
IMG OID: 640932883
Product: peptidase M28
Protein accession: YP_001473734
Protein GI: 157375134
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
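
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2414974, 2416587)
print(length, protein_length_aa(length))  # prints: 1614 537
```

Under these assumptions the computed values agree with the 1614 bp and 537 aa reported in the record.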


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.801062
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000223343
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAGTA GAAATATCTT GTTAATGACG TTCTTTTCAA CGTTATTTTT AACGGCATGC 
GGGCAACAAG TATCAAAAGT GCCAGCCACA GAAGTGAACC TAGATAATGC TCATCGCCAT
ATAAAAATAC TGGCTTCCGA TGAATTTGCG GGTCGAGGAC CACTGACTGA AGGGGAAAGA
GTCACCATTG ATTATTTGGC GAAACAATAT CGCGAAATAG GATTAACCGG GGGCAATAAA
GGAAGTTTCT TCCAAACCGT CCCCATGGCC AAGCTCACCT CAGATCAGTC CATGCTTTTA
ACCATTGGTG ATCTTGAGTT TCAGGCGGGA AAAGATTTCA CCGCCCGTAC TCAGCAGATG
AATACAGATA TCAAACTCAA TGATTCGCAG GTGGTATTTG TTGGGTACGG CATCAATGCC
CCTGAATATC AGTGGAATGA TTATCGTGAC ATTGACGTAA CAGATAAAAC GGTCATTGTT
TTAGTCAATG ATCCAGGCTT CGCGACACAA GACGATAATC TATTTACCGG CAACGCCATG
ACCTATTATG GACGTTGGAC CTATAAGTAT GAGGAAGCGG CCAGACAAGG GGCAAAAGCG
GTATTGATAG TCCATGAAAC AGCCCCTGCC GCCTATCCAT GGAGTGTTGT AGAGAGCTCA
AACACTGGCA GCAAATACAC CTTAGTAGAC GATGAGAATA ACAAGAGTAA GCTGCCAATC
ATGGGCTGGT TACAGCATGA AGCCACGAAA CAGGTTTTTG CTCAAGCGGG TCTGGATTTT
GACACATTAA AACACCAGGC TATCAGCCGT AACTTTAAGC CTATTCCTTT TAAGCAAACT
GCCAGGTTAA ACCTGACAAC GGCCATTGAA CATGCAGAGT CAAACAATGT TCTTGCGTTA
ATTAAAGGCA GCAAAAGGCC CGATGAAACC ATTATCGTCA GTGCCCATTG GGATCATTTT
GGTACACAAC AAACACCTGA GGGTCCCAAA ATTTATAATG GCGCCATCGA TAATGCCACC
GGTGTGGCGG GCACACTGGA AATTGCACGC ATACTGATGG AGCGCCACAA AATTAAGCCG
TTTGAGCGTT CAATTCTCTT TGCCAATTTT ACCGCAGAGG AAACGGGCAT GATTGGGTCC
GAACTCTTTG CTCGTGGCAC CGTTTTACCC ACTAAAAACA TCGTAGGCTT ACTGAATATC
GATGGTATGA ACGCCTTAGA CGGAGTCGAC TATGTATTAC AATATGGCAA AAACATGTCA
GAACTTGAGG GGTATCTGCA AAAGGCGGCC AGTAAACAGC AACGACACGT TAAGCTCGAT
CCAAGACCGC AAAACGGCCT CTTCTTTCGT TCCGACCACT TCTCATTAGC TAAACAAGGA
GTTCCAGGTT TACTGTTTAT GAGTCTGGGG GATACCGATC CTGACTATAT CAGTCACAGA
TACCATAAAG AGGCCGACGA CTATTCAGTT TCCTGGTCAT TAGGTGGCAT GCAACAGGAG
ATCGCCTTGA TTGTCGATAT CGCCAGTGAG CTGGCGACCA ACGATGATTG GCCTAAGTGG
AAAGCCGATT CTGACTTTAA GAAGCGCAGA GCCGAAGACA TGAGCAAGAT GTAA
 
Protein sequence
MNSRNILLMT FFSTLFLTAC GQQVSKVPAT EVNLDNAHRH IKILASDEFA GRGPLTEGER 
VTIDYLAKQY REIGLTGGNK GSFFQTVPMA KLTSDQSMLL TIGDLEFQAG KDFTARTQQM
NTDIKLNDSQ VVFVGYGINA PEYQWNDYRD IDVTDKTVIV LVNDPGFATQ DDNLFTGNAM
TYYGRWTYKY EEAARQGAKA VLIVHETAPA AYPWSVVESS NTGSKYTLVD DENNKSKLPI
MGWLQHEATK QVFAQAGLDF DTLKHQAISR NFKPIPFKQT ARLNLTTAIE HAESNNVLAL
IKGSKRPDET IIVSAHWDHF GTQQTPEGPK IYNGAIDNAT GVAGTLEIAR ILMERHKIKP
FERSILFANF TAEETGMIGS ELFARGTVLP TKNIVGLLNI DGMNALDGVD YVLQYGKNMS
ELEGYLQKAA SKQQRHVKLD PRPQNGLFFR SDHFSLAKQG VPGLLFMSLG DTDPDYISHR
YHKEADDYSV SWSLGGMQQE IALIVDIASE LATNDDWPKW KADSDFKKRR AEDMSKM
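
The correspondence between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the set of permitted start codons). The helper names (clean, translate, gc_fraction) are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


prefix = "ATGAATAGTA GAAATATCTT GTTAATGACG"
print(translate(prefix))  # prints: MNSRNILLMT
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to gc_fraction would give a value to compare with the GC content reported in the record.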