Gene Ssed_3969 details

Gene Information

Locus tag: Ssed_3969
Symbol:
ID: 5614206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 4855081
End bp: 4856253
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 49%
IMG OID: 640934923
Product: hypothetical protein
Protein accession: YP_001475701
Protein GI: 157377101
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2039] Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase)
TIGRFAM ID:
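
The length fields above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention, which this page does not state) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4855081, 4856253)
print(length)                     # 1173
print(protein_length_aa(length))  # 390
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.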


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAAAT CCGTGATGGC GATTGCAGCT GTAGGCGCTC TGGCCTTTAG CATCGCCTGT 
TCAGCCACCC CTGATGCTAT GACCCCCCTG AATATTGAAG AGCTGAGACT CGAAGCAGCA
AACCAGACGA TACCCGATGT TGTCACTCGC TATCAGGCCT TAGTTGAGAG CCTGGACAAT
AAATACCAGC AGCAAACAGA CGAGCTTGAA ATGACCAAGA TGGTCGCACG TCAGGGTGAA
AGGTTATGGC AACAAGCCGT TAGAGACATA CAATCAGGCA GTATCGATGA CAGGCCGCTC
TACTGGAGTC GCCTCGCCAT GCAGCGTCAA TTGAAGCGCA GTCGCCCCGC ATTTAACATG
GCCTCCTGGC AGCAAGAGGT TTTACTGAGC GCCGTTGAAA AATCTTCCAG AGGCTTTAGC
AATGTGCAAT TCTCGAATGA CACCGAGGTA AAGATACTGC TAACAGGTTT CGATCCCTTC
TTCCTCGATC GTAATGTCGG CCAGAGTAAC CCCTCGGGCT TAGTTGCCTT GTCCTTAGAT
GGTTTTGAAT TTTCCGCTAA CGGCAAGAAA GCACAGATTG AAACTGTGAT GATACCCGTT
CGTTTTGCCG ATTTTGATGA GGGAATAATA GAGTCACTAC TGACTCCTGT TTACCGTGAA
AACAGTGTAG ATATGATTAT TACTGTCAGC ATGGGCCGTG ATGATTTCGA TCTGGAGCGC
TTCCCGGGTC GAAACCGAAG TGCTGCCGCT CCCGATAATT TGAACCTGCT CACAGGTGCA
AATAAGCAAA AGCCAATGGC ACCTTTGTTT AATCGTAAAA CATTAAACGG CCCGGAATTT
GTCGAGTTCT CTCTCCCTGT GGCGGCGATG CAATCGGTAA GTGGTAATTG GAAAGTAAAT
GATAACCATA AGGTTACTTC GGTTGCTAAA GGTGAGTTTA CCGCTTCAAC GCTATCTGAA
CTTCAAAGCG AAACTTCGGT TGAAGGCTCA GGTGGAGGCT ACCTTTCCAA TGAAATATCT
TACCGTGCGG TACTGCTGGG GCAACAGTTT AATAGCAGTA TTGCAGTCGG ACACATACAT
ACCCCCAGAG TTTCAGGCCA CGACCCGAAA GTTGAAGCCG AGATCATGGC GCAGGTTAAA
GCTATGGTCA TAGCAGGCGC CGCTGCTCTA TAA
 
Protein sequence
MTKSVMAIAA VGALAFSIAC SATPDAMTPL NIEELRLEAA NQTIPDVVTR YQALVESLDN 
KYQQQTDELE MTKMVARQGE RLWQQAVRDI QSGSIDDRPL YWSRLAMQRQ LKRSRPAFNM
ASWQQEVLLS AVEKSSRGFS NVQFSNDTEV KILLTGFDPF FLDRNVGQSN PSGLVALSLD
GFEFSANGKK AQIETVMIPV RFADFDEGII ESLLTPVYRE NSVDMIITVS MGRDDFDLER
FPGRNRSAAA PDNLNLLTGA NKQKPMAPLF NRKTLNGPEF VEFSLPVAAM QSVSGNWKVN
DNHKVTSVAK GEFTASTLSE LQSETSVEGS GGGYLSNEIS YRAVLLGQQF NSSIAVGHIH
TPRVSGHDPK VEAEIMAQVK AMVIAGAAAL
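
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sense-codon assignments of NCBI translation table 11, which match the standard genetic code (the table differs mainly in its permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Amino acids in NCBI order: bases cycle T, C, A, G with the third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGACTAAAT CCGTGATGGC GATTGCAGCT GTAGGCGCTC TGGCCTTTAG CATCGCCTGT"
print(translate(first_line))  # MTKSVMAIAAVGALAFSIAC
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.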