Gene Shewmr4_0693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0693 
Symbol 
ID4251537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp800343 
End bp801500 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content52% 
IMG OID638117256 
Producthypothetical protein 
Protein accessionYP_732830 
Protein GI113969037 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2039] Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.26756e-05 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.897132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAGC CCAGCCTCGT TTTTATCCTC GCCAGCACAG TGGCGCAAAC CGCAGCTGCC 
GTACAACTCC TAGGAGATGT GGAAGTTTCC CGCATTCCTA CTGCAGAAAA AACCATGGCA
GAAGTGGTCA ACCGCTATCA AGCCTTGGAC GAAGGCTTGG CGACTCAGCT TTCCGCACAG
AAGAATGAAC GCGATGCAAC CCAACTTGCA GCCCGCCAAG GCCACAGACT GTGGCAACAG
GCGGTGCGTG ATGTGCAGTC AGGTCACTTT GACGACAGAT CCCTCTACTG GGCTCGGCTC
TCAATGTTAA ATAGCATCAA GAGCAATAGC GCCAATTTCA AAATGGCCGA TTGGCAACAG
AATATTTTAG CCAGCGCCGT CGAGAAGGCA TCTCGCGGTT TTAGCGATAT CCAATTCGGC
GACGATGTTC AGATAAAAAT CTTCCTGACG GGATTCGACC CTTTCTTCCT CGATAAAGAC
ATCAGCCAGA GCAATCCTTC AGGCTTGGTC GCCCTTGCCC TCGATGGTTT TAGGTTTGAT
ATCAACGGCA AAAAAGCCCA AATCGAAACC GCGATGATCC CAGTGCGCTT CGAGGATTTT
GACCAAGGCA TTATCGAATC CTTACTCAGC CCCATTTATC GCGATCCTAA AACCCAGTTT
GTCTTTACCG TCAGCATGGG CCGCAGTGAC TTTGATATTG AACGCTTCCC CGGCCGTAAC
CGTAGCGCCG CCGCGCCGGA TAACCAAAAT CTGTACACAG GCGGAAGCAA AACCGCGCCT
GTCGCCCCCA AACTCAATGG TAAAGACTTT ATCGGCCCTG AGTTTGTTGA GTTTTCACTG
CCCGTCGCCG CCATGCAGGT CAAAGACGGC CAATGGAAAG TCAACGACAA CCATACAGTG
ACCACCCTAG CCCGCGGCGA ATTTAATGCC AGCTCCCTAA ACGAGCTGCA AAATGAAACC
TCGGTCGAAG GTTCTGGCGG TGGGTATCTC TCAAACGAGA TTTCTTATCG CGCCATTGTG
TTACAGCAAA AGTTCAACAG CCCAGCCAAG GTCGGCCATA TCCACACCCC AAGGGTGAAG
GGCTACGACA ATGCCACTGA ACAAGCCATC GTCGAGCAAG TGCGCACTAT GGTGATGCAA
GCTGCGGCGA GCCTGTAA
 
Protein sequence
MLKPSLVFIL ASTVAQTAAA VQLLGDVEVS RIPTAEKTMA EVVNRYQALD EGLATQLSAQ 
KNERDATQLA ARQGHRLWQQ AVRDVQSGHF DDRSLYWARL SMLNSIKSNS ANFKMADWQQ
NILASAVEKA SRGFSDIQFG DDVQIKIFLT GFDPFFLDKD ISQSNPSGLV ALALDGFRFD
INGKKAQIET AMIPVRFEDF DQGIIESLLS PIYRDPKTQF VFTVSMGRSD FDIERFPGRN
RSAAAPDNQN LYTGGSKTAP VAPKLNGKDF IGPEFVEFSL PVAAMQVKDG QWKVNDNHTV
TTLARGEFNA SSLNELQNET SVEGSGGGYL SNEISYRAIV LQQKFNSPAK VGHIHTPRVK
GYDNATEQAI VEQVRTMVMQ AAASL