Gene Information |
Locus tag | Shewmr4_0693 |
Symbol | |
ID | 4251537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 800343 |
End bp | 801500 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638117256 |
Product | hypothetical protein |
Protein accession | YP_732830 |
Protein GI | 113969037 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2039] Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase) |
TIGRFAM ID | |
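
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(800343, 801500)
print(length_bp)                  # 1158, matches "Gene Length"
print(protein_length(length_bp))  # 385, matches "Protein Length"
```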
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000226756 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.897132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAGC CCAGCCTCGT TTTTATCCTC GCCAGCACAG TGGCGCAAAC CGCAGCTGCC GTACAACTCC TAGGAGATGT GGAAGTTTCC CGCATTCCTA CTGCAGAAAA AACCATGGCA GAAGTGGTCA ACCGCTATCA AGCCTTGGAC GAAGGCTTGG CGACTCAGCT TTCCGCACAG AAGAATGAAC GCGATGCAAC CCAACTTGCA GCCCGCCAAG GCCACAGACT GTGGCAACAG GCGGTGCGTG ATGTGCAGTC AGGTCACTTT GACGACAGAT CCCTCTACTG GGCTCGGCTC TCAATGTTAA ATAGCATCAA GAGCAATAGC GCCAATTTCA AAATGGCCGA TTGGCAACAG AATATTTTAG CCAGCGCCGT CGAGAAGGCA TCTCGCGGTT TTAGCGATAT CCAATTCGGC GACGATGTTC AGATAAAAAT CTTCCTGACG GGATTCGACC CTTTCTTCCT CGATAAAGAC ATCAGCCAGA GCAATCCTTC AGGCTTGGTC GCCCTTGCCC TCGATGGTTT TAGGTTTGAT ATCAACGGCA AAAAAGCCCA AATCGAAACC GCGATGATCC CAGTGCGCTT CGAGGATTTT GACCAAGGCA TTATCGAATC CTTACTCAGC CCCATTTATC GCGATCCTAA AACCCAGTTT GTCTTTACCG TCAGCATGGG CCGCAGTGAC TTTGATATTG AACGCTTCCC CGGCCGTAAC CGTAGCGCCG CCGCGCCGGA TAACCAAAAT CTGTACACAG GCGGAAGCAA AACCGCGCCT GTCGCCCCCA AACTCAATGG TAAAGACTTT ATCGGCCCTG AGTTTGTTGA GTTTTCACTG CCCGTCGCCG CCATGCAGGT CAAAGACGGC CAATGGAAAG TCAACGACAA CCATACAGTG ACCACCCTAG CCCGCGGCGA ATTTAATGCC AGCTCCCTAA ACGAGCTGCA AAATGAAACC TCGGTCGAAG GTTCTGGCGG TGGGTATCTC TCAAACGAGA TTTCTTATCG CGCCATTGTG TTACAGCAAA AGTTCAACAG CCCAGCCAAG GTCGGCCATA TCCACACCCC AAGGGTGAAG GGCTACGACA ATGCCACTGA ACAAGCCATC GTCGAGCAAG TGCGCACTAT GGTGATGCAA GCTGCGGCGA GCCTGTAA
|
Protein sequence | MLKPSLVFIL ASTVAQTAAA VQLLGDVEVS RIPTAEKTMA EVVNRYQALD EGLATQLSAQ KNERDATQLA ARQGHRLWQQ AVRDVQSGHF DDRSLYWARL SMLNSIKSNS ANFKMADWQQ NILASAVEKA SRGFSDIQFG DDVQIKIFLT GFDPFFLDKD ISQSNPSGLV ALALDGFRFD INGKKAQIET AMIPVRFEDF DQGIIESLLS PIYRDPKTQF VFTVSMGRSD FDIERFPGRN RSAAAPDNQN LYTGGSKTAP VAPKLNGKDF IGPEFVEFSL PVAAMQVKDG QWKVNDNHTV TTLARGEFNA SSLNELQNET SVEGSGGGYL SNEISYRAIV LQQKFNSPAK VGHIHTPRVK GYDNATEQAI VEQVRTMVMQ AAASL
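
The two sequences above can be checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), that the spaces only group the residues in blocks of 10, and that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 is assumed to
# share these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTAAAGC CCAGCCTCGT TTTTATCCTC"))  # MLKPSLVFIL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces), and rounding `gc_fraction` to a percentage, would give a quick consistency check on the record.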