Gene Information |
Locus tag | Spro_3431 |
Symbol | |
ID | 5604395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 3800716 |
End bp | 3802311 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640938984 |
Product | extracellular solute-binding protein |
Protein accession | YP_001479657 |
Protein GI | 157371668 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
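The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported numbers but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3800716
end_bp = 3802311
reported_gene_length_bp = 1596
reported_protein_length_aa = 531


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under these assumptions, the span 3800716 to 3802311 gives 1596 bp, and 1596 / 3 minus one stop codon gives 531 residues, matching the table.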
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000878954 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAAAA AAGCAATAAC AATGACGTTG GGACTATTGG CTCTAACGGT GATGGGCGGC GCTCGCGCCA GCACGCTGGT TTATTGTTCC GAAAGCTCGC CGGAAGGGTT TAATCCGCAG TTGTTTACCT CCGGCCCGAC GGTAGATGCC AGTTCGGCGA CGATTTACAA CCGGCTGGTG GACTTTAAAG TCGGTACCGT CGAGCTGCAA CGGAGCCTGG CGGAAAGCTG GCAGGTGAGC GAAGACGGAA AGCAATACAC TTTTCATCTG CGCAAAGGGG TGAAATTCCA GAGCAATAAA TACTTTACCC CCACCCGCGA CTTCAACGCC GACGACGTTA TCTTCTCCTT TATGCGGCAA AAAGACGCCA ACAACCCCTA TCACAAGGTC TCCAACGGCG CTTACACCAA CTTTGAGTCG ATGGAGTTTG GCACGCTGAT CAACAACATT GTCAAAGTGG ATGACAACAC CGTGCGTTTT GAGCTTTCGC GTGCCGAAGC GCCGTTTGTG GCCGATCTCG GCATGTATTT CGCCACCATT CTCTCTGCCG AATATGCCGA TGCGATGCTG AAGGCCGGCA CGCCACAGCG GGTGGATAAC GATCCTGTCG GCACCGGGCC ATTCCAACTG GTGCAGTATC AGAAGGATGC GAAGATCCTC TACAAGGCGT TCGACCAGTA TTGGGAGGGT AAACCGAAGA TAGACCGGCT GGTGTTCTCG ATCACCCCGG ATGCGGCGGT TCGCTTTGCC AAACTGCAAA AGAATGAATG TCAGGTGATG CCGTTCCCCA ACCCGGCGGA TCTCAAGCGC ATGCGCGAGG ACAAAAATAT CCAGGTGATG GAGAAATCCG GGTTAAACAT CGGGTTCCTG GCGTTCAATA CCCAGAAAAA ACCGCTGGAT AACCTCAAGG TGCGGCAGGC GCTGGCGCTG GCGGTGAACA AGCCGGCGAT CCTCGAGGCG GTATTCCACG GCGCAGGCCA GCCGGCGAAG AACCTGTTGC CACCAACCCA GTGGGGCAGC AATGACCAGA TTGAGGACTA TCCGTACTCC CCGGAAAAAG CCAAACGGTT GTTGCAGGAG GCCGGACTGG GGCAGGGGTT TGCCATCGAT CTGTGGGCGA TGCCGGTACA GCGGCCGTAC AATCCGAACG CCAAACGCAT GGCGGAGATG ATCCAGGCCG ACTGGGCGAA AATCGGCGTG AAGGCCAAGA TTGTTACCTT CGAGTGGGGC GAATATTTGC AGCGGATAAA AAATGGAGAG CATCAGGCCG CGTTGATGGG CTGGACCACC GCCAACGGCG ATCCGGATAA CTTCTTCGGC CCGCTGTTTA CCTGCGTTTC GGCCAACGGC GGTTCAAACT CGGCCAAATG GTGTTATCCG CCGTTCGATA AGCTGATCCA GCAGGCGCGT GAAGAGAACG ATCACGCCAA ACGGGTGGCG ATGTATCAGC AGGCGCAGGT GATGATGCAT GACCAAATGC CGGCGATGAT GATTGCCCAC TCAACCATTT TTGAGCCGGT ACGCAAGGAA GTGAAAGGCT ACGAGATTGA CCCGTTTGGC AAGCACATCT TCAAACAGGT TTCGCTGGAG AAATAA
|
Protein sequence | MNKKAITMTL GLLALTVMGG ARASTLVYCS ESSPEGFNPQ LFTSGPTVDA SSATIYNRLV DFKVGTVELQ RSLAESWQVS EDGKQYTFHL RKGVKFQSNK YFTPTRDFNA DDVIFSFMRQ KDANNPYHKV SNGAYTNFES MEFGTLINNI VKVDDNTVRF ELSRAEAPFV ADLGMYFATI LSAEYADAML KAGTPQRVDN DPVGTGPFQL VQYQKDAKIL YKAFDQYWEG KPKIDRLVFS ITPDAAVRFA KLQKNECQVM PFPNPADLKR MREDKNIQVM EKSGLNIGFL AFNTQKKPLD NLKVRQALAL AVNKPAILEA VFHGAGQPAK NLLPPTQWGS NDQIEDYPYS PEKAKRLLQE AGLGQGFAID LWAMPVQRPY NPNAKRMAEM IQADWAKIGV KAKIVTFEWG EYLQRIKNGE HQAALMGWTT ANGDPDNFFG PLFTCVSANG GSNSAKWCYP PFDKLIQQAR EENDHAKRVA MYQQAQVMMH DQMPAMMIAH STIFEPVRKE VKGYEIDPFG KHIFKQVSLE K
|
| |
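The sequences above are printed in space-separated blocks of 10 letters. A minimal sketch of how such a field might be normalized and sanity-checked follows. The helper names are hypothetical, the example uses only the first two blocks of the gene sequence, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not cover.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends with one of the given stop codons (simplified check)."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stop_codons


# Excerpt only: the first two blocks of the gene sequence above.
excerpt = normalize_sequence("ATGAATAAAA AAGCAATAAC")
print(len(excerpt), gc_fraction(excerpt))
```

Applying `normalize_sequence` to the full Gene sequence field and then calling `gc_fraction` and `looks_like_complete_cds` would allow a comparison against the reported GC content and gene length.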