Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmwyl1_3908 |
Symbol | |
ID | 5368266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 4407147 |
End bp | 4408448 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640806296 |
Product | proline dipeptidase |
Protein accession | YP_001342740 |
Protein GI | 152997905 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.969925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCCTT CTATGACCAC ACTCTATCAA GAACATCTTG CCGTTCTCTG CAATCGATAT GAAAACGCTA TGGCGCATTT TTGCCTAGAC GCCATTGTTT TAGCCTCAGG GGAAAAAAGC TACTATTTCC AAGATGATCA CACCCATCCT TTTCATGCCT ATTCTGGCGC ACAACAGTGG TTGCCATTTA CACTAGGGGC AGAAACCTAT GTCATCTTAA AAACGGGCGA AAAACCGACG TTAATTTGGC CTGTTCGTGA CGATTTCTGG CACGCTCCCA ATCCTGTTCC TAGCGGCGAT TGGCAACAAA ACTGGAACAT TCTAACGGCG AAGAAAACAG ACAAATGGTT CTCCGATTTA CCCAATAAAA CCGCTTGGCT TGGTCCGCAA GCAATGCCAT TTGCTATCGT TTGTGAAGGA TTAAAAGCTT ATGTAGATTT TGCTAAAGCC GTAAAAACAG ACTTCGAAAT TCAAGCCATG CGCCAAGCAA GCAAACGCGG CGCCGCAGGT CATATCGCAG CAAAAGAGGC TTTTTTAGCT GGCGGTAGCG AATTAGAAAT TCACCTCGCC TTCTTAAAAG CTAGCCAGCA AAGCGCTTTC CAAGAACCCT ATCCAGGTAT TGTCGGCTTA GACGAGCACG CTGCTGTGTT GCATTATGAA CATAAGTCCA TAGAGCAAGT CGCAAACTCT CGTACGCTAC TGATTGATGC AGGTGCCAAC GAGCATGGCT ATGCCAGTGA CATTACCCGT ACTTTTACGC GTCACTATGA TGACTTTAAC GCCTTGATCA ATGACCTTGA TGTAATGGAA CAAAAGCTTT GCCGTAGCGC CATATCTGGT GTTGCCTTCC AGGATTTGCA CCAAAACACG CTAGCAGGTA TTGCGGCATT ACTGCTTGAA CATAAGATTT GTGCTTTAAG CGTTGAAGAG CAATTAGCAA AACGCATTCC ACAAGTTTTC TTTCCTCACG GCTTGGGTCA TTTATTAGGC TTGCAAGTTC ACGATGTGGG CGGCCATCAA ATTGATCAAA CAGGCACATT AAGCATGCCT GACGACAGCG CTCCCTTCTT ACGTCTGACT AGAAAACTAG AGAAAAACAT GGTGATCACT ATAGAGCCAG GATTGTATTT TATTCCAATG TTACTAGATA AAATGCAGGC CGACATACTC GGCCATGGTT GTGATATGGC ACGGATCCAA CATTTTATGC CTTATGGAGG CATCCGCATC GAAGACAATG TGGTGGTGAA AAACGACCTG CCAGAGAACC TTACTCGTAA CGCTTTTTCC AACCAAGCTT AG
|
Protein sequence | MSPSMTTLYQ EHLAVLCNRY ENAMAHFCLD AIVLASGEKS YYFQDDHTHP FHAYSGAQQW LPFTLGAETY VILKTGEKPT LIWPVRDDFW HAPNPVPSGD WQQNWNILTA KKTDKWFSDL PNKTAWLGPQ AMPFAIVCEG LKAYVDFAKA VKTDFEIQAM RQASKRGAAG HIAAKEAFLA GGSELEIHLA FLKASQQSAF QEPYPGIVGL DEHAAVLHYE HKSIEQVANS RTLLIDAGAN EHGYASDITR TFTRHYDDFN ALINDLDVME QKLCRSAISG VAFQDLHQNT LAGIAALLLE HKICALSVEE QLAKRIPQVF FPHGLGHLLG LQVHDVGGHQ IDQTGTLSMP DDSAPFLRLT RKLEKNMVIT IEPGLYFIPM LLDKMQADIL GHGCDMARIQ HFMPYGGIRI EDNVVVKNDL PENLTRNAFS NQA
|
| |