Gene Information |
Locus tag | Phep_1293 |
Symbol | |
ID | 8252393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1530705 |
End bp | 1533734 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644934947 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003091570 |
Protein GI | 255531198 |
COG category | |
COG ID | |
TIGRFAM ID | |
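The start, end, gene length, and protein length listed above agree with each other under two assumptions that the record does not state: the coordinates are 1-based and inclusive, and the gene length includes the terminal stop codon while the protein length does not. A minimal sketch of that arithmetic follows; the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1530705, 1533734)  # Start bp, End bp from the record
    print(length)                     # 3030
    print(protein_length_aa(length))  # 1009
```

Both results match the Gene Length and Protein Length fields of this record.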
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00882728 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATATTA AAATATACCT GTTGCTGTTT ATTTGTTTTG CTTGCCCCAC TGTTAGGGCG CAGCAGCAAA ATATTGCCTT ACATAAAGAA GTTACCGTAT CTTCTGAAGC TGAGGGGCAT CCTGCCGGGA ATATCGTAGA TGGTAAGATA TCCCGGTCAT CGGTTTGGCG GGCAGCTGTT GCCAAAGCAC CGCATATTGT TGAAATCAAT TTCAATACCT ACTATAATGT GAATGAGCTG CGGATACATA GTGGTATTAT GGACCAGGAA AAGAAGCCGG ATGAAATGAG CCAGGCCGCT GGTTTCTGGT CGGTAAAGAA CTTTAAGCTG CAATACTGGG ATGATGCCAA CTGGACAGAT TTCCCGAAGG CTGAGGTACA TGAAAACAGA TTGACCACTG TTGTGATGAA GTTTCAGCCA GCAGTAACTA CTTTTAAGAT CCGTTTGGTA TGTGACGATG GAGAGCCCAT CAGTATTATG GAAATAGAGG CCTTTGGCTC GGTTGCAGCC AACATGCCGG CACCACCAAC CGGCAATGCC AATGTGCTGC AGCAAAAGAA ATTAACAGGC CCCCAAAGCG CCAACATTAA AGTTACACCG AAAGTTGTGG GCAAAACCAT GAAGTTTGTA GGCTATAACC AGGGCTATTA TTTGCCAGGT ACAAATGTCT CGGGCTGGAT GGAATATGCC AATGTAAACA GTGTTCGCCT TTGGGCAGCT TTGAATGCTT TTGTACCAGA AAGAACCGTG CAGGTTGATC CGGGTATAAC CTCGGTTGAA GTGTTTGACA AAAGAAAAAA TGAACTGCGT ACCAGCCCTG AGCACAATAA ATACCTGAAA TGGGATGAGC TGACACCACT ATATGACCTT CCGGACTCAT CCTCTACCAA TGCCATGGTA TACAATTATG CTTTAAAAGA ATTGAAACGT TTGGGCATTG CCGTGGTTTT GCAGGTGGGG AGTACCGATT TTAAAGATAC CTGGGAAAAT AAATGGAAGC AATGGCAGCG GTATTATGCC CTGGCTTACC ATTCGGCAAA AACCGGGGAT GTAAGCATGT TTGCGATGCA GAATGAACCA AATCACAGAA ACGCCGGTCC TATGAAGCTG GACGACTGGA TCATGGGGAT GCAGATCACT TCCGATGCCA TACACAGCGC CGTAGCCGAC GTGAATAAAA AGTATGGCAA AAAGCTGGAG GCAAAATTTG TAGGGCCGGT AACTGCCGGT CAGAATACAG ACTGGTGGGC GGCCGTAGCT AAAGCCATCC GTACAGATTA CCATGGCAAA CAGGTGGATA AGGACCTGAT GGAGATTTTT TCTACCCATT CTTATAACTC GCCGGCCATA GGCTATGCGT CCAGGATCAG CAATATCCGT AAAATTATAC AGGAACACCA TCCAAAGGGA GCTTCACTGC CCATTGTATA TACCGAAATC GGCCGCTGGA TGAACGCTTA CCTGATTGAT AAGGAAGAGA CCATGGACGA TCCCTCATTA TTTACAGAAT GGGCGGGCAT TTATTCCAAC AACATGAAAA ATGCAGGTTA TGGGATGTGG GCATTTAAGT TTGCCAATAC AGCAAGCGGC CCTTATGCAA GGGGTATCAA ATCGGGACAC CATTACATCT GGCAGGGCAG GCGTATTGTG GAAGATGCCT ATCAAAACCT GGCGCTGCGG AAGTCGGTTA AAACCAGCAA CTCAGCAAGC AACTCAGCAG TAGTTACCGA TGGTGACAAA TCCGATGCCT CGGCCTGGGT ATCAGCTACC GAGGGTGAAA AATGGATAGA GATAGATCTT 
GGAACTGTAC ATACCCTAGG CAGTTCGGTA GTCTACACCG GTTCTGCAGG AGGGATATAT ACTGCGCCAG ACCGCATAAA AAACTTTAAA CTACAATATT TTAGTCAGGG ACAATGGCTG GACATTCCCG GTACAGCAGA AAAAGAAAGC AGGTATGCTC AGATTTTCAG TATTTTTAAA CAGCCGGTAA ATACCAGCAA AATCCGTTTT ATAAGCAAGG ATAAAGGCAA CCTGAAAGTA AGGGAAATCA AAGTATTTGC CAAAGGTGAT GGTCCTTCCG ATAAAGCTGA TTTTAATGTT TCAGGTATTC AGCGTACCGG AGAGGTGGTA CGCCTGTTTG CAAAAGGCTT TAAAGATGAG CGTAACCTGC TGCAAACCAA AGTTTCTGTA AACGATACTG ACCTGGATAC CTATACGTCT TATGATGCGC AAACCGGCAA TTATTATATA TGGCTGGTTC AGCGCGGTAC CTTTAGTTAT AAGCTGTCTG TAGACCTTTC TGCATTAAAC CTTGCCGCAG GTACTCCGGT TACTGCAGAG ACCGTAAACG CTTTAAATTA TGGTGAAGTA ACCCATAACA TCAGTCTGCC AGCCAGCCAG ACATTTAATT TTGAACTGGC ACCGCAAAGT GTGGTACTAC TGACTATCCC ATCGGCTAAA TTGGTTAAGA CTACCGTATA TCCGGTTGCG GATGCCAGTG TAGCGGGTGG TAAAAATGCA TTGCTCAGTA ATGGTACAGC AAAGCAGCTT GCGGTGCAGC TTGACGCTGC TGTGCCGGAA AATAACCAGG TCGCTTACCT GTACTTTAAC CTCTCCAAAA ATAACCTGGC CACTGCAAAA AAAATCATTG TGGGACTAAG TGGGGCAACT GATCAGAATA AAAGACCATA TCGTTTGCAT GTTTACGGAA TTCCGGGGCC AAAATGGGAG CAGCAACAAT TAAACTGGTC AAATGCGCCA TTGCTGGACC AGAAAGAAGC CCTTATTAAG GGTGTGGGGG AAAAAGCCTT TGTTGCTGGC GAGCTGGCTT TTAACGGAAA GGAGCAATAC CATCAGCTGG ACATTACCGG CATGGTCAAA AAACATGTTA AAGAGGGCCT TACACTCGTA CTGATACGGG AAACCAGGCA ACTAGGTGAT GATGAGGATA AGGGCAGGAA AGTAAGAATC AATTCGATGG AGTCGGTAAA CAAGCCCAAA CTGGAAATCT GGCATGAAAC CTCAAAATAA
Protein sequence | MNIKIYLLLF ICFACPTVRA QQQNIALHKE VTVSSEAEGH PAGNIVDGKI SRSSVWRAAV AKAPHIVEIN FNTYYNVNEL RIHSGIMDQE KKPDEMSQAA GFWSVKNFKL QYWDDANWTD FPKAEVHENR LTTVVMKFQP AVTTFKIRLV CDDGEPISIM EIEAFGSVAA NMPAPPTGNA NVLQQKKLTG PQSANIKVTP KVVGKTMKFV GYNQGYYLPG TNVSGWMEYA NVNSVRLWAA LNAFVPERTV QVDPGITSVE VFDKRKNELR TSPEHNKYLK WDELTPLYDL PDSSSTNAMV YNYALKELKR LGIAVVLQVG STDFKDTWEN KWKQWQRYYA LAYHSAKTGD VSMFAMQNEP NHRNAGPMKL DDWIMGMQIT SDAIHSAVAD VNKKYGKKLE AKFVGPVTAG QNTDWWAAVA KAIRTDYHGK QVDKDLMEIF STHSYNSPAI GYASRISNIR KIIQEHHPKG ASLPIVYTEI GRWMNAYLID KEETMDDPSL FTEWAGIYSN NMKNAGYGMW AFKFANTASG PYARGIKSGH HYIWQGRRIV EDAYQNLALR KSVKTSNSAS NSAVVTDGDK SDASAWVSAT EGEKWIEIDL GTVHTLGSSV VYTGSAGGIY TAPDRIKNFK LQYFSQGQWL DIPGTAEKES RYAQIFSIFK QPVNTSKIRF ISKDKGNLKV REIKVFAKGD GPSDKADFNV SGIQRTGEVV RLFAKGFKDE RNLLQTKVSV NDTDLDTYTS YDAQTGNYYI WLVQRGTFSY KLSVDLSALN LAAGTPVTAE TVNALNYGEV THNISLPASQ TFNFELAPQS VVLLTIPSAK LVKTTVYPVA DASVAGGKNA LLSNGTAKQL AVQLDAAVPE NNQVAYLYFN LSKNNLATAK KIIVGLSGAT DQNKRPYRLH VYGIPGPKWE QQQLNWSNAP LLDQKEALIK GVGEKAFVAG ELAFNGKEQY HQLDITGMVK KHVKEGLTLV LIRETRQLGD DEDKGRKVRI NSMESVNKPK LEIWHETSK
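The protein sequence can be spot-checked against the gene sequence by translating it codon by codon. The sketch below is not the pipeline that produced this record. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in the record), written in the conventional TCAG codon order; the `translate` helper and `CODON_TABLE` name are hypothetical. Only the first three 10-base blocks and the final nine bases of the gene sequence are used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAATATTA AAATATACCT GTTGCTGTTT"))  # MNIKIYLLLF
    print(translate("TCAAAATAA"))                         # SK
```

The two outputs match the first ten and last two residues of the protein sequence listed above.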