Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1397 |
Symbol | |
ID | 8252497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1664570 |
End bp | 1667557 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644935050 |
Product | hypothetical protein |
Protein accession | YP_003091673 |
Protein GI | 255531301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0243953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATATT CTAACCAGGT TAATGCTGCA CAGCTGTACA CCAGCCCTCC TCACATCCGC TGTTTCAATT TAATGATCCG GATAAGCATC CTATTCTTTT TCCTGCTGTT CAGCACTGTT AAGCTTTTTG CACAAAACCA GCCTACACAA AATACCCCAC CACCTTTGCC GGTAAGAGAA ATCAGCGGGA TTGTAAAAGA CAGCACAGAT CTGGGGGTAA TTGGCGCAAC AGTGAGTTTA ACCTCAATTA AGGATACTTT AAAAACCAGT ACCAATTCGG ACGGTATATT TGTATTCAAA AATGTAAAAT CGGCCACCTA TACCATTTTA GTACAAAGTA TTGGTTACAG GCCTTCTGCC CCAATGCGGT ACAAGCAAAA TGATGCCATA CCCCGTATCG TAATGGATCC CATTTTACTT AAAGAGCAAA AAAATACTCT AAATGAAGTG GTAATCAGTG GTACACCATC CATTACCTAT AAAACGGATA CAGTAGAGTA CAAGGCCAGC GATTATATTG TAAGGGCCAA CTCGACGGTA GATGAGCTGC TCAAAAAAAT GGAGGGCATG GAAGTCGGCA ATGATGGCTC ATTGGTGCAT CAGGGCCAGA ATGTAACAAA AGCAAAATTA AACGGTAAAG AATATCAGGG GGGCGACATT GCAACCGCAA TTAAAAACCT TCCGGCGGAA ATTGTGGACA AGATCCAGAT TGTGGACGAC TATGGTGATC AGGCCGCCCG TACCGGAATT AAAGATGGTG ATCCTGAAAA AGTACTGAAC ATTACTACCC GAACCGATAA ATCTGTGGGT AATATGGCCA ATATTTTCGT CGGGGCTGGT AATAACGATC GATATGAAGC CGGAATATTT GGTACACGGA TAAACGGAAA CCAGACCATA GGGGTAAACG GACGCTTCAG CAATACCGTA AACGGTGTAG CCAGCAGCGG AGACAACAGC AATAATAGCG GTGGTGGCGG TGGAGGTGGC GGCGGCCGTG GTGGTCAGGG TGGCCAGAAT ACTCAGAATA GTGGTGGCAG TGGTAGTGGT GGTACCACCA CCTCAGGCCG GGGCTCATTC AGTATCCGTG ATAAAATAGG GAAAAAAATA GAACTCAACC TGAATTATGA CTACACGAGT TCAAATGTCA AATCTTTAAA CGACAGTTAT TCGGTTAGTC CAACGCGGCA GCGGATCCAG GTGATCGAAA ATGGTGTTCC CATCATGAAA GATACTACCT ATTATACTTT TGCAAATAGC CTAAGCAATG GAGAAAACCT GAACAAGAGC CATAATTTCA GGGCCGAAAT AGAAATAAAC CTCGATAGTA ACAATTTTCT ACGTGCGGTC CCTACATTGC GGTATAGTTC GGCAAACAAC ACGAGATTTT CCGATATCAA ACAAACCGGG TTTTATCACC AGAACAGGCA AGGAACCAAT ATCAGCAAAA ATACAAGACC CCAGCTGGGT GCTTCTGTTT TTTATCAGCA TATTTTTAAA AAACCCAGAA GGAACCTTTC TTTACAGGTA GATGCAAACA GCAATAATCT GGACCAGGAG CAGGAACAGG ATACCAAAAT CATTTACTTT TTAAATGAGG CTGAAACAGA AACAAGGGAC TCGGCTGTAA ACAGGATTGT AGCCAGGAAA AACCTGCAGA GCAATTACCG GGGAAGCTTT ACTTTTGCAG AACCCTTAAC TGCCAATACC CAGCTGGAAT TAAATGCCCA GGTCAATTAC AATGGGTACG ACAATACCGC AACTACGCAG AACATCATCA ATGGAAATCC ATCTGGAGTC ATAGATTCTC TGAGTAATAT TTATGATTAT TCCTTTACGC AGGGCCGTAT TGCTTTGAAC TACCGTTATG GCTTAAGCAA TATGTCTAAA GTACGTTTCT CTTTGGGTAT AACGGCAATA CCATCCCTGC TTTCCGGTAC AAAGGTAAGT CTGGGTACCA CCACCAACAG AAGCAGTTTT AATATGGTGC CCATTGCCCG TTTTGAATAT CTATGGTCCA GACAACATAA AATGTCGATC AATTATTATG GAAATGCAAT AGAGCCTACC TTTGACCAGA TCCAGCCGGT TAGGGATGTA ACCAATCCAC AAAATCCTAT TGTGGGTAAC CCAAATTTAG TGGCCTCTTT CCAGCATACG GTTAGGGCGG GTTATGACAA TTACATCGCC AATTCTAAAT TGAACTATTC TTTGAATGTC AACGGATCCT TAACAGATAA TTCGGTCATC AGAAATAATG TACAGATCAT TGATCAGATA CTGACAAATC AGGCTACGAG TAAAAAAGAT ACCATTTACA TTACGGAAAC CCGTTTTTTA AATACCAGTG GAGCGTATAA GGTAAATGGA AATTATTCCA TCAGCAAACA ACTGAACGAC CGCAGGTACA ATCTGTCGCT CAGTGGATCA GCCAGCTACG ATCATCGTAT TTCTATGAGT GCCTCGCAAA AAAACATAAA TACAGTAATG ACTTTTATTG AACGTTTCGG ACCAAGGATC AATCCGAACG ACTGGTTCGA GATCAATCCC AACGTATCAT ATAACTACAC GAAATCGACC AATACCCTAC CTGGTTTCAG GGATACCAAA ACAAATACAC TGGCGCTTAA CCTGGACGGA AGGATGTACT TCCTGAACAG CTGGCTTTTT GGCTATAGTG CCAGTAAAAA CTATGTCAGC GGAATTGATG CCAATGTGAC CAGCAATCCT TTTGTGGTAA ACGCCTATAT AGAAAAAGAA TTTTTTAACC GCAGGGGCCG GGTCACCTTT CAGGCCTTCG ATATCCTGAA CCAGAACAAT TTTGTAAGTC GTGACAACAG CGAGGATGGA GGATATACCG ATACCAAATC CAATGCACTG AGCCGGTACT TTATGCTGCG TTTAAGTATG CGTTTACAAA AATGGACGGG TGCACAGGGC CGCGGAGGCA GACAGATTAT GCGCAGGGGC GATGGAAGCT TTATGTAG
|
Protein sequence | MKYSNQVNAA QLYTSPPHIR CFNLMIRISI LFFFLLFSTV KLFAQNQPTQ NTPPPLPVRE ISGIVKDSTD LGVIGATVSL TSIKDTLKTS TNSDGIFVFK NVKSATYTIL VQSIGYRPSA PMRYKQNDAI PRIVMDPILL KEQKNTLNEV VISGTPSITY KTDTVEYKAS DYIVRANSTV DELLKKMEGM EVGNDGSLVH QGQNVTKAKL NGKEYQGGDI ATAIKNLPAE IVDKIQIVDD YGDQAARTGI KDGDPEKVLN ITTRTDKSVG NMANIFVGAG NNDRYEAGIF GTRINGNQTI GVNGRFSNTV NGVASSGDNS NNSGGGGGGG GGRGGQGGQN TQNSGGSGSG GTTTSGRGSF SIRDKIGKKI ELNLNYDYTS SNVKSLNDSY SVSPTRQRIQ VIENGVPIMK DTTYYTFANS LSNGENLNKS HNFRAEIEIN LDSNNFLRAV PTLRYSSANN TRFSDIKQTG FYHQNRQGTN ISKNTRPQLG ASVFYQHIFK KPRRNLSLQV DANSNNLDQE QEQDTKIIYF LNEAETETRD SAVNRIVARK NLQSNYRGSF TFAEPLTANT QLELNAQVNY NGYDNTATTQ NIINGNPSGV IDSLSNIYDY SFTQGRIALN YRYGLSNMSK VRFSLGITAI PSLLSGTKVS LGTTTNRSSF NMVPIARFEY LWSRQHKMSI NYYGNAIEPT FDQIQPVRDV TNPQNPIVGN PNLVASFQHT VRAGYDNYIA NSKLNYSLNV NGSLTDNSVI RNNVQIIDQI LTNQATSKKD TIYITETRFL NTSGAYKVNG NYSISKQLND RRYNLSLSGS ASYDHRISMS ASQKNINTVM TFIERFGPRI NPNDWFEINP NVSYNYTKST NTLPGFRDTK TNTLALNLDG RMYFLNSWLF GYSASKNYVS GIDANVTSNP FVVNAYIEKE FFNRRGRVTF QAFDILNQNN FVSRDNSEDG GYTDTKSNAL SRYFMLRLSM RLQKWTGAQG RGGRQIMRRG DGSFM
|
| |