Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1172 |
Symbol | |
ID | 8252270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 1386726 |
End bp | 1389743 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644934827 |
Product | Tetratricopeptide domain protein |
Protein accession | YP_003091452 |
Protein GI | 255531080 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAAA AATACCTTTT TATTCCGCTA CTGCTAGCAG GTGGCTTTAC AGCCGGTTAC GCCCAGACAA GCGTACTGGT TAACCTGAAC AAGAATTACC AAACTGGTCT GGAACTACTG GATAATGAAA AATATGTAGC AGCTGCGCAA CAGTTCAGAC TGGTTGAGCA GCTCAGGCAA AAACCGGGCA CACAACAGGA AAGCAATGCC GAACTGTCTA TGCTTAAAGA AAATGCCAAA TTCTATGCTG CCGTTTGTGC CCTTGAGCTC GGTAACAGTG ATGCGGAAAG CCTGTTTCAG AACTTCATTA AAGATTATCC CCTTAACCCC AATACCAAAC TGGCTTACTT CCATGTAGGC AAATCCTATT TTGCCCAAAA AAATTACCAG AAAGCACTGG AATGGTTTGA AAAAACCGAT CCTTCTACGC TATCCGGCAA ACAGCGGCTG GAATATCAGT TTAAACAGGG CTATGCCTAC TTCCAGCTCA GCAACATGGA AAAAGCTGAA CCTTTATTTG AAGCCGTAAA AAAAGAAAAA TCACCTTTTC AGGAAAGTGC AACCTATTAC TTTGCCTACA TCAATTACCT GAACAAAGAA TATAAAACGG CACTAAGCAA TTTTGAAAAG CTTAAAGGAT CGCCAACTTA CGAAGCCAGC TATCCCTATT ACATCACCTC CATGTATTAC CTGGATGAAA GGTATGACGA TGTGATCAGC TATGCCATAC CCATCTTAAA AACCTCTAAA CAACAATACG AAGCAGAAAT GCTGAGCCTG ATTGCGGCAT CCTATTTTGC CAAATCTGAT TATGTAAATG CAGAAAAATA CTTCAGAGAG TTTTATGCTA AAGACAAATC AAACAATAAG AATAACCTGT TTATTTACCA ATATGGCTAT TCTTTATTTG AATTGAAAAA GTACAGCGAA TCTGTAACTG TACTTGAAAA ACTGGACAAC GATGATGTGT ACCTTCAAAG CGGTATGTAC ACCCTGGGCC GCTCTTTTCT GCAGTTAAAA AACAAGGAGA AAGCAAGAAG TGCTTTCTTC AGGGCATCCA GGCTTGATTT TGACAAGGTA ATACAAGAAG AAGCCTGGAT CAATTATGCC AGACTGAGCT ATGAACTCGA GTTTAACCAG CAGGCGCTGG AAGCAACTCA AAATTTCTTG AAACAGTTTC CTTCTTCACG TAAAATCAAT GAAGCGAAAA CCCTGTTGGG CGAGATTTTG CTGACCAGCA AAAACTACCA GGCAGCAATA GATATTCTGG AACCCATTCA GAGCAAATCG CCCGAAGCCA GGGAAGCTTA CCAGAAAGTA ACCTATTTCA GGGGGCTGGA GTTTTACAAT GAACGCGCAT TTCCAAATGC CCTGTCTATG TTCCTCCGCT CTGAAAAGTT TCCTGAAGAT AATGAAATTT TAGCGCTGAG TACTTACTGG AAAGCCGAAG CCTGCTATGA ACTGAGAAAA TTTGGTGAAG CAGTAAGGCA TTTTGAAACC TTTTTGGATA TGCCGGGTGC CAGCAAAACG GGTGTTTACA ATTTTGCCAA TTATGCCCTG GCCTATTCAG CATTTGAAGA TGAAAAATAT GGAAAGGCGG CACTTTATTT TGAACGCTTT TTAAAAGGTA ATGATAAAGA CCAGAAAACA GTAAATGATG CCACCATCAG GCTTGCCGAT TCGTATTTTG TAAACAAAAG TTATGGTAAT GCCCTGGTAA ACTACAACAG GATCATCGAC AGCAAAGCCA GCGGAGAAGA TTATGCATTG TTTCAACGTG GGATGATCCA GGGACTGGAT AACCAGAATG ACGCCAAGAT CAATACCATG CAAAATTTGC TGAAACAGTT TCCCAACTCC AATTATGCGG ATGATGCAGG TTTCGAGATG GCCTATACCT ATTTCAATAA GGGGGAACTC GACAAATCAA AATCCGATCT GATCAGTCTG GTAAGCCAAT ACCCCAACAG CAGTTATGTA CCACGTGCAT TGGTAACCAT AGGTCTGGTG CAATACAACC AGGACCAGGA TGATGCTGCC CTTGAGTCTT TCAAAAAGGT GATCCGCGAT TACCCGAGTA CTGAAGAAGC CAAACAGGCC CTGGAATCTA TCAAAAATAT CTATGTAGAC AAAGGCGATT CCCAGGGTTT CATCAATTAT GCCGGCACAA CCCCACTTGG CAACTATTCA AACGCAGAAC AGGATAATAT CCTGTTCCAG GGTGCCAACA ATCTTTATTT GAAAGGAGAT GCAAAGGGTG CTTTTGAAGC AATTAACGCT TACTTTGACA AATTCCCTAA GGCAATCCAC GATAAAGAAG CCAAGTTCAT CAGAGCAGAA TCACTGGTAA AATTAGGCCG CCCGAATGAA GCTGTTCCTG ATTATGAATA TATTCTGAAC GACTGGACCA GTGATTATAC CGAACGCTCG CTGGTAAGCA TTTCCAAATT GTTCCTTGAT CAGAAAAAAT ACAATGAGGC TATCGTTTAT CTGAAACGCC TGGAAACTAC GGCCGATTAC AAAGTGCACT ATACATATGC ACTGAACAAT CTTCTAAAAG CTTACAGTGA GCTGAACATG CCTGATGATG TGTTGAAATA TGTCCAGCTG GTCAAAGAAT CTGACAAAGC TTCTGAAGAA GAAAAAAACA GCGTAGATCT ATATGCGGGT AAAGCTTATT TATTAAAAGG TGATACTGAA CAAGCTATAA AAGCCTTTAA CAGTGTGATT AGCAAAACCA AAACCCTGGC TGCCGCAGAA TCCAAGTACA ATCTGGCTGC CATACAGTAT GATAAAAAAG ACTATAAAAC CTCTACAAAA ACCTGTTTTG ACCTCATCAA CAACATGCCT TCTTATGATT ACTGGGTAGC AAAGGCATTT ATCCTGTTAT CCGATAACTA TGTGGCCCTG AAAGATAATT TACAGGCAAA AAGCACGCTC CTGAGCATCA TCGACAATTA CGAAGGCAAG GATGACATTG TACCTACTGC CAAGGCCAAA TTAGAAAAAA TTAAATAA
|
Protein sequence | MSKKYLFIPL LLAGGFTAGY AQTSVLVNLN KNYQTGLELL DNEKYVAAAQ QFRLVEQLRQ KPGTQQESNA ELSMLKENAK FYAAVCALEL GNSDAESLFQ NFIKDYPLNP NTKLAYFHVG KSYFAQKNYQ KALEWFEKTD PSTLSGKQRL EYQFKQGYAY FQLSNMEKAE PLFEAVKKEK SPFQESATYY FAYINYLNKE YKTALSNFEK LKGSPTYEAS YPYYITSMYY LDERYDDVIS YAIPILKTSK QQYEAEMLSL IAASYFAKSD YVNAEKYFRE FYAKDKSNNK NNLFIYQYGY SLFELKKYSE SVTVLEKLDN DDVYLQSGMY TLGRSFLQLK NKEKARSAFF RASRLDFDKV IQEEAWINYA RLSYELEFNQ QALEATQNFL KQFPSSRKIN EAKTLLGEIL LTSKNYQAAI DILEPIQSKS PEAREAYQKV TYFRGLEFYN ERAFPNALSM FLRSEKFPED NEILALSTYW KAEACYELRK FGEAVRHFET FLDMPGASKT GVYNFANYAL AYSAFEDEKY GKAALYFERF LKGNDKDQKT VNDATIRLAD SYFVNKSYGN ALVNYNRIID SKASGEDYAL FQRGMIQGLD NQNDAKINTM QNLLKQFPNS NYADDAGFEM AYTYFNKGEL DKSKSDLISL VSQYPNSSYV PRALVTIGLV QYNQDQDDAA LESFKKVIRD YPSTEEAKQA LESIKNIYVD KGDSQGFINY AGTTPLGNYS NAEQDNILFQ GANNLYLKGD AKGAFEAINA YFDKFPKAIH DKEAKFIRAE SLVKLGRPNE AVPDYEYILN DWTSDYTERS LVSISKLFLD QKKYNEAIVY LKRLETTADY KVHYTYALNN LLKAYSELNM PDDVLKYVQL VKESDKASEE EKNSVDLYAG KAYLLKGDTE QAIKAFNSVI SKTKTLAAAE SKYNLAAIQY DKKDYKTSTK TCFDLINNMP SYDYWVAKAF ILLSDNYVAL KDNLQAKSTL LSIIDNYEGK DDIVPTAKAK LEKIK
|
| |