Gene Phep_1172 details


Gene Information

Locus tag: Phep_1172
Symbol:
ID: 8252270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1386726
End bp: 1389743
Gene Length: 3018 bp
Protein Length: 1005 aa
Translation table: 11
GC content: 41%
IMG OID: 644934827
Product: Tetratricopeptide domain protein
Protein accession: YP_003091452
Protein GI: 255531080
COG category: [S] Function unknown
COG ID: [COG1729] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
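
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(1386726, 1389743)
print(length)                  # 3018
print(protein_length(length))  # 1005
```

Both results agree with the Gene Length (3018 bp) and Protein Length (1005 aa) fields of this record.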


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAAA AATACCTTTT TATTCCGCTA CTGCTAGCAG GTGGCTTTAC AGCCGGTTAC 
GCCCAGACAA GCGTACTGGT TAACCTGAAC AAGAATTACC AAACTGGTCT GGAACTACTG
GATAATGAAA AATATGTAGC AGCTGCGCAA CAGTTCAGAC TGGTTGAGCA GCTCAGGCAA
AAACCGGGCA CACAACAGGA AAGCAATGCC GAACTGTCTA TGCTTAAAGA AAATGCCAAA
TTCTATGCTG CCGTTTGTGC CCTTGAGCTC GGTAACAGTG ATGCGGAAAG CCTGTTTCAG
AACTTCATTA AAGATTATCC CCTTAACCCC AATACCAAAC TGGCTTACTT CCATGTAGGC
AAATCCTATT TTGCCCAAAA AAATTACCAG AAAGCACTGG AATGGTTTGA AAAAACCGAT
CCTTCTACGC TATCCGGCAA ACAGCGGCTG GAATATCAGT TTAAACAGGG CTATGCCTAC
TTCCAGCTCA GCAACATGGA AAAAGCTGAA CCTTTATTTG AAGCCGTAAA AAAAGAAAAA
TCACCTTTTC AGGAAAGTGC AACCTATTAC TTTGCCTACA TCAATTACCT GAACAAAGAA
TATAAAACGG CACTAAGCAA TTTTGAAAAG CTTAAAGGAT CGCCAACTTA CGAAGCCAGC
TATCCCTATT ACATCACCTC CATGTATTAC CTGGATGAAA GGTATGACGA TGTGATCAGC
TATGCCATAC CCATCTTAAA AACCTCTAAA CAACAATACG AAGCAGAAAT GCTGAGCCTG
ATTGCGGCAT CCTATTTTGC CAAATCTGAT TATGTAAATG CAGAAAAATA CTTCAGAGAG
TTTTATGCTA AAGACAAATC AAACAATAAG AATAACCTGT TTATTTACCA ATATGGCTAT
TCTTTATTTG AATTGAAAAA GTACAGCGAA TCTGTAACTG TACTTGAAAA ACTGGACAAC
GATGATGTGT ACCTTCAAAG CGGTATGTAC ACCCTGGGCC GCTCTTTTCT GCAGTTAAAA
AACAAGGAGA AAGCAAGAAG TGCTTTCTTC AGGGCATCCA GGCTTGATTT TGACAAGGTA
ATACAAGAAG AAGCCTGGAT CAATTATGCC AGACTGAGCT ATGAACTCGA GTTTAACCAG
CAGGCGCTGG AAGCAACTCA AAATTTCTTG AAACAGTTTC CTTCTTCACG TAAAATCAAT
GAAGCGAAAA CCCTGTTGGG CGAGATTTTG CTGACCAGCA AAAACTACCA GGCAGCAATA
GATATTCTGG AACCCATTCA GAGCAAATCG CCCGAAGCCA GGGAAGCTTA CCAGAAAGTA
ACCTATTTCA GGGGGCTGGA GTTTTACAAT GAACGCGCAT TTCCAAATGC CCTGTCTATG
TTCCTCCGCT CTGAAAAGTT TCCTGAAGAT AATGAAATTT TAGCGCTGAG TACTTACTGG
AAAGCCGAAG CCTGCTATGA ACTGAGAAAA TTTGGTGAAG CAGTAAGGCA TTTTGAAACC
TTTTTGGATA TGCCGGGTGC CAGCAAAACG GGTGTTTACA ATTTTGCCAA TTATGCCCTG
GCCTATTCAG CATTTGAAGA TGAAAAATAT GGAAAGGCGG CACTTTATTT TGAACGCTTT
TTAAAAGGTA ATGATAAAGA CCAGAAAACA GTAAATGATG CCACCATCAG GCTTGCCGAT
TCGTATTTTG TAAACAAAAG TTATGGTAAT GCCCTGGTAA ACTACAACAG GATCATCGAC
AGCAAAGCCA GCGGAGAAGA TTATGCATTG TTTCAACGTG GGATGATCCA GGGACTGGAT
AACCAGAATG ACGCCAAGAT CAATACCATG CAAAATTTGC TGAAACAGTT TCCCAACTCC
AATTATGCGG ATGATGCAGG TTTCGAGATG GCCTATACCT ATTTCAATAA GGGGGAACTC
GACAAATCAA AATCCGATCT GATCAGTCTG GTAAGCCAAT ACCCCAACAG CAGTTATGTA
CCACGTGCAT TGGTAACCAT AGGTCTGGTG CAATACAACC AGGACCAGGA TGATGCTGCC
CTTGAGTCTT TCAAAAAGGT GATCCGCGAT TACCCGAGTA CTGAAGAAGC CAAACAGGCC
CTGGAATCTA TCAAAAATAT CTATGTAGAC AAAGGCGATT CCCAGGGTTT CATCAATTAT
GCCGGCACAA CCCCACTTGG CAACTATTCA AACGCAGAAC AGGATAATAT CCTGTTCCAG
GGTGCCAACA ATCTTTATTT GAAAGGAGAT GCAAAGGGTG CTTTTGAAGC AATTAACGCT
TACTTTGACA AATTCCCTAA GGCAATCCAC GATAAAGAAG CCAAGTTCAT CAGAGCAGAA
TCACTGGTAA AATTAGGCCG CCCGAATGAA GCTGTTCCTG ATTATGAATA TATTCTGAAC
GACTGGACCA GTGATTATAC CGAACGCTCG CTGGTAAGCA TTTCCAAATT GTTCCTTGAT
CAGAAAAAAT ACAATGAGGC TATCGTTTAT CTGAAACGCC TGGAAACTAC GGCCGATTAC
AAAGTGCACT ATACATATGC ACTGAACAAT CTTCTAAAAG CTTACAGTGA GCTGAACATG
CCTGATGATG TGTTGAAATA TGTCCAGCTG GTCAAAGAAT CTGACAAAGC TTCTGAAGAA
GAAAAAAACA GCGTAGATCT ATATGCGGGT AAAGCTTATT TATTAAAAGG TGATACTGAA
CAAGCTATAA AAGCCTTTAA CAGTGTGATT AGCAAAACCA AAACCCTGGC TGCCGCAGAA
TCCAAGTACA ATCTGGCTGC CATACAGTAT GATAAAAAAG ACTATAAAAC CTCTACAAAA
ACCTGTTTTG ACCTCATCAA CAACATGCCT TCTTATGATT ACTGGGTAGC AAAGGCATTT
ATCCTGTTAT CCGATAACTA TGTGGCCCTG AAAGATAATT TACAGGCAAA AAGCACGCTC
CTGAGCATCA TCGACAATTA CGAAGGCAAG GATGACATTG TACCTACTGC CAAGGCCAAA
TTAGAAAAAA TTAAATAA
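
The sequence above is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_fraction`) strip the formatting and compute the GC fraction. Only the first printed line of the gene is used here, so the result describes that 60-base fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCAAAAA AATACCTTTT TATTCCGCTA CTGCTAGCAG GTGGCTTTAC AGCCGGTTAC"
fragment = clean_sequence(first_line)
print(len(fragment))                         # 60
print(round(100 * gc_fraction(fragment), 1)) # 41.7
```

Pasting the full gene sequence into `clean_sequence` in the same way would allow the 3018 bp length and the reported GC content to be checked against the record.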
 
Protein sequence
MSKKYLFIPL LLAGGFTAGY AQTSVLVNLN KNYQTGLELL DNEKYVAAAQ QFRLVEQLRQ 
KPGTQQESNA ELSMLKENAK FYAAVCALEL GNSDAESLFQ NFIKDYPLNP NTKLAYFHVG
KSYFAQKNYQ KALEWFEKTD PSTLSGKQRL EYQFKQGYAY FQLSNMEKAE PLFEAVKKEK
SPFQESATYY FAYINYLNKE YKTALSNFEK LKGSPTYEAS YPYYITSMYY LDERYDDVIS
YAIPILKTSK QQYEAEMLSL IAASYFAKSD YVNAEKYFRE FYAKDKSNNK NNLFIYQYGY
SLFELKKYSE SVTVLEKLDN DDVYLQSGMY TLGRSFLQLK NKEKARSAFF RASRLDFDKV
IQEEAWINYA RLSYELEFNQ QALEATQNFL KQFPSSRKIN EAKTLLGEIL LTSKNYQAAI
DILEPIQSKS PEAREAYQKV TYFRGLEFYN ERAFPNALSM FLRSEKFPED NEILALSTYW
KAEACYELRK FGEAVRHFET FLDMPGASKT GVYNFANYAL AYSAFEDEKY GKAALYFERF
LKGNDKDQKT VNDATIRLAD SYFVNKSYGN ALVNYNRIID SKASGEDYAL FQRGMIQGLD
NQNDAKINTM QNLLKQFPNS NYADDAGFEM AYTYFNKGEL DKSKSDLISL VSQYPNSSYV
PRALVTIGLV QYNQDQDDAA LESFKKVIRD YPSTEEAKQA LESIKNIYVD KGDSQGFINY
AGTTPLGNYS NAEQDNILFQ GANNLYLKGD AKGAFEAINA YFDKFPKAIH DKEAKFIRAE
SLVKLGRPNE AVPDYEYILN DWTSDYTERS LVSISKLFLD QKKYNEAIVY LKRLETTADY
KVHYTYALNN LLKAYSELNM PDDVLKYVQL VKESDKASEE EKNSVDLYAG KAYLLKGDTE
QAIKAFNSVI SKTKTLAAAE SKYNLAAIQY DKKDYKTSTK TCFDLINNMP SYDYWVAKAF
ILLSDNYVAL KDNLQAKSTL LSIIDNYEGK DDIVPTAKAK LEKIK
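
The protein sequence can be related back to the gene sequence by translating codons. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as starts, so the sketch below uses the standard assignments. The `translate` function is a hypothetical helper, not a library API. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block formatting
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGTCAAAAA AATACCTTTT TATTCCGCTA CTGCTAGCAG GTGGCTTTAC AGCCGGTTAC"
print(translate(first_line))  # MSKKYLFIPLLLAGGFTAGY
```

The output matches the first 20 residues of the protein sequence listed above (MSKKYLFIPL LLAGGFTAGY).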