Gene Phep_2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2787 
Symbol 
ID8253895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3294884 
End bp3297997 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content45% 
IMG OID644936433 
Producthypothetical protein 
Protein accessionYP_003093048 
Protein GI255532676 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.595004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCTTGC TCATACTGGG ACAGTTTTTC GTATTGGCCG GGTATGCACA AAAAGTAAAT 
ATCCCCGAAC CACCAAAACC AATATTTAAA GGTAAAGAAG GTAAACTGAA TTACAGTCCG
GACGAAAAAG GCAACCGGAT TCCCGATTTT TCTTATGCAG GTTATAAAGC CGGAGAGCAA
CCCATACCAG AGGCTGTAGT AAAGGTTGTT GTGCCGGTTA AATCAGGCGA TGCTACCCTG
AGGATCCAAT CGGCCTTAAA CTATGTTGCA GCTTTGCCCT TAGGCAAAGA TGGCTTAAGG
GGAGCTGTAT TGCTGGAAAA AGGAAAATAT GAAGTTGGAG GTGCTTTAAA GATCAATGCT
TCCGGCGTGG TTTTGCGTGG AAGCGGAATG GGGGAAAACG GAACGGAGAT ATTTGCAACA
GGACTGGACA GAATGGGGGT ATTACGCATA GCTGGTAAAC CAGATCGTAT TAAAGAGGCC
CCTGTAGCAG TTACAGATCA ATATGTTCCG GTAAATGCAA TGAAGGTTAC CCTTGCAAAT
GGAGGATTTA AAAAAGGTGA TCAGGTAATT GTACAACGCA TATCCTCTAA AAACTGGATT
GATTTGATTG GAACAGACCA TTTTGGCGGG GGCATTACCT CACTGGGCTG GAAAGCAGGA
CAACGTGACA TTTATTGGGA CAGAAAAGTG ATTGGGGTTG AAGGGAATAC TTTATTATTG
GATGCACCCT TAACTACAGC GCTGGATGCT GTTTATGGTG GGGCTACTGT ATCAAAATAT
AGCTGGAATG GCAGAATTTT CAATTCCGGT GCAGAAAATA TAAGATTTAC ATCGGGCTTT
GATGATAAAA ACCCTAAAGA TGAATACCAC CGCTGGACGG CCATTTCTAT AGAAAATGCC
ACAGATGCAT GGGTACGCCA GGTTGTTTTT GAACATTTTG CAGGTTCAGC AGTAACTGTT
CAGGAAACTG CAAACAGGAT AACTGTGGAA GATTGTAAAT CGCTGGCGCC GGTTTCGGAG
ATTGGTGGCG AACGCAGATA TACTTTTTTA ACTACAGGAG GGCAAACACT GTTTCAAAGA
TTGTATTCTG AATATGCTTA TCATGATTTT GCAGTTGGCT TTTGTGCTCC CGGTCCAAAT
GCTTTTGTCC AGTGCCAGGC TTATCTGCCA TTTAGCTTTA GCGGAACAAT TGACAGTTGG
GCATCAGGTG TTTTATTTGA TATTGTTAAT GTGGACGGAC AGGCCCTGAG TTTTATGAAC
AGAGGGCAGG ACGGACAAGG TGCAGGCTGG TCGGCCGCCA ACAGCGTATT CTGGCAGTGT
ACAGCGGCCC GGGTAGACTG TTATGCTCCG CCAACTGCGC AGAACTGGGC ATTTGGTACC
TGGGCACAAT TCTCGGGCGA CGGTTATTGG GATATGTCTA ACGAGCAGAT CCAGCCGCGT
AGTTTGTATT ATGCCCAATT GAAGGACAGG CTGGGAAAAC AGGCCGATGA ACGGACTTTT
GTAATGCCTG TAGAAACGGA AGCTTCAAGT AGCCCGCCAG TGGATGTAGC TCAGAAGCTA
ACCAGACTGG CTGATAAACC GGCCATGCTG TTAACGGAAT ATATAGATCA GGCAACCGAA
AGACAAAAGA TTTCAACTGA TACGCGCAAT GCAAAAAATA TTGACAAAAT AGGTGTTGAG
AGAATTAATA CTCCGGCTAA GGCCAGTGCG ATGCAGATAA GCAATGGTTG GTTGCGTCGG
GGCAATGCCC TGGTAACCGG AAACCGTGCA GACGTCCAAT GGTGGAATGG CAGCGCAAGG
CCATATGCGC TTAAAGGCAT GAAAATGCAC ATCACCCGTT TTGTACCTGG CCGTACAGGT
AAGGGGCTGA CAGACGATCT GGAAGAAATA ACTGACTCTA TGCAAAAAGG ATCAGTAAAG
ATCTTAGACC ATAATTATGG TTTGTGGTAC GACAGGAGAC GTGACGACCA TGAGCGGATC
AGAAGAATGG ACGGAGAAGT ATGGACGCCA TTTTATGAAT TGCCTTTTGC ACGCAGCGGA
CAGGATAAAG CATGGGATGG ACTGAGCAAA TACGACATTA GCAAATACAA CCCATGGTAT
TGGGGCAGGT TAAAACAATT TGCAGACCTT GCTGATCAGA AAGGCCTGGT ACTGATCCAC
GAAAACTATT TCCAGCATAA CATTATAGAA GCTGGTGCGC ATTATGCCGA TTTTCCATGG
CGTACGGCAA ATAACATCAA TAATACCGGT TTTCCGGAGC CAGTACCTTA CGCGGGCGAC
AAGAGGATAT TTATGGCAGG GCAATTTTAT GACATCAGCA ATGCTGAGCG TAAGGCACTG
CACCGTGCCT ATATCCGTAA ATGCCTTGAT AATTTTAAAG ACAATACCGG TGTAATCCAG
TTGATCGGTG CAGAGTTTAC CGGGCCATTA CATTTTGTAG AGTTCTGGAT AGATACCATT
AAAGAATGGG AAAAAGAGAC AGGTAAACAC CCGATTATTG GTTTAAGTAC CACTAAAGAT
GTGCAGGATG CCATATTGGC CGATAAAAAC AGGGCAGGAG TAGTCGATCT GGTCGACATC
CGTTATTGGC ATTACCAGGC TGATGGTTCT GCTTATGCAC CACAAGGTGG ACAAAACCTG
GCTCCTCGCC AGCATGCACG TTTGCTGAAA CCTAAAAAAA CATCTTTTGA TCAGGTATAC
CGTGCTGTAG CAGAATATCG TACCAGATAT CCCGAAAAGG CAGTGATCTA TTCAGGTGAT
GGTTTTGATG CTTTTGGCTG GGCCGTTTTT ATGGCTGGCG GATCTTTGTC AAATGTTCCG
GCTGCCAATA ATGCTTCGCT TTCGGGTGTG GCCACAATGA AACCATTTAA TTTGGCCGGC
CGGTCTTCAG GTCAGTATGC TTTGGCTAAT CCAGATGGCG CATATCTGTT GTACAACAGC
TCTTCCGTTC CTGTAAAACT TGACCTAAGT AAAGCTAGAG GAAACTATGT GGTAAAATAC
ATCAACCCGC GCAGCGGCCT GGTAGTTAAG GAAGAAAAGA TAAAGGGGGG AGCCGCTAAA
GAATTTAATA AGCTTTCATC GGGAGACGAA GTCGTTTTTA TCAATAAAAT TTAA
 
Protein sequence
MGLLILGQFF VLAGYAQKVN IPEPPKPIFK GKEGKLNYSP DEKGNRIPDF SYAGYKAGEQ 
PIPEAVVKVV VPVKSGDATL RIQSALNYVA ALPLGKDGLR GAVLLEKGKY EVGGALKINA
SGVVLRGSGM GENGTEIFAT GLDRMGVLRI AGKPDRIKEA PVAVTDQYVP VNAMKVTLAN
GGFKKGDQVI VQRISSKNWI DLIGTDHFGG GITSLGWKAG QRDIYWDRKV IGVEGNTLLL
DAPLTTALDA VYGGATVSKY SWNGRIFNSG AENIRFTSGF DDKNPKDEYH RWTAISIENA
TDAWVRQVVF EHFAGSAVTV QETANRITVE DCKSLAPVSE IGGERRYTFL TTGGQTLFQR
LYSEYAYHDF AVGFCAPGPN AFVQCQAYLP FSFSGTIDSW ASGVLFDIVN VDGQALSFMN
RGQDGQGAGW SAANSVFWQC TAARVDCYAP PTAQNWAFGT WAQFSGDGYW DMSNEQIQPR
SLYYAQLKDR LGKQADERTF VMPVETEASS SPPVDVAQKL TRLADKPAML LTEYIDQATE
RQKISTDTRN AKNIDKIGVE RINTPAKASA MQISNGWLRR GNALVTGNRA DVQWWNGSAR
PYALKGMKMH ITRFVPGRTG KGLTDDLEEI TDSMQKGSVK ILDHNYGLWY DRRRDDHERI
RRMDGEVWTP FYELPFARSG QDKAWDGLSK YDISKYNPWY WGRLKQFADL ADQKGLVLIH
ENYFQHNIIE AGAHYADFPW RTANNINNTG FPEPVPYAGD KRIFMAGQFY DISNAERKAL
HRAYIRKCLD NFKDNTGVIQ LIGAEFTGPL HFVEFWIDTI KEWEKETGKH PIIGLSTTKD
VQDAILADKN RAGVVDLVDI RYWHYQADGS AYAPQGGQNL APRQHARLLK PKKTSFDQVY
RAVAEYRTRY PEKAVIYSGD GFDAFGWAVF MAGGSLSNVP AANNASLSGV ATMKPFNLAG
RSSGQYALAN PDGAYLLYNS SSVPVKLDLS KARGNYVVKY INPRSGLVVK EEKIKGGAAK
EFNKLSSGDE VVFINKI