Gene Phep_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2784 
Symbol 
ID8253892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3290645 
End bp3292492 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content46% 
IMG OID644936430 
Producthypothetical protein 
Protein accessionYP_003093045 
Protein GI255532673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00400401 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATGA GAAGATATAA AATTTGGTTA ACCCCCGTAC TTTTGATACT GGTACAAGCA 
GCACAAGCAC AAGATACCTT ACGTTATACC GGCAGTGTCA TGGTTAATGC CGACTATCAC
CACGGTCAGC TTGTGCCTGC GATGGGGGTA CACAACATCC AGACTTTCAG GGCCAACCGG
GAACATCCTG AACTGGCTGA GGGACTAAAC TGGACTTATA ACCATGCACC CATGCTGGCC
TGGTGGAATG GTACTTTTTA CCTGGAATAC CTGAGCGATC CTGTGGGTGA GCATATCCCG
CCCAGCAGAA CACTGTTGCA AACCTCTAAA GATGGTTACA AATGGTCTAA GCCGGACATT
GTTTTTCCGC CCTATAAAAT TCCCGATGGC TGGAAAAAAG ATGGCTATCC TGGTGTAGCT
AAAGATTTAT ATGCCACCAT GCACCAACGC GTTGGTTTTT ATGTATCCCA GTCCGGACGC
CTGTTCCTGC TGGCTTATTA TGGTATAGCT ATGGATAAAA AAGACGATCC GAATGATGGA
AAAGGCATCG GGCGCGTCAT CAGGGAGATC AGGAAAGACA ATACTTATGG TCCTATATAT
TTCCTCAGGC CGAATTCCAG CTGGGATATG AAGCATGCGG CATATCCGAT GTATAGCAGC
AGTAAGGATA AAGATTTCGT AAAGGCATGC AATGAAATTC TGGCCAGTCC CTTAATGATG
CAGCAAATGG TAGAAGAGGC CGATAGAAAT GATCCTCTAA TTCCATTGAA CAGACCAGTA
AAGGCATTCA GTTATTACCA CCTGCAGGAT GGAAGGGTAG TGGGGTTATG GAAACATGCG
CTAACATCCA TCAGCAAGGA CAATGGCAAG AGCTGGCAAT ATAATCCGCT GCGTGCACCT
GGAGTAGTAA ACAGCAATGC CAAAATATGG GGTCAGCGTA CCTCAGACGG CCGTTTTTCC
ATAGTATACA ACCCTTCAGA ATTCCGCTGG CCACTGGCCG TTTCTACCAG TGATGACGGC
TTAGATTACA AAGACCTTTT GCTGGTTAAT GGCGAAATTT CTACCATGCG CTATGGTGGT
AATTACAAAT CCTATGGTCC GCAGTATATC CGTGGGATCC CGGAAACTGA TGGTAAGCCA
GCTGATGGCA ATATGTGGCT TACTTATAGC ATGAACAAAG AAGACATCTG GATCGCCAAA
GTACCGGTAC CTGTAATTTC CAGTACTAAA ACGCCGGTCG ATGAAGTGTT TAATTCCCTT
CCCGACGGGC AGGAACTTAA GTTATGGAAC ATTTTTAGTC CGCTTTGGGC CCCGGTTCGA
ATAGAAAAGA CACCCGACGG TACTAAGGCA CTCACCTTGC GCGATAAAGA TCCATACGAT
TATGCCAAAG CGGAACGGTT GATCCCGGAA GCAAAAAAAG TAAAAGTAGA ATTTTCAGTA
AGCCCGGCCC AGAACAATAC CGGGTCATTG CAGATTGAAT TTCAGGATGC CAGAGGTACA
GCAGCGGCCA GGTTAATATT TGATGCCGAT GGTGCCTTAA AAGCCAAGGT GGGTTACCGG
AATTCTGAGG TCATGAAATA CGAAGCAGGG AAAACATACC AGGTCAGGAT AGAGCTGGAC
CGTGACAAAA GGATGTACGA CATCTTTGTA AATGGCCAAA GTAAAGGAAC AAGGCTGATG
TTTGTACCTG TAGCATCATT TGAAAAAATT ACCTTCAGAA CAGGTGATAT ACGCCGGTTT
CCGGATGTAG ATACACCAAC AGATCAGGAT TTCGATCTTA AAAATGCTGG CACACCAGTG
AAAGAAGCCG TATATTACGT CAAATCATTA AGAACAACAG CATTTTAA
 
Protein sequence
MPMRRYKIWL TPVLLILVQA AQAQDTLRYT GSVMVNADYH HGQLVPAMGV HNIQTFRANR 
EHPELAEGLN WTYNHAPMLA WWNGTFYLEY LSDPVGEHIP PSRTLLQTSK DGYKWSKPDI
VFPPYKIPDG WKKDGYPGVA KDLYATMHQR VGFYVSQSGR LFLLAYYGIA MDKKDDPNDG
KGIGRVIREI RKDNTYGPIY FLRPNSSWDM KHAAYPMYSS SKDKDFVKAC NEILASPLMM
QQMVEEADRN DPLIPLNRPV KAFSYYHLQD GRVVGLWKHA LTSISKDNGK SWQYNPLRAP
GVVNSNAKIW GQRTSDGRFS IVYNPSEFRW PLAVSTSDDG LDYKDLLLVN GEISTMRYGG
NYKSYGPQYI RGIPETDGKP ADGNMWLTYS MNKEDIWIAK VPVPVISSTK TPVDEVFNSL
PDGQELKLWN IFSPLWAPVR IEKTPDGTKA LTLRDKDPYD YAKAERLIPE AKKVKVEFSV
SPAQNNTGSL QIEFQDARGT AAARLIFDAD GALKAKVGYR NSEVMKYEAG KTYQVRIELD
RDKRMYDIFV NGQSKGTRLM FVPVASFEKI TFRTGDIRRF PDVDTPTDQD FDLKNAGTPV
KEAVYYVKSL RTTAF