Gene Phep_0772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0772 
Symbol 
ID8251861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp905076 
End bp908147 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content47% 
IMG OID644934422 
Producthypothetical protein 
Protein accessionYP_003091056 
Protein GI255530684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACTA AGGTATGGTG CTTTTTTCTC TGCTGGTTTA CACTGGGTAA TTTGCATGCA 
CAAAGTCCGG CTACGGTAAG CTGGGCCTTA ACAGCTAATC AAAGTGCATC CGTAACGGGC
AACATTACAG CAGCTGTTCA GCAGCTTGCG GGCTTAACCA TCCAGGACTA CATCAGCGGG
GGACAGCGGA CTTTGCCTCC TGGGGGTACT TGGCCTGCCC AGACTGTGCC AGATACTACA
CGGTTTATGC AATTTGCATT GAGCCCCCTT AGCGGAAATG ATTTAAATGT TAGTTCGGTA
GCCTTGATGA TCAGTTTTTA TGGTTCCAGT GCCGGACGTG TAAACATGGC CTGGTCTACC
GATAGTGTTC ATTTTACAAA CCTGACCTCA AACTTTTCAC TGGTATCAGG TACAACACCT
ACATCCTATA CTTTTGGCGG ACTGAACATT ACTGTGCCAA GCGGCAAAAA GTTGTATTTC
AGGATTTCTC CCTGGACAGC CAGTGTCATA AATAATAAAT ACCTGCTCAT CAGGAATGTG
TCCATCAGCG GAACTACCAA TGTTTCCCCA TTTGCCAGCT GGAGCTTAAC TGCAAACCAG
ACAGCCAGTG TTACCGGTCA GGTTACTGCA CCGGTACAGG TGCTTTCCGG ACTTAAGGTA
AACAATTACA TTTCCGGCAG CGGAGGTCAG CGCATTTTAC CGCTGAATGG GAACTGGCCT
GCTAATACCG GTGCAGATAG CAACAGGTAT GTGCAGTATG CTGTAAACCC GGCCGCGGGC
TATAATTTTG TGGTAAGCCA GGTTAAGGTG CCCCTTAGTT TTAATTCCTC TGCTTATGCA
CATGCGCGGA TCTGCTGGTC AACAGATGGG ACCACCTTTA CCAATTTAAA CCCCGATGTA
ACACTGACAT CTGGTTCAGT ACCTGCTGTA ATTACCTTTG CAAACCTGAA CATTCCGGTT
ACAGATAACC GTACTTTTTA CCTCCGGGTA TATCCATGGA CAACCAGCGG ATTCAGTGAT
GGCAGATACC TGGTCTCAAA AGATGTTGTC GTCAGTGGCA GTAGCGAGGT TAGTCCGCAG
CTGGCCTTTC CAGGAGCGGA AGGTGGAGGA CGTTATACCA AAGGCGGTCG CGGAGGGGAT
ATTTATTATG TGACCAATTT GAACGACAAC CTGGCCGGAA GCCTGCGCGA TGCGGTATCG
CAACCCAACC GTACGGTACT GTTTAAAGTA TCTGGCACGA TCAATCTGCA AAGTGCCATA
ACCATTACTA AAGATAACAT CACTATTGCT GGTCAGACTG CTCCGGGAGA CGGTATTTGC
CTGAAGAATT ATGGACTGGG CATACGTGCA AACCAGGTTA TCGTAAGGTA TATCCGTTCC
CGTCCGGGAG ATGTGATCAC AGTGCCGGGT GATTCTTCTA AAGTGGTGGA TGCGATGTAC
AACAATTTTG GCAGTCCCAT CAGTCAGCCT TACAACAACA TCATGATTGA TCATTGCTCT
ATGAGCTGGT CTACCGATGA AGTAGGCTCT TTTTATGCGG TTTCGAAGTT TACCCTGCAA
TGGAGTATGC TGAGTGAAAG CCTGTACCAG TCGCTCCATA CCAAAGGTAC CCCACACGGA
TATGGTGGAA TATGGGGTGG CCAGAACGCC TCGTTCCATC ATAATTTACT GGCAAGTAAC
TCTAACAGGA ACCCGCGTTT TTCCGGGAGC ACTACCAGTT TACAGCCGGA ACTTGAATAT
GCAGATTTTA GAAATAATGT GATCTTTAAC TGGGTAGGAT CGCCTTATGG CGGTGCCGGG
GGGCATTATA ACATGGTGAA CAATTACCAT AAGGCCGGGC CGGCAACTAC AGGTGGGGCT
GGCAGTTCGG CCACGAACCG TAAAAACCGC ATCCTTTTAT TCCCCAGCTT CAGCACTACC
CTGGCGGGGG ACACTGTGTT TGGTGCTAAA TTTTATATTG ACGGCAATTA CGTACATGGC
TTTCCGGATG TTACAGCTGA TAATTGGACA AAAGGTGTGC AGTTAGATAG TTATTATGAT
GCTGCAGCAA TGAAGGCTGC AGGAAAGGTA TCGACCGCTT TTCCTTACAG CCCTGTAGTA
ACGCAAACCG CTGAAGCAGC TTTTGATGCG GTCATGAACA GTGCCGGAGC CATTTTGCCC
CGCCGCGATA CGGTTGACAG ACGGATCATT AAAGAAACCA GAACGGGCAC CGCAACCTAT
GAGGACAGCA GCTATGTGGC TGCAGGTATG GGACACCCTT CGGGTATCAT AGACAGCCAG
AACACTGTTG GAGGCTGGCC TGTACTGAGC AGTACCACTT ATGCTAGAGA TACTGATAAT
GACGGCCTGC CGGATTGGTG GGAAAAAATG ACACAGGGTT CTGCTACAGA TTCGACCGGG
CTGGATAGGA ATACCTATGC TGCAGATGGT TATACCCTGC TTGAAAAATA CCTTAATGGT
ATACCCTCAC CCGATCAACA GGTTACGTTT ATGGCAATCA ATGCCCAAAA AGGCGGACTG
GATACCGTTA AGGTCGATTT TAATATTGAC TGGGCAAAGG ACCAGTTTAA GCTCGGATTA
TACAGGTCTA CAGACAGCCT GTCATTTACA AAAATTACAG AAATTACGGC TTCAATAAAT
CAAACTGCTT ATCTGTTGAA AGATAATGCC GCACCCGGGC AAACTGTTCA TTATAAAATT
GGGAGTAAAC GCATCGATGG TACGGGAAGC ACCGTTTACA GCAATACGGT AAGCATTCAC
CATAACCCTT TAATGCTGAA ACGTAGTTCA ACACTGCCTA AAATGCCGGA TACCAGTATT
GTAAAAGAAC TGAAAAAATT AAAGCTGTAT CCAAATCCGG TGACAGGCAT ATTAAAGGTT
AACCATTCAA AAGCCAACCC ATCGGCAATG ATGACTGTTT ATACCATAAC CGGCCGAAAA
GTAATTGTCA AATCCATTCA AAATGGCATA ATGCAAACTG AAATAGACGC TACTGATTTG
CCGCAGGGCT CTTATATCAT CGAGTTCAAT AATATCAGCG AAAGGCAAAG CGGGCTTTTC
ATAAAAATAT AA
 
Protein sequence
MATKVWCFFL CWFTLGNLHA QSPATVSWAL TANQSASVTG NITAAVQQLA GLTIQDYISG 
GQRTLPPGGT WPAQTVPDTT RFMQFALSPL SGNDLNVSSV ALMISFYGSS AGRVNMAWST
DSVHFTNLTS NFSLVSGTTP TSYTFGGLNI TVPSGKKLYF RISPWTASVI NNKYLLIRNV
SISGTTNVSP FASWSLTANQ TASVTGQVTA PVQVLSGLKV NNYISGSGGQ RILPLNGNWP
ANTGADSNRY VQYAVNPAAG YNFVVSQVKV PLSFNSSAYA HARICWSTDG TTFTNLNPDV
TLTSGSVPAV ITFANLNIPV TDNRTFYLRV YPWTTSGFSD GRYLVSKDVV VSGSSEVSPQ
LAFPGAEGGG RYTKGGRGGD IYYVTNLNDN LAGSLRDAVS QPNRTVLFKV SGTINLQSAI
TITKDNITIA GQTAPGDGIC LKNYGLGIRA NQVIVRYIRS RPGDVITVPG DSSKVVDAMY
NNFGSPISQP YNNIMIDHCS MSWSTDEVGS FYAVSKFTLQ WSMLSESLYQ SLHTKGTPHG
YGGIWGGQNA SFHHNLLASN SNRNPRFSGS TTSLQPELEY ADFRNNVIFN WVGSPYGGAG
GHYNMVNNYH KAGPATTGGA GSSATNRKNR ILLFPSFSTT LAGDTVFGAK FYIDGNYVHG
FPDVTADNWT KGVQLDSYYD AAAMKAAGKV STAFPYSPVV TQTAEAAFDA VMNSAGAILP
RRDTVDRRII KETRTGTATY EDSSYVAAGM GHPSGIIDSQ NTVGGWPVLS STTYARDTDN
DGLPDWWEKM TQGSATDSTG LDRNTYAADG YTLLEKYLNG IPSPDQQVTF MAINAQKGGL
DTVKVDFNID WAKDQFKLGL YRSTDSLSFT KITEITASIN QTAYLLKDNA APGQTVHYKI
GSKRIDGTGS TVYSNTVSIH HNPLMLKRSS TLPKMPDTSI VKELKKLKLY PNPVTGILKV
NHSKANPSAM MTVYTITGRK VIVKSIQNGI MQTEIDATDL PQGSYIIEFN NISERQSGLF
IKI