Gene Phep_0567 details

Gene Information

Locus tag: Phep_0567
Symbol:
ID: 8251654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 679040
End bp: 680401
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 45%
IMG OID: 644934215
Product: NHL repeat containing protein
Protein accession: YP_003090851
Protein GI: 255530479
COG category:
COG ID:
TIGRFAM ID:
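
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the Gene Information fields above.
START_BP = 679040
END_BP = 680401
STOP_CODON_LENGTH = 3  # bases in the terminal stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length - STOP_CODON_LENGTH) // 3


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1362 453
```

Under these assumptions the result agrees with the listed Gene Length (1362 bp) and Protein Length (453 aa).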


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00166551
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACT ACCGAAAGAG TATGGATAGG TGTTTTGCCG CCTTCCTATG GGTATGCTTC 
AGTTCCTGTT GTTTTGTTTC CTGTAAAAGC GAAAAGGAAG GCCCTAGTAA AACCTATGAT
TCTTCAAAGC CCGTTGTACT CTCCTCTTTT TCTCCGAATG AGGGAGGAGC GCGCGATAAA
ATACTTTTAG ACGGGGAAAA CTTTGGAAAT GACCCAAGTA AAATCAAAGT CTATTTTAAC
CATGCAAAAG CTTCAGTTAT CTCGTCAAGT GGTAACCGGA TCTATGCCAT TGTACCCCGG
CTTCCAGGTG ATAATCCTAA GATTTCTGTG GTTGTGGGTA CTGACTCGGT TGTTTATAAA
GACACTTTTA CTTACCATAT TCAGGCTCAG GTAAGCACAG TTACGGGTAA CGGGCAAAAA
AACTTTAAAC CCGGAACGCT CTCTGAAGCT GAAGTTTATG GTAAATACCT GGAACTGGAT
GCCGAGGGAA ATATATTTAT GTCCTGGCGG GATGGTGGCA ATCCGGCTAC ATTTGGTGTA
GCCAGGATAA ATGAAAAGCA AAATATAGTT ACTCCACTGA TCGAGTCGGT CGCTGCCGGT
CGGATCTTAT ATGCCAACGG CTTAACTGTG GATCGTGTTA CTGGTATGCT AACAGCCGCT
CATGAGTCTA CAAAGGAGGT TATTTTTACA TTCGACCCCA GGGAGGCCTG GTACCCGCGT
CAGCGTAACA TCAAATACTC TACGGCAGAT TATAATTCCA TTGTTACTGC GGATCTGTAC
AAAAACTTCG TGACTTATTG TCCGTATGAT GGGTATCTTT ACACGAGATA CAGAGATGGA
AAGGTTGCCA AAATAAATCC TCAGACATTC GAAGCAAAAA TTGTTCACCA GGGACCTTAT
GGATCGCAGT ATGGTCAGGC CATTAATCCA GTAAAACCAT GGCTCCTGTA CATCACGCTC
ACTACCAATG CCACACCGAC CAATTTCAGA CAAGGTATTA TGGTACTGGA TCTTCGTGAC
CCTAATGGAT CAGGCGGCTT TAAACGCCTC AATGCCCCTG GAGGTAGTGC CTTCCGCGAT
GGGCCGTTGG CAGACGCACT GTTCAACGAT CCTAAAGAGA TTAAGTTTGA CAACAGCGGA
AACATGTTTG TTGCAGACTA TGGCAACCAT TGTATCCGTA TGGTATCGGC TGATAATATC
GTAACAACGG TAGCAGGCCA ACCGGGCAAA TCAGGCTATA AAGATGGCGG ACCGGTAGAA
TCTCTATTTA ACCAACCCTG GGGAGTGGCT GTCAATGAGC AAGGTGACAT TTATATTGCA
GACTGGAGTA ACGCCAGGAT ACGCAAATTA GTTATTGAAT AA
 
Protein sequence
MKNYRKSMDR CFAAFLWVCF SSCCFVSCKS EKEGPSKTYD SSKPVVLSSF SPNEGGARDK 
ILLDGENFGN DPSKIKVYFN HAKASVISSS GNRIYAIVPR LPGDNPKISV VVGTDSVVYK
DTFTYHIQAQ VSTVTGNGQK NFKPGTLSEA EVYGKYLELD AEGNIFMSWR DGGNPATFGV
ARINEKQNIV TPLIESVAAG RILYANGLTV DRVTGMLTAA HESTKEVIFT FDPREAWYPR
QRNIKYSTAD YNSIVTADLY KNFVTYCPYD GYLYTRYRDG KVAKINPQTF EAKIVHQGPY
GSQYGQAINP VKPWLLYITL TTNATPTNFR QGIMVLDLRD PNGSGGFKRL NAPGGSAFRD
GPLADALFND PKEIKFDNSG NMFVADYGNH CIRMVSADNI VTTVAGQPGK SGYKDGGPVE
SLFNQPWGVA VNEQGDIYIA DWSNARIRKL VIE
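
The sequences above are printed in blocks of 10 characters, 60 per row, so the whitespace has to be removed before they can be analysed. The following is a minimal sketch of that clean-up, together with a GC fraction calculation and a simple completeness check. It assumes the stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical and not taken from any library.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Last row of the gene sequence listing above.
    last_row = clean_sequence("GACTGGAGTA ACGCCAGGAT ACGCAAATTA GTTATTGAAT AA")
    print(len(last_row), looks_like_complete_cds(last_row))  # prints: 42 True
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.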