Gene Phep_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1697 
Symbol 
ID8252799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2006807 
End bp2008207 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content38% 
IMG OID644935349 
Producthypothetical protein 
Protein accessionYP_003091970 
Protein GI255531598 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.472344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0751505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATGT TTTCAGGTAA TCATTTTAAA ATCAAGTATA AACAGGCAGG TATATGTTTG 
CTTTTAGTAA CGGCTATGAG TTGTCAGCAA AAAACTTCAG GATCTGATGA ACTAAAGATC
AGTTTACTTA CACTGGATCC GGGGCATTTT CATGCTGCCC TCATACAAAA ATCGATGTAT
AAGGAGATTA GCCCTGTGGT ACATGTGTAT GCACCTGAAG GACCGGAGCT GGAAGCCCAT
TTGAAGCTGA TTGAAAAATA CAACAGCAGG GCAGAAGATC CTACTAAATG GCAGGAAGTG
GTTTATAAAG GAGAGGATTA CCTTCATAAG ATGCTGGAAG AAAAGAAAGG GAATGTGGTA
ATTTTAGCAG GCAATAACCA AAGGAAAACA GAGTACATTC AAAAATCCAT AGATGCGGGT
ATAAATGTAC TGGCTGATAA ACCGATGGTT ATTGATGGCA AAGGATTTGA ACAACTGGAA
AAGTCGTTTG AATCTGCACA AAAAAATAAG GTAATATTAT ACGACATTAT GACTGAGCGC
TATGAAATTA CCAATATGCT GCAAAAGGAA TTTTCATTAC AACAGGATGT ATTTGGAACA
CTTGAAAAAG GGACTGCCGA AAAACCAGCT ATTACTAAAG AAAGTGTACA TCATTTCTTT
AAAAATGTTT CGGGAGCCCC ATTGATAAGG CCACAATGGT ATTTTGATGT AGACCAGGAA
GGAAACGGAT TGGTTGACGT AACTACACAT CTTGTAGATA TGATCCAATG GGAATGCTTT
TCCGATCAGC AGATCGACTA TAAAAAGGAT GTGAATATGC TGTCTGCAAA ACGCTGGACC
ACTCAGATTA CCCCTTCACA ATTTAAAAAA AGTACGGGTG CAGGCAGTTA TCCTGCTTTT
CTAAAAAAAG ATGTAAAAGA TAGCTTGCTG AACGTTTATT CAAATGGAGA AATGAACTAT
ACCCTTAAAG GAGTACATGC AAGGGTTTCC GTGATCTGGA ATTTTGAAGC ACCTGAAGGT
ACAGGAGATA CACATTATTC GGTAATGCAT GGGACCAGGG CCAGTCTGAT TATAAAACAG
GGACCTGAAC AGCAATATAA GCCTACTTTA TATATAGAGT CCCAAAAGGT TGATGACAAA
GACTATGCTG CAGCTTTAAA GCAAAGTGTG GAAAAGATAG CCAAAACCTA TCCGGGATTA
GAGCTGAAAG CTTATAAAGG AGGCTGGGAG GTAGTAATTC CGGAAAAATA CAAAGTGGGG
CATGAGGCGC ATTTTGCTGA AGTAGCTAAA AAATATTTAG GCTTTTTAAA AGAAGGAAGA
CTTCCTGAAT GGGAAAAGGC TGCTTTATTG AGTAAATACT ATACCACTAC AAAAGCCCTG
GAATTTGCCG TAAAAAAATA A
 
Protein sequence
MSMFSGNHFK IKYKQAGICL LLVTAMSCQQ KTSGSDELKI SLLTLDPGHF HAALIQKSMY 
KEISPVVHVY APEGPELEAH LKLIEKYNSR AEDPTKWQEV VYKGEDYLHK MLEEKKGNVV
ILAGNNQRKT EYIQKSIDAG INVLADKPMV IDGKGFEQLE KSFESAQKNK VILYDIMTER
YEITNMLQKE FSLQQDVFGT LEKGTAEKPA ITKESVHHFF KNVSGAPLIR PQWYFDVDQE
GNGLVDVTTH LVDMIQWECF SDQQIDYKKD VNMLSAKRWT TQITPSQFKK STGAGSYPAF
LKKDVKDSLL NVYSNGEMNY TLKGVHARVS VIWNFEAPEG TGDTHYSVMH GTRASLIIKQ
GPEQQYKPTL YIESQKVDDK DYAAALKQSV EKIAKTYPGL ELKAYKGGWE VVIPEKYKVG
HEAHFAEVAK KYLGFLKEGR LPEWEKAALL SKYYTTTKAL EFAVKK