Gene Phep_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4066 
Symbol 
ID8255200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4908590 
End bp4909906 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content43% 
IMG OID644937730 
Productprotein of unknown function DUF21 
Protein accessionYP_003094319 
Protein GI255533947 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.659836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCAT TTAAAATATT TTTTACCTTT TTTCTGGTTG CCCTTAATGG CTTTTTTGTT 
GCTGCAGAGT TTGCAATAGT AAAAGTTCGG GCATCCCAGA TTGAGATAAA AGCTAAGTCT
GGTAGCCGGG TTGCGAATAT CGCAAAGCAC ATTACCCAGC ATTTAGATGG TTATCTGGCC
GCTACACAAC TGGGGATCAC TTTGGCCTCA CTGGGTTTGG GTTGGGTTGG CGAGTCGGTC
ATGCATAGCA TTGTACACGA CCTGCTGATC AATTTCTCCC TTTCGGAGAT CTATATCACC
TCTATTTCTA CCGGAATAGC CTTCCTGTTC ATTACGGTTA TGCACATTGT TTTTGGAGAA
CTGGCCCCTA AATCGGTCGC CATTCAAAGG CCTGTGGCCA CTACCCTTTT TATCGCACTG
CCGCTGCAAG GTTTTTATTT GATATTCAGG CCATTTATCT GGGTATTAAA CGGATTTGCG
AATGTAGTAC TTAAATTATT TGGGATCTCC AATGTTGGCG GACATGATTC TGTTCACAGT
ACTGAAGAGC TTTATTATTT GCTGGACCAG GGTAAGGAAA GCGGTGCGCT TGACACCAAT
GAACATGAGC TGATTAAAAA CGTTTTTGAT TTTAATGAGC GCGTGGTAAA AAATATTATG
GTTCCAAGAA CCAAAATTAT GGGCGTAGAG CTTTCTACCC CAAAAAGAGA AGTTGTAGAA
AAGATCATTG CAGAAGGATA TTCCCGTTTG CCGGTGTATG ATGATATTAT TGATAAGATC
ATCGGTATTG TGCATGCGAA GGATATCCTT CCTTTACTGG CCGACAATAA GGAATGGGTG
CTGGCCGACA TCATCAGGAA GCCTTATTTT GTACCCGAGA CCAAGAAGAT CAACGACCTG
CTGAGCGAGC TTCAGCAAAA ACGCATACAG ATAGCCATTG TGATCGACGA GTTTGGTGGC
ACAGCCGGTA TGGTTACCCT CGAGGATATT GTGGAAGAGA TCGTTGGGGA GATCCAGGAT
GAGTACGATG AAGAAAAGCC GACTGTAGAG AAAATATCGG ATACGGAGTT TATTATCAAT
GCTTACGCTA CTGTATACGA TGTAAACGAG CACCTGCCAC ACGACCTGCC GGAAGATGAA
GATTTTGATA CGGTAGGAGG GCTGGTCTCC CATGCTTTTG GCAAAATACC TGAAGTGGGC
GACAGTGAAG AATGTTATGG CTATTTATTT ACCATTTTAA AGAAAACGGA ACAAAATATA
GAGACCATAA AGCTGGAACT GGTGATCAAT AAGAGTGATA TGATCGATCT ACACTAA
 
Protein sequence
MEAFKIFFTF FLVALNGFFV AAEFAIVKVR ASQIEIKAKS GSRVANIAKH ITQHLDGYLA 
ATQLGITLAS LGLGWVGESV MHSIVHDLLI NFSLSEIYIT SISTGIAFLF ITVMHIVFGE
LAPKSVAIQR PVATTLFIAL PLQGFYLIFR PFIWVLNGFA NVVLKLFGIS NVGGHDSVHS
TEELYYLLDQ GKESGALDTN EHELIKNVFD FNERVVKNIM VPRTKIMGVE LSTPKREVVE
KIIAEGYSRL PVYDDIIDKI IGIVHAKDIL PLLADNKEWV LADIIRKPYF VPETKKINDL
LSELQQKRIQ IAIVIDEFGG TAGMVTLEDI VEEIVGEIQD EYDEEKPTVE KISDTEFIIN
AYATVYDVNE HLPHDLPEDE DFDTVGGLVS HAFGKIPEVG DSEECYGYLF TILKKTEQNI
ETIKLELVIN KSDMIDLH