Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4066 |
Symbol | |
ID | 8255200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4908590 |
End bp | 4909906 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937730 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003094319 |
Protein GI | 255533947 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.659836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCAT TTAAAATATT TTTTACCTTT TTTCTGGTTG CCCTTAATGG CTTTTTTGTT GCTGCAGAGT TTGCAATAGT AAAAGTTCGG GCATCCCAGA TTGAGATAAA AGCTAAGTCT GGTAGCCGGG TTGCGAATAT CGCAAAGCAC ATTACCCAGC ATTTAGATGG TTATCTGGCC GCTACACAAC TGGGGATCAC TTTGGCCTCA CTGGGTTTGG GTTGGGTTGG CGAGTCGGTC ATGCATAGCA TTGTACACGA CCTGCTGATC AATTTCTCCC TTTCGGAGAT CTATATCACC TCTATTTCTA CCGGAATAGC CTTCCTGTTC ATTACGGTTA TGCACATTGT TTTTGGAGAA CTGGCCCCTA AATCGGTCGC CATTCAAAGG CCTGTGGCCA CTACCCTTTT TATCGCACTG CCGCTGCAAG GTTTTTATTT GATATTCAGG CCATTTATCT GGGTATTAAA CGGATTTGCG AATGTAGTAC TTAAATTATT TGGGATCTCC AATGTTGGCG GACATGATTC TGTTCACAGT ACTGAAGAGC TTTATTATTT GCTGGACCAG GGTAAGGAAA GCGGTGCGCT TGACACCAAT GAACATGAGC TGATTAAAAA CGTTTTTGAT TTTAATGAGC GCGTGGTAAA AAATATTATG GTTCCAAGAA CCAAAATTAT GGGCGTAGAG CTTTCTACCC CAAAAAGAGA AGTTGTAGAA AAGATCATTG CAGAAGGATA TTCCCGTTTG CCGGTGTATG ATGATATTAT TGATAAGATC ATCGGTATTG TGCATGCGAA GGATATCCTT CCTTTACTGG CCGACAATAA GGAATGGGTG CTGGCCGACA TCATCAGGAA GCCTTATTTT GTACCCGAGA CCAAGAAGAT CAACGACCTG CTGAGCGAGC TTCAGCAAAA ACGCATACAG ATAGCCATTG TGATCGACGA GTTTGGTGGC ACAGCCGGTA TGGTTACCCT CGAGGATATT GTGGAAGAGA TCGTTGGGGA GATCCAGGAT GAGTACGATG AAGAAAAGCC GACTGTAGAG AAAATATCGG ATACGGAGTT TATTATCAAT GCTTACGCTA CTGTATACGA TGTAAACGAG CACCTGCCAC ACGACCTGCC GGAAGATGAA GATTTTGATA CGGTAGGAGG GCTGGTCTCC CATGCTTTTG GCAAAATACC TGAAGTGGGC GACAGTGAAG AATGTTATGG CTATTTATTT ACCATTTTAA AGAAAACGGA ACAAAATATA GAGACCATAA AGCTGGAACT GGTGATCAAT AAGAGTGATA TGATCGATCT ACACTAA
|
Protein sequence | MEAFKIFFTF FLVALNGFFV AAEFAIVKVR ASQIEIKAKS GSRVANIAKH ITQHLDGYLA ATQLGITLAS LGLGWVGESV MHSIVHDLLI NFSLSEIYIT SISTGIAFLF ITVMHIVFGE LAPKSVAIQR PVATTLFIAL PLQGFYLIFR PFIWVLNGFA NVVLKLFGIS NVGGHDSVHS TEELYYLLDQ GKESGALDTN EHELIKNVFD FNERVVKNIM VPRTKIMGVE LSTPKREVVE KIIAEGYSRL PVYDDIIDKI IGIVHAKDIL PLLADNKEWV LADIIRKPYF VPETKKINDL LSELQQKRIQ IAIVIDEFGG TAGMVTLEDI VEEIVGEIQD EYDEEKPTVE KISDTEFIIN AYATVYDVNE HLPHDLPEDE DFDTVGGLVS HAFGKIPEVG DSEECYGYLF TILKKTEQNI ETIKLELVIN KSDMIDLH
|
| |