Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0377 |
Symbol | |
ID | 8251462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 447256 |
End bp | 448524 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644934025 |
Product | protein of unknown function DUF1501 |
Protein accession | YP_003090663 |
Protein GI | 255530291 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAA GAAGAGGATT TATAAAAGCA GGGGGACTGG CCTTATTTGG AATTGGATTG GGCGGGATCC CGGGGTTTTT GGCAGAAGCG GTTGCCAGTA CAAAAGCACC GGGTTTGTTT AAAAGAAAAA AAATACTTGT TTGCATTTTT CAGCGTGGGG CAATGGATGG GCTGATGGCT GTAACCCCAT TTACAGATCA ATACCTTAGG GCAGCAAGGC CAACATTGTT TATGGATGCT GCAAGAGGTA ATGGAAAGCG CACTCCGCTG ATTGATCTGG ATGGCCGTTT TGGTTTGCAC CCCTCTATGG CTGCATTTGA AAAAGTGTTC AGGGAAAAAA GAATGGCTAT TGTACATGGC ATCGGTTCAC CAAATACTAC CCGCTCGCAT TTTGATGCAC AGGATTTTAT GGAATCGGGT ACACCTTTCA GAAAGGGTAC AGATAGCGGC TGGTTAAACC GTGCCGTAGG CTTACTGGGA CATGAAGCGG CCACACCATT TCAGGGGGTG AGTTTAACCT CGTCACTGCC AAGGTCATTT TATGGTGATA ATCCGGCGGT GGCCATCAGT AATCTACAGG ATTTTAACAT CCAGCTGCGT GGAAATGTGA AGGGGGCCAA TATGGCAGCT AAAAGCTTTG AAGACTTATA TGATACCACT TCGTCTGGTT TGTTAAAAGA AACCGGTAAA GAGAGTTTTG ATGCGATAAA AATGCTTCAA AAGGTGGATA CCAAGAATTA CTCTCCTTCA AACAACGCCA TATATCCAAA TACAGCATTG GGCAATTCAT TAAAACAAAT TGCCCAGTTG ATTAAAATGG ATGTTGGGAT GGAGGTTGCT TTTGCTGAAT CTGGTGGCTG GGATACCCAC TTTAATCAGG GTGCAGAAAC CGGGATTTTT GCAAGAAATG TGAACGACCT GAGTAATAGT ATAATGGCGT TCTGGACTGA TATGGGAACT TATCAGGATG ATGTTACAGT AATGACCATG ACCGAGTTTG GCCGTACGGT AAAACAGAAC GGGACTGGGG GAACAGATCA TGGCAGGGGA TCCTGTAACT TTATTTTGGG GAACGGGGTG AGCGGCGGCC TGGTGCACGG TTTGGTAAAC CCGCTGGCAG TTGAGAACCT GGAAGATGGG CGTGATCTGG CCGTTACTAC AGATTTTAGA AGTGTTTTTA GTGAAGTAGC GGATAAGCAC CTGAACATCA ATAACGATAA GGTGCTTTTT CCGGACTGGG ATGGTAACAA AATTGGTGTA ATGCGCTAG
|
Protein sequence | MTSRRGFIKA GGLALFGIGL GGIPGFLAEA VASTKAPGLF KRKKILVCIF QRGAMDGLMA VTPFTDQYLR AARPTLFMDA ARGNGKRTPL IDLDGRFGLH PSMAAFEKVF REKRMAIVHG IGSPNTTRSH FDAQDFMESG TPFRKGTDSG WLNRAVGLLG HEAATPFQGV SLTSSLPRSF YGDNPAVAIS NLQDFNIQLR GNVKGANMAA KSFEDLYDTT SSGLLKETGK ESFDAIKMLQ KVDTKNYSPS NNAIYPNTAL GNSLKQIAQL IKMDVGMEVA FAESGGWDTH FNQGAETGIF ARNVNDLSNS IMAFWTDMGT YQDDVTVMTM TEFGRTVKQN GTGGTDHGRG SCNFILGNGV SGGLVHGLVN PLAVENLEDG RDLAVTTDFR SVFSEVADKH LNINNDKVLF PDWDGNKIGV MR
|
| |