Gene Information |
Locus tag | Phep_4116 |
Symbol | |
ID | 8255250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4972339 |
End bp | 4974129 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937780 |
Product | hypothetical protein |
Protein accession | YP_003094369 |
Protein GI | 255533997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
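The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length counts one terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 4972339
end_bp = 4974129
gene_length_bp = 1791
protein_length_aa = 596


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Report whether the computed values agree with the listed fields.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, both computed values agree with the listed Gene Length and Protein Length.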
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.552536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGA CCAGCAAGCT TTATCAATTA ATTCTTTTTA GTGCATTAAC CGGAATTTCC GGCGCCGGAT TTTGTCAGCA AACGCATTGG GACAAAGGAC AAATATTTAC AGAGGCAAAC GATCCAGTAA AAACGGACAA CAAAAACTGG TTAGAGGTAA AACCGGGTTT ACATAGTTCT TTTGTATCTA TTGATAAACG TTATGCTAAA TCTGAAGTGC CTAATATTAA CATAGAGCGT TCAGTTCTTT TAACAGGTTG GAAAGGAGAG CGTTTGTCAG CCCAGGTGCT GCTCTGGACT ACAGATTCCA TTCCCGGAGT GAAAGTAACC TTATCTGATT TTATTTCTGA AAGTGGGAGT AAGCTCAAAT CAATCGGTTA TGCCCGGTTC GAGCGATATG TGCTGACAGA CGAATATGGA TCCGGATGGG CTTGCGGAAA ACGGAACGCA GCAGATTTTT CCAGTTCATT ATCTGCAGAT ATGCTGGATG ACCTTTCGTC CTTCAATCTG GAAAAGAAAA AAGTAAGACC CGTGTGGATC ACCCTAGAAA TCCCCAGAAA GGCAGAGCAA GGCTCCTATT CGGCGAAGGT ACAAATAACT ACCAAACAGG GAAAGCAGCA GGAACTGAAC TTATCCCTGG ATGTTATCAA TCAGCTGTTA CCGCAACCAT CTTCGTGGAG CTTTCACCTC GACCAATGGC AACACCCCTC AGCAGTTGCC CGTGTAAATA AATTACCTGT ATGGAGCGAG GCGCATTTCG AAGCCATGAG ACCACAAATG CAGCTCCTGG CAAACGCCGG TCAGAAAGTA ATTACGGCTA CATTGAACAA AGACCCCTGG AATATTCAAA CTTACGATCC TTATGAAGAT ATGATCATCT GGACAAAGGG AAAGGACGGA AGCTGGTCGT ACGACTACCG GATTTTTGAT AAGTGGGTAT CTTTTATGAT GGGCCTTGGA GTAAAGAAGA TGATCAACTG TTATTCTATT GTTCCCTGGA ATAATGAAAT CCATTATAAA GATGCCATCA CAAACAAGTT TGTAAACATA GTGGCCAAGC CTGGTACTAC GGCGTTTACC GAAATGTGGG AACCGTTCCT GAAAGATTTT GCAAAACATC TGCAGCAAAA AGGCTGGCTT GAAATTACAA ACATTGCCCT GGATGAAAGA AATAAGGATG AGATGGGAAT GGCTTTTGCA CTGATAGAAA AGGTAGCCCC AAAACTTGGC GTTGCCTATG CTGATAATCA AAAGACCTAT AAGCGTTATA CCAACAGTGA TGATGTGAGT ACCGCGGTTC AACATCCTAT AGATGACAAA GATATTGCAG AGCGTAGAAG CAAGGGATTG AACACTACCT TTTATATCTA TTGTGGAAAT AGTTTTCCCA ATCAATTTAC TTTTTCTGAG CCCGCTGAGT CTGCTTATTT AGGCTGGTAT ACCCTGGCTA CAGGTTATAA TGGCGTGCTG CGCTGGGCTT ATAATTCCTG GGTGGAAAAT CCTTTGGTAG ACTCCCGATT CAGAACATGG CCTGCAGGAG ACACCTATAT TACTTATCCG CAAGCCAGAA GTTCTATCAG ATACGAGCGT ATGCTGGAAG GTATCCAGGA CTATGAGAAA GTGCTTGTGG TAAAGAAAAT GCTGGAACAT AAGAATGACC TGGCTACTTT AGCGAAATTG AATGATGCCA TTGCGAAATT GAAAAGCCAT TCCCGATATG AAGGTTGGAA TAGTGATTTA AATGCAGCAA AGCAATTGCT CAACAACATT TCAGTATCCT TATCGAAATA G
|
Protein sequence | MKVTSKLYQL ILFSALTGIS GAGFCQQTHW DKGQIFTEAN DPVKTDNKNW LEVKPGLHSS FVSIDKRYAK SEVPNINIER SVLLTGWKGE RLSAQVLLWT TDSIPGVKVT LSDFISESGS KLKSIGYARF ERYVLTDEYG SGWACGKRNA ADFSSSLSAD MLDDLSSFNL EKKKVRPVWI TLEIPRKAEQ GSYSAKVQIT TKQGKQQELN LSLDVINQLL PQPSSWSFHL DQWQHPSAVA RVNKLPVWSE AHFEAMRPQM QLLANAGQKV ITATLNKDPW NIQTYDPYED MIIWTKGKDG SWSYDYRIFD KWVSFMMGLG VKKMINCYSI VPWNNEIHYK DAITNKFVNI VAKPGTTAFT EMWEPFLKDF AKHLQQKGWL EITNIALDER NKDEMGMAFA LIEKVAPKLG VAYADNQKTY KRYTNSDDVS TAVQHPIDDK DIAERRSKGL NTTFYIYCGN SFPNQFTFSE PAESAYLGWY TLATGYNGVL RWAYNSWVEN PLVDSRFRTW PAGDTYITYP QARSSIRYER MLEGIQDYEK VLVVKKMLEH KNDLATLAKL NDAIAKLKSH SRYEGWNSDL NAAKQLLNNI SVSLSK