Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1840 |
Symbol | |
ID | 8252943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2130175 |
End bp | 2131791 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644935490 |
Product | hypothetical protein |
Protein accession | YP_003092110 |
Protein GI | 255531738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.511064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00000982396 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCATA TCGATTCGCG AATTCCACAG GTAAAAATGG CTTTAATGCT ATTTTTACTA TTAACCAGCA CATTTGTACA AGCCAAAGCA TCCAAAAAGG ATAAAGATCC AGGAAAAAGT GAATGGATTT ATTTTGGGGC TGACGGCAAG CTGGTTTATA AAAAAAGTGC CAAGGGCGAC CGCATAATGG ATTTCTCTCA TGCTGGCTAT ATGGGCGGTG GTGTGGCATT ACCAGATGTT ATTGTAAAAG TTACGGTTAA GCCACCGGGC GATGTGAATG CAGATTGTAC AGCCTTAATT CAGGCTGCTA TCGACAAGGT ATCGGCACTG CCATTGGATA AAAATGGTTT CCGCGGGGCT GTTTTACTGG CACCTGGTAC CTACCCTTGT GCCAAAACCA TAAAAATAAC AGCAGATGGT GTGGTACTCC GCGGAAGCGG CAAATCGGAA AACGGAAGTA TTATTGCCAT GAATGGTGAA AAACATACTG CTGTAATCCT ATCCAACGGC CTTAATCAAC GTGCCGGCAA TCGACTGGGT AATGCTGCAG GCAATGAAAA AACGGTTAAA ATAACAGATA AATATATCCC TGCCGGAAGC CTGTCTTTTA ATGTAGCAAA TGCAGCTGGA TTTAAGGTTG GTGATAATGT TGAGATCAGA AAACCGGTTA CCGACAAGTG GGTAAGCTTC ATGCACATGG ACGATCTGGT ACGGGATGGC AAGGCGCAAA CATGGATTAA AACAGGCTCT TTATTGATCA CCGAACGTCA CATATCAGCT ATAAACGGCA ATAAAATTAC TTTAGACGTT CCTTTGGTAG ATTCGTATGA TGTAAATTAT ACCAATGATG AAACCACAAT GGTATTGGCC AATGATGTAA AACGGCTTAA ACAGGCTGGC TTAGAAAATT TACGCATTGT ATCTCCCCCG CAAGCAGTTA ACCATACAAA GGCACTCTAC TATGCGGTTA GGCTAAACGG CGAGGATTGT TGGATGAAAG ATCTGGACCT GATGGAAACC ATGGAAAGCG TAGGAACCGG CGGACGCAGG ATTACACTGC AGCGCATCGC TGTCATCCGT AAGGCCTTAC ATGATGGTGC ATCTAAACCG GCAGAATTTG CACCCAATGC AGGGCAAATT CTGGTAGATC GCTGTTCGGT TGAAGGTGAT AACATCTGGT ATGTAGCACT GGGCGCCGGA CAAACAGGCC CCATCGTTTT TTTGAATTGT AATTTTGTTG GCAATGGCCG CATTGAAGGA CATCAGCGTT GGAGCACAGG CATGCTTTTA GACAATTGCA AGGCACCAAA TGGTGGTATG GACTTCAAAA ACAGGGGCAG TATGGGCTCT GGTCATGGCT GGGGAACTGC CTGGTCTGTT GCCTGGAATT GTGAAGCAGG CAGCTATGTA AATCAGATAC CTCCGGGCAC CTACAATTGG GTAATTGGCA GTAAAGGAAA AAGTATGCCT TTGCCAAGAC CATTTAACAA TGCTGGCCCG GCTTTGCCGG AGGGTATTTT TGATGCTCAA AACACCCGGG TCAATCCTTC AAGTTTATAT CTGGCCCAAC TGGAAGAACG TTTGGGTAAA CAGGCCCTGA AAGCAATAGG GTATTAA
|
Protein sequence | MKHIDSRIPQ VKMALMLFLL LTSTFVQAKA SKKDKDPGKS EWIYFGADGK LVYKKSAKGD RIMDFSHAGY MGGGVALPDV IVKVTVKPPG DVNADCTALI QAAIDKVSAL PLDKNGFRGA VLLAPGTYPC AKTIKITADG VVLRGSGKSE NGSIIAMNGE KHTAVILSNG LNQRAGNRLG NAAGNEKTVK ITDKYIPAGS LSFNVANAAG FKVGDNVEIR KPVTDKWVSF MHMDDLVRDG KAQTWIKTGS LLITERHISA INGNKITLDV PLVDSYDVNY TNDETTMVLA NDVKRLKQAG LENLRIVSPP QAVNHTKALY YAVRLNGEDC WMKDLDLMET MESVGTGGRR ITLQRIAVIR KALHDGASKP AEFAPNAGQI LVDRCSVEGD NIWYVALGAG QTGPIVFLNC NFVGNGRIEG HQRWSTGMLL DNCKAPNGGM DFKNRGSMGS GHGWGTAWSV AWNCEAGSYV NQIPPGTYNW VIGSKGKSMP LPRPFNNAGP ALPEGIFDAQ NTRVNPSSLY LAQLEERLGK QALKAIGY
|
| |