Gene Information |
Locus tag | Phep_2672 |
Symbol | |
ID | 8253780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3127998 |
End bp | 3131420 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936320 |
Product | protein of unknown function DUF1080 |
Protein accession | YP_003092935 |
Protein GI | 255532563 |
COG category | |
COG ID | |
TIGRFAM ID | |
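The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3127998, 3131420)
print(length, protein_length(length))
```

With the values from this record, the two results correspond to the Gene Length (3423 bp) and Protein Length (1140 aa) fields.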
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.189319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAAA AGATATTCTT TATTCTGATT GCTGCGGTAA TGCTGCAAAG TGCTGCTTTT GCACAGGATA AAACAGACCA GCGTACCGTT ACGACCAGGA TTGCAGATTT GCTGGCACAA TTACCTGCAA GGGATGCCAA GCAATTGAAA GCCAATATGC TGGAAATTGC GCAGATGGGT GAGGATGGTT ATGTAAGCCT GATTACCGGG CTTACTGCTC CGGGTAAGGG CAATAACGCC TTGCTGGAAT ATGCGATAGG CGGATTTTCA GGCTATGTTA GCCAGACGGG ACAGGAAGCC TGGAGAAAAA TGAGTGTAAA TGCTTATTGT AAGGCATTAA GCAAAATAAC CGATAAACAA AATAAATCAT TTGTGATCAG CCAGCTTGAA CTGGTTGGTA AAGATGATGC CATAGCTTGT CTGGAGCCTT ATTTAACAGA TGCACAGCTG GCTGATCCTG CTGCACGTGC CCTGGTAAAA ATCAATTCAG CTGCGGCCAA AGCGGCCTTA TTAGCGGCAT TGACTAAAAC CAGCGGCACC GCTAAACTTT CAGTAGTAGA GGCGCTTGGA GACATCAGGG CTAAAGATGC CGCAAAACCT ATTGCTGCTT TAACAACAGG TGACAATGAC CTGGCCAAAA TGTCTTTATA CGCCCTGGCT TATATTGCCG ATCCTGCTTC TGAAGCTGTA ATGGCTGCAG CCGCCGAAAA AAGTGGATTT AAGTATGAAA ATACAAATGC TGTTGCGGCT TATCTGATCT ATGCTGAGCA GCTGATGAAA AATGGTAATA AGGAGCTTGC CAATAATATT GCAAAAAAGA TACTTGAAAA AGCAACTGCC GACGAGCAGG TACATGTGCG TACAGCTGCG TTGAAGATTG TTTCCGGTTT TAGTGAAGCC CAAAGTAATG AATATTTACT GGGCGCAATG GATGATAAAA ACTTTGAATA CCGTGCTGCT GCATTAAAAT TTGCTTTACC AACCCTTACT CCTGCAACAG CTGAGCTTTG GGCCGGAAAA GTTACCAAAG CTGATCCTGC TACTCAGGTG GCCATTATCA ATATGCTGGG CAAGAGCAAA ACTCTTTCGG TACTACCTTC AATAACTAAA TTGTTTAAAA ATAAGGACCA GGGGGTAAGA GCAGCTGCAA TTGCAGCTGC CGGCAACATT GGCCAGGAAC AGGCACTGGA AGATTTGCTT AAAATAATGG GAAAAGGTGA TGCAAATGAT ATTGCTGCAG TTTCCAATGC AATTTTAAGA ATGAAAGGAG AGGGTATCAA TGCAAAAATT GCCGCTTTTA TCCCCAAAGC AAAACCAGAG GTTCAGGTAG CTCTGATCAA TGTGCTCGCT TCGAAATCGG CAAATGGACA GTTAAACACC ATATATAGCT TGCTGAAAAG TAAAAAGCCT GAAGTTAGGC AGGCTGCTTT TGCTGCTTTA AAGCAAACGG TGGCCAGCGA TAACCTGCCG CAGTTGTTTA CTTTACTGAA TGAGACAAAG GATCAGACGG CCCTGGTTAA TGTACAGGAG GCAATTATCT CAGCCCTGAA AGGCAGTAAA AATAAAGATG AACAAGCTGA TATGGTATTA CAGCAAATGG CTGCTGCTCC TGGAGATAAA AAAGATCTTT TTTATAAAAT ACTGGGCAGC ATAGGTGGTA ACAAATCATT AAAAGCAGTT TCTGAAGCGT TTAATACCGG TAATGAAGAA ACTAAAAAGG CTGCAATTGT AGCTTTGTCG TCATGGACAG ATACCGGTTC AATACCTGAA CTGATCAGGA TCAGTCGCCA GCCCAGCAAT 
GTTGCTTACC TTGATCAGGC AATTGAGGGC TACCTGAATT TGGTAAGGGC TGCAAAATAT AAACCTGAGC AGCGCTTGTT GGTGCTGCGC GAAGCCATGA TCGTAGCTAA AACACCAGTA CAGCAACAGC AAATATTAAA AGATGCTGAG CAGGCAAAGT GTTTCAATAC TTTATTATTT GCGGGTAGAT ACCTTGACAA CCAGACTTTG CAGCAGGCCG CTGCCAATAC TGTAATGAAT GTTGCGCTGG CTGATAAATC ATACAGGGGA ACCATTGTGA AAGACCTGCT GAATAAAGCG ATTGGTACAA TAAAAGGAGG TGACAGCGAG TATCAGAAGG AAGCCATGCG TAAATACCTT GCAGAAATGC CTGCAGGAGA AGGTTTTGTA TCGATGTTTA ACGGAACTGA CCTGAGCGGA TGGAAAGGTT TGGTGGAGAA TCCGATAAAA AGATCGAAAA TGGATGCCAA GAGCCTGGCC GATGCACAGG TAAAGGCTGA CGCCGAAGCA AAAGAAAGCT GGAAACCGAT GAATGGTGAA CTGCATTTTA TGAGCCATGG AAATAACCTG GCTACTGTAA AGCAATATGG TGATTTTGAA ATGCTGGTAG ACTGGAAAAT TATAGATGAC AAAAAAGGTA ACGGAGATGC TGGTATCTAT CTTCGTGGTT CGCCGCAAGT ACAGATCTGG GATACTGCCA GGGTAAAATC CGGAGCACAG GTAGGTTCCG GTGGATTATA CAACAATAAA GTATATGAGA GCAAACCCCT TAAGGTTGCC GACAATAAGC TGGATGAATG GAATACCTTC CGTATTTTAA TGAAAGGTGA TAGGGTTACC GTTTATTTAA ACGGAGAACT GGTAACGGAC AATGTGATTT TGGAGAATTA CTGGGACAGA AATCTTCCAA TCTTTGCAGA AGAACAGATA GAACTGCAGG CACATGGATC GCCTGTGGCC TATCGTGACA TATACATCAG GGAAATTCCC CGTGTAAAAC CTTTTGAATT GAGTGCCCAG GAGAAGAAAG AAGGCTATAA AGTACTGTTT GATGGTACCA ATATGCACAA CTGGACCGGA AATACAACAG ATTATATCAT TGAAGATGGA AACATTTCCA TTCGCCCGAG ACCAGGAAAA GGATCTGGAG GTAATTTATT TACCAAGGAA GAATTCAGTG ATTTTATATT CCGCTTTGAA TTTCAGTTAA CACCTGGTGC AAACAATGGG TTGGGGATCA GGGCACCACT GACAGGGGAT GCTGCTTATC AAGGTATGGA GCTGCAGATA CTTGACAATG AGGCACCGAT GTATAAAAAC CTGCATGTTT ATCAGTACCA CGGTTCTGTT TATGGAACTA TCCCTGCAAA AAGAGGTTTT CTGAAACCTG TTGGCGAGTG GAATTATGAA GAAGTTGTAG TGAATGGGCC TAAAATTAAG GTGATCCTGA ATGGAACTGT GATTTTGGAC GGGGACATTA CTGATGCAAG AAAAAATGGT GCAGCTGATG GAAAGCCACA TCCGGGTTTG TTACGTGAAA GCGGACATAT CGGTTTTCTG GGACATGGTT CACCGGTACA GTTTAAAAAC ATCAGGATTA AGGACCTGAG TAAAAAGAAA TAA
|
Protein sequence | MIKKIFFILI AAVMLQSAAF AQDKTDQRTV TTRIADLLAQ LPARDAKQLK ANMLEIAQMG EDGYVSLITG LTAPGKGNNA LLEYAIGGFS GYVSQTGQEA WRKMSVNAYC KALSKITDKQ NKSFVISQLE LVGKDDAIAC LEPYLTDAQL ADPAARALVK INSAAAKAAL LAALTKTSGT AKLSVVEALG DIRAKDAAKP IAALTTGDND LAKMSLYALA YIADPASEAV MAAAAEKSGF KYENTNAVAA YLIYAEQLMK NGNKELANNI AKKILEKATA DEQVHVRTAA LKIVSGFSEA QSNEYLLGAM DDKNFEYRAA ALKFALPTLT PATAELWAGK VTKADPATQV AIINMLGKSK TLSVLPSITK LFKNKDQGVR AAAIAAAGNI GQEQALEDLL KIMGKGDAND IAAVSNAILR MKGEGINAKI AAFIPKAKPE VQVALINVLA SKSANGQLNT IYSLLKSKKP EVRQAAFAAL KQTVASDNLP QLFTLLNETK DQTALVNVQE AIISALKGSK NKDEQADMVL QQMAAAPGDK KDLFYKILGS IGGNKSLKAV SEAFNTGNEE TKKAAIVALS SWTDTGSIPE LIRISRQPSN VAYLDQAIEG YLNLVRAAKY KPEQRLLVLR EAMIVAKTPV QQQQILKDAE QAKCFNTLLF AGRYLDNQTL QQAAANTVMN VALADKSYRG TIVKDLLNKA IGTIKGGDSE YQKEAMRKYL AEMPAGEGFV SMFNGTDLSG WKGLVENPIK RSKMDAKSLA DAQVKADAEA KESWKPMNGE LHFMSHGNNL ATVKQYGDFE MLVDWKIIDD KKGNGDAGIY LRGSPQVQIW DTARVKSGAQ VGSGGLYNNK VYESKPLKVA DNKLDEWNTF RILMKGDRVT VYLNGELVTD NVILENYWDR NLPIFAEEQI ELQAHGSPVA YRDIYIREIP RVKPFELSAQ EKKEGYKVLF DGTNMHNWTG NTTDYIIEDG NISIRPRPGK GSGGNLFTKE EFSDFIFRFE FQLTPGANNG LGIRAPLTGD AAYQGMELQI LDNEAPMYKN LHVYQYHGSV YGTIPAKRGF LKPVGEWNYE EVVVNGPKIK VILNGTVILD GDITDARKNG AADGKPHPGL LRESGHIGFL GHGSPVQFKN IRIKDLSKKK
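The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs only on the first three blocks of the gene sequence, so its result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGATCAAAA AGATATTCTT TATTCTGATT")
print(len(prefix), gc_fraction(prefix))
```

Applying the same two functions to the full gene sequence would give a figure that can be compared with the GC content field.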
|
| |