Gene Information |
Locus tag | Phep_1914 |
Symbol | |
ID | 8253018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2210067 |
End bp | 2213249 |
Gene Length | 3183 bp |
Protein Length | 1060 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644935565 |
Product | WD40 domain protein beta Propeller |
Protein accession | YP_003092184 |
Protein GI | 255531812 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
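The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2210067
end_bp = 2213249
gene_length_bp = 3183
protein_length_aa = 1060

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 3183 0 1060
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the codon count minus one matches the listed protein length.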
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.657093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA GCTATACTTT CCTCAAAAGA TTAATTCCTG TTCTGTTCAT CTTCACGATG TTTTCTACCT CCCTTCATGC CCAGTATTTC GGACAAAACA GGGTTCGCTA CAACAATGAG AAGTTTAAAG TACTGCAGAC CCCACATTTC GAGATCTACT ATTACCTTAA AAATGAACAG CTGATACAGA AATTTGCCCA GGATGCAGAG ACCTGGTATA AAATGCACCA GGAGATTTTC AGGGATACCT TTCTGAAAAA GAACCCGATC ATACTTTACA ATAACCATCC TGATTTTCAG CAAACCACTG CCCTGCAGGG CGAGGTTGGC ATAGGAACCG GCGGGGTTAC AGAAGCTTTT AAAAACAGGG TAATCATGCC GGTGATGGAG CTGAATAACC AAACACGGCA TGTTTTAGGT CATGAGCTGG TACATGCTTT TCAATACCAC CTTTTACTGG AAAAAGATTC TGTCAACCTG GAAAATGTAA GCCAGATCCC CTTATGGATG ATAGAGGGTA TGGCCGAGTA CCTTTCTGTA GGAAAAACGG ATGCCTTCAC TTCCATGTGG ATGAGGGACG CTTTGCTGAA CAGGGATATC CCTTCCTTAA AAGACCTTAC CAATTCCAAT AAATATTTTC CCTACAGATA TGGACAGGCA TTCTGGACTT TTGTTGGCTC TGTATATGGC GATACCACCA TCGTTCCCTT ATTTAAGGCC ACCGCAAAAT ATGGTTATGA GAACGGATTG AGGTATACCT TCGGGTATGA TGACAGGACG CTTTCAGGTC TCTGGAAAAA TGCCATTGAA GCGCATTACC GCCCTATGTT AAGGGCAGAC AGCTCCCAGA TCAGGATTAC CGGAACAAAG ATCATCGACA ATAAAAATGC AGGGAACATG AACGTAGCGC CCTCTATTAG TCCGGATGGC AAATACCTTG CCTTCTTATC TGAAAAAGAT CTTTTTGGAA TAGACCTCTT TCTCGCCGAT GCAAAAACCG GTAAAATTAT CAGAAAGCTC AGCAGCCAGG TCGCCAATTC ACATATCGAC GATTTTAACT TCCTCGAATC TGCAGGTACC TGGTCGCCAG ACGGTAAACA ATTTGCCTTT AGCATTTTCA GCAAGGGCAA AAACCAGCTG ATGATCATCA ATATAGACAA TGGCAGCGTT GCCTTGCGTG CAGACATGGG CGATGTGGCG CAGTTTGGCA ACTTGTCCTG GTCGCCAAAT GGCGATGACA TCGCTTTCTC CGGAATGATA CAGGGCCAAA GTGATATCTT CTCTTATAAC CTTAAAACGA AAAAGGTTAC CCAGATTACC AATGACGCCT ATTCAGACTA TGCCCCCGCC TATTCACAGG ATGGAAAGAA AATAGCCTTT TCATCAGACA GGGCCTCTAT AACTAAAAAC AACAATACGG CAGTCCACTC CATCAACCTG AGTATTTACG ATATTGAAAG CAAAACTTTA ACTGATATCC CCGTATTCCC TGGCGCCAAT AACCTCAATG CCCAGTTTTC TGGCGACAGC AAAAGGCTCT TTTTCTTATC CAACAGGGAC GGGTTCAGGA ATCTTTATGC GTACAACCTG GCCGACAATA CGGTTAAACA GCTAACCGAT TATTTTACGG GGATCAGTGG CATTACGGAA TTTTCTCCGG CCATCTCCGT ATCCAGAAAT GACGACATCG TGTACAGCTA CTACCGTTCG CAGCGTTATA CTTTATACAA CTCCCCGATC AGCAGTTTCC GTTCAAAACC GGTAGATGCC AATGCAGTAA ATTTTGATGC TGCCGTGCTG 
CCACCCATGG AAACCATTGG TGTTGACATT GTAAACTCCA ACCTCGGCAA TTTTGAACGC TTTGAAAAAA CCATAGCAGA TTCTATGCGC CTTGTGCCTT ACAAGCCTAA ATTTAAGCTG GACTACCTCG CCAACAGCGG CGTAGGTGTT TCCACCAGCC GCTTCGGTAC TGGTGTACAG GGTGGTATTG TCGGAATGTT CAGCGATATA CTTGGTCGTA ACCAGATCGT TGCCAACCTT TCTGTAAACG GGGAGATCTA TGATTTCGGG GGACTGGTGG GTTACATCAA CCAGCAAAAC CGGATCAACT GGGGCGTAGC TTTATCGCAT ATCCCTTATA TCACCGGTTT CCGAGAAATT GTTCAAACTA CACTGGACAA CAATGGAACT CCGGTTAGCG TAATTGACGA CAGGACCAAC CTGATCCGTA CATTTGAAGA CCAGGCACAG GTATTTGGTG CCTACCCTTT CAATAAGGTA CACCGTTTTG AAACAGGTGG TGCATTCTCA CGTTACAGCT ACAGGGTAGA CCGCATCAGC AATTACTATG AGAACCTGAA TGGTTATCCC GGTTATTACA TCACTTCGGA CAAAAGAAAA GTACCACTGA GCGAAGCGAC CAATGATCTG GGAGTTCCGC TCAAAAGCTT TAGCATCTTC CAGCTCAATG CTTCTTTTGT GGGCGACAAT TCCATCAATG GCATCACCTC ACCTCTGGAG GGTTTCAGGT ACCGCCTGGG CATGGAACAG TATTTTGGTG ATTACAAATT CTCGGCAGCA ACCATTGATG TAAGAAAATA CTGGCGCTTA AAACCCATTA CCATCGCTGC CAGAAGCTAT AACTACCTGA GGATTGGTAA AGACGGCGAA AACTTATACC CGCTATATGT AGGATACCCA TATTTCATCA GAGGTTATGA AGCGAATTCC TTATACAATA GTGGGAGTAC CGGAACCAGT AATGGTTTTG ATATCAACCA GCTTTCAGGA AGTAAAATGG CTGTATTTAA TTTTGAACTC CGGCTTCCTT TTACAGGTCC AAAAAAGCTG TCTGCCATTC CTTCTAAATT TCTCTTTACA GACCTGAATC TCTTCTTTGA TGCCGGTCTG GCCTGGAACG AAGACTCTAA AGTGGTATTT AAAAACCAGC CCACCAATAA CATCAGACCC AAGCTGGGGT CTGACGGCCT GCCGGTTAAA GACCTCAACA ATAATCCGGT ATATACCGGT ACCAACGAAC GTGTTCCGGC ATTAAGTGTG GGTATATCTT TACGTGTTAA TGTATTCGGT TACTTTGTAC TCGAACCTTA TTATGCCATT CCATTCCAGC GTAAAGATAT CTCAGCAGGT GTATTCGGCC TTACTTTTGC CCCGGGATGG TAA
|
Protein sequence | MNKSYTFLKR LIPVLFIFTM FSTSLHAQYF GQNRVRYNNE KFKVLQTPHF EIYYYLKNEQ LIQKFAQDAE TWYKMHQEIF RDTFLKKNPI ILYNNHPDFQ QTTALQGEVG IGTGGVTEAF KNRVIMPVME LNNQTRHVLG HELVHAFQYH LLLEKDSVNL ENVSQIPLWM IEGMAEYLSV GKTDAFTSMW MRDALLNRDI PSLKDLTNSN KYFPYRYGQA FWTFVGSVYG DTTIVPLFKA TAKYGYENGL RYTFGYDDRT LSGLWKNAIE AHYRPMLRAD SSQIRITGTK IIDNKNAGNM NVAPSISPDG KYLAFLSEKD LFGIDLFLAD AKTGKIIRKL SSQVANSHID DFNFLESAGT WSPDGKQFAF SIFSKGKNQL MIINIDNGSV ALRADMGDVA QFGNLSWSPN GDDIAFSGMI QGQSDIFSYN LKTKKVTQIT NDAYSDYAPA YSQDGKKIAF SSDRASITKN NNTAVHSINL SIYDIESKTL TDIPVFPGAN NLNAQFSGDS KRLFFLSNRD GFRNLYAYNL ADNTVKQLTD YFTGISGITE FSPAISVSRN DDIVYSYYRS QRYTLYNSPI SSFRSKPVDA NAVNFDAAVL PPMETIGVDI VNSNLGNFER FEKTIADSMR LVPYKPKFKL DYLANSGVGV STSRFGTGVQ GGIVGMFSDI LGRNQIVANL SVNGEIYDFG GLVGYINQQN RINWGVALSH IPYITGFREI VQTTLDNNGT PVSVIDDRTN LIRTFEDQAQ VFGAYPFNKV HRFETGGAFS RYSYRVDRIS NYYENLNGYP GYYITSDKRK VPLSEATNDL GVPLKSFSIF QLNASFVGDN SINGITSPLE GFRYRLGMEQ YFGDYKFSAA TIDVRKYWRL KPITIAARSY NYLRIGKDGE NLYPLYVGYP YFIRGYEANS LYNSGSTGTS NGFDINQLSG SKMAVFNFEL RLPFTGPKKL SAIPSKFLFT DLNLFFDAGL AWNEDSKVVF KNQPTNNIRP KLGSDGLPVK DLNNNPVYTG TNERVPALSV GISLRVNVFG YFVLEPYYAI PFQRKDISAG VFGLTFAPGW
|
| |
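The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and alternative start codons are not treated specially here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAATAAAA GCTATACTTT CCTCAAAAGA"
print(translate(first_blocks))  # prints: MNKSYTFLKR
```

The result agrees with the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.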