Gene Information |
Locus tag | Phep_1958 |
Symbol | |
ID | 8253062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2260665 |
End bp | 2262812 |
Gene Length | 2148 bp |
Protein Length | 715 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 644935609 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_003092228 |
Protein GI | 255531856 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
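The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2260665, 2262812)
print(length)                     # 2148
print(protein_length_aa(length))  # 715
```

Under these assumptions the computed values agree with the Gene Length (2148 bp) and Protein Length (715 aa) fields.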
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0263254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000489248 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGATATTA AAAGGTTTTC GAAAATACTA AAGAAGTTTT CATGGATATT GATTTTATTG CCAGTAATTG CTGGGACCCT AACTTACTTT CTCTCCAAAA ATCTTCCGAA AAAATATAAA TCAGAAGCAC AGATAGCCAC GGGACTTGTA GACCAGTCTA AACAAATATC AACCCAAGTT CAATCTGATT TTTTTAAGAT CAGCCAGCAA TTTAGTAATA TCATTGAAAA AATGAAAATG AGAAAAATGA TGGGAATACT TTCCTATCAT CTCATTATTC ATGATTTAGA AAATCCAAGC ACTGCTTTCA GAAAAAACAG TTCTAAAATA GATTCACTGT CCCAGTCTGA TAAATCAGAA TTAATTAATA TCTACAAAAA AAAGCTAAGT GAAAAAGCTA CAATTACCGT TTTGGACAAT AGAAATAAAT TCAAATTATT CGATTTGCTC CAATCAATGG GATATGATGA AGCTAGTATC AATGAAGATT TGACCATTTA CAGAGCAGAG AACAGCGACT TCATAAATGT GCAGTTTACT TCTGAAGATC CTTTGTTATC TGCTTTTGTA GTAAATACGC TTTCAACAGA GTTCATTAAT AATTATAGCG TAGAAGTTTA TAATAATCAG AACAACTCTA TTTTGCTTCT AGATTCATTG TTAAAGAAGA AAGAACAAGA TATGAATGCG AAAAATAGCG CACTGAAGGA CTTCAAAATG AAAAACGGAG TCTTAAATGT TAATGAACAA GGTGCAACTT TATACGCTCA GATATCTCAA TATGAAGAAA AAAAGGCGCA GGCGATAAGA GATATACAAG CTAATTTAGG TGCATTATCG GCTATTAATA AAAAGCTGAG TGGGAAAGAT GAACCGTTTT TAGGAGGGAG TGCCATTGAA GACAATAATA ATATTCTCAA TTTAAAATCA CAGATAAAAA CAGCCAATGA CAGGTATATA GACGGAGGTT TTAAAGTAGC AGATAAAAAG AAAGTAGATT CTTTACAAAA TCTGCTTAGT ATCCAGTCTT CCAAAGTCTC AGATAAAAAT GTAACTGATC CTCTTGTTCC AAGACAGGGA CTTGTTCAGC AAAAAATAAA TCTTGAAGTA TCTACCGACC AGATTAAAAA TACCATTAAA TCAATTGATA GAGAGTTGGC TACTTTAAAA GCAAAGTACA GTACAATGGT GCCATTTGAT GCTGGCATTC AAAACTACGA ACGCGATGCC GATATTGCTA CGAAAGATTA TATGGCAGCA CTTGATCGGA CCAATCAAAG TAGAACGGAG CAAAACACTG GTTTAAAGCT TCAGATCGCA CAAATTGGTT TACCAGGTAC GCCAGAAAAA TCAAAGGCGA TACTGTTTAT TGCAATGTCT GCCGCTGCCA GTTTTATGTT GTGTTTTGTT ATTCTATTGA TTCTTTTTCT GCTTGACAGA TCTGTTTATA CCACAGCTCA ATTGGCCAAA CAAACCGGTG GGCCAGTTTT AGGCTTACTC AATTTAATTA CAGAATCAGA CAAAGAACCC AAAACGATTT GGAAAGATAA AGGAGACAAT TTAAATTATA ATGTCTATAA AGACCTTTTA AGATCCGTTC GGTTTGAAAT TGATAAAGCA TTCGGTGATA GTGAGTCAAA AATATTAGGC ATCACAAGTT TAAATATTGG TGAAGGAAAA TCATTTTTAG CATCCAGTTT GACTTATGCA TTTGCTATGA CTGGAAAAAA GGTATTGCTT ATAAGTAGCG AAGAAGATAG TGTAGATAAG GATGGATCTC AGAAACTGAT CCCCAGTGAA 
TTTGTAGGTA CATTTATTGT GAAGAAAGAA GTACAAACAG AAGACCTCAT TACTGTGTTT AATATGAAAT CCAGCAATTC TTCTTTATTG GAAACTCAGA GCAGTGCTAA CATAAAAAAT GGGTTTGATC TTTTACGGAA GGAGTTCGAC TATATTATAA TTGACATCAA TAACTTAAAG GATGTTAACA ATACCAAAGA ATGGTTGTAT TTTACGGACA GGAGCATTGC GGTATTTGAG TATGGTAAAT CAATAGGAGA TGGAGATGCT GAATATATCA ACTATATCAG GAACCATCCA GGTTTTATGG GATGGATACT CAATAAATTT AAGTATAAAA AGAAATAG
|
Protein sequence | MDIKRFSKIL KKFSWILILL PVIAGTLTYF LSKNLPKKYK SEAQIATGLV DQSKQISTQV QSDFFKISQQ FSNIIEKMKM RKMMGILSYH LIIHDLENPS TAFRKNSSKI DSLSQSDKSE LINIYKKKLS EKATITVLDN RNKFKLFDLL QSMGYDEASI NEDLTIYRAE NSDFINVQFT SEDPLLSAFV VNTLSTEFIN NYSVEVYNNQ NNSILLLDSL LKKKEQDMNA KNSALKDFKM KNGVLNVNEQ GATLYAQISQ YEEKKAQAIR DIQANLGALS AINKKLSGKD EPFLGGSAIE DNNNILNLKS QIKTANDRYI DGGFKVADKK KVDSLQNLLS IQSSKVSDKN VTDPLVPRQG LVQQKINLEV STDQIKNTIK SIDRELATLK AKYSTMVPFD AGIQNYERDA DIATKDYMAA LDRTNQSRTE QNTGLKLQIA QIGLPGTPEK SKAILFIAMS AAASFMLCFV ILLILFLLDR SVYTTAQLAK QTGGPVLGLL NLITESDKEP KTIWKDKGDN LNYNVYKDLL RSVRFEIDKA FGDSESKILG ITSLNIGEGK SFLASSLTYA FAMTGKKVLL ISSEEDSVDK DGSQKLIPSE FVGTFIVKKE VQTEDLITVF NMKSSNSSLL ETQSSANIKN GFDLLRKEFD YIIIDINNLK DVNNTKEWLY FTDRSIAVFE YGKSIGDGDA EYINYIRNHP GFMGWILNKF KYKKK
|