Gene Information |
Locus tag | Phep_3959 |
Symbol | |
ID | 8255093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4761010 |
End bp | 4764210 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937623 |
Product | hypothetical protein |
Protein accession | YP_003094212 |
Protein GI | 255533840 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
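The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the values shown. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that is not counted in the protein length.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(4761010, 4764210)   # Start bp / End bp from the record
    print(length, protein_length(length))
```

Under these assumptions the coordinates reproduce the listed Gene Length of 3201 bp, and 3201 / 3 = 1067 codons minus one stop codon gives the listed Protein Length of 1066 aa.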
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.383952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTAA CATTTACCAA GCAACTAAAA ATGCCTTTGC TGGCATTTAT GTTTTTATGG ATGGGCGTTT TACATGTTCA AGCGCAATTG ACACATCCGG GGATACTGTT CAACGCTGCT GGCCTGGCCA GGTTAAAGAC CTATGCAAAT ACGGAACGGC AGCCATGGGC AGCGACATAT GCTAAATTAC TTGCCTATAA CGATATAAAC TATGTGCAGG AACCTGCTTA TGCCATTGTT AATAGAGAGT TCGCAGGGGC AACCAGTACT GAGTCAAGGG CAATGTCTGC CCAAAGTCAG AGAGCTTACA GGTGTGCAAT TTTGTGGGTC ATTACTGGAA ATCAAATATA TGCAGATAAA GCTAAAACAA TATTGAACTC GTGGTCAGGT ACCCTGGATA GTATTACCGG AGGCGCTGCC AAGTTATGTG CGGCCTGGTA TGGTTTTGGT TTTGTTAATG CTGCGGAAAT TCTGCGTTAT ACAAACTCAG GCTGGAGTAC GACAGATATA CAGCGCGCGG AATCTATGTT CAGGTATAAA TTTTATCCGG TAATTGAACC TTTCCAGGGG GGATGGGCGG GGAATTGGGA TACCGCCATT TGTAAAACTA TGATGGGAAT AGGAGTTTTT ATAAATGACG TTGCCATTTA CACCAGAGGG CGCAATTATT TGTGGTCAAC CACCGAAACC GCTTCAGGAA CACTGAACAA TTACATTTAC CCTACTACCG GCCAGTGTTT TGAAAGCGGT CGCGATCAGG AGCATACCCA AATGGGCATA GGGGGCTTAG CGGAAGCATG CGAAATAGGG TATAACCAAG GTACGGATCT TTATGGTTTG TTTTCAAACA GGTTACTGTT AGGAACCGAA TATACCGCTA AATATAATTT AGGGTATAGC GTACCCTATA CAACAAATCA TTATGGATCT GTGATTTCTC CTGATTTAAG GGGAGAGTTT CTTCCGTTTT ATGAATTGGT TTACAATCAT TATGTAAACA GAAAAGGGAT GTCCGGAGAG CCGGTTAAAT TTACCAAAAT GGTTGTGGAA AAAATAAGGC AAGACAATGG AGGAGAAAAT GGTACTGCGA TATTATCAGG ATACGGATCA TTGTTGTTTA ATGAATATGT ATTTAAATAT GTTCCTGCTG CGGGTGACTA TCGGACAACA GGATTGAGTA GCAGTATGGG TACCCCATCA CAGTTTGAAG TTTTTACGAA CGGAGATTGG GTAACAGCAA CAACTGCCCT TGGATTAACA ACTAATTTGT TGGTAAGAAA TGGGCAAAGT GCATTGGCAG CAGGTACTAG AAATTTAAAG AACCTGATAG TTGGAGAGGG GGACGGAGCT GTTTTAAGGG CACAGGTCAA TGCAGGTACA GTATCGGCAT TAAACATTGT AGAGCCAGGT AATTTAAGCA GTATTCCTGC GCTATCTTTC GTTAACGGTG GACAAGCTAC AGGCTCTACC AGTGCCGTTG CAACGATTAC AAGCGTAAAA GTAACAGGTG CAGATATAAA AAACCGGGGT TCTGGCTATA CTAATGCCGG TGTTACTTTT AGCGGTGGTG GCGGAAGCGG AGCCACGGCT ACTGCAATAG TGAGTAATGG TAAAATTATG GATATTGTGA TCACCAATTC AGGATCTGGC TATACATCCA TACCAACTGT GTCTGTTACA GGAAATGGTA CAGGAGCAGC TGTTTCGGCA AAAGTTGGTA TTACAGAGAT AAGCATTAGT AGCGGAGGGA CCGGTTATAC CAAAGCGCCA GCAGTGATAG CAGGAACCTT TCTGAGAGTG 
AATGACGGAA TTGCTCTAGG TGTATCAAAT GATGTATCTT TTCAAAGAGG ATCATCTGTG TATAGCGGGG GAGCCTCTTC GGCCGTTACA GGTACATTGA ATATCAGCGG CAACCTGATA GCAGAAGATA CTGTTAATTT TATCTCTGTA GCTCACGATA ATACCATTTC ATCACTTACA GTAAATTTTA AAAAATCTAC CGTAGATAAT ATCGGTTTCA TAGGCGGAAA TGTTACTTTT AGCGCACTGG GTGTTGAAGC TGCGAATACC CTTAAATTGA AGTTAGGTGC CGTAATGAAT GTTGCAGATG CTATTGGTCT AAACAGTACT GGTATAATTG ATGCAACTAA TGGCACTATA GGTTTTGTAA ATATCTCTCC TTCTATATCG ATAGCAAGAA CCATCGCAGC CAATACTTTT AAAGATGCAA CTGTCAATAA AATGGTAGTC AATTCAGCCG CCGGTGTTAC GCTGAACCAG GGGCTGACGA TAACAAAACT GGATATGCAG AAAGGATTGC TGAATATTCC GGAAACTTCA GAGATTACTG TTTCAAGTGT TTCAGGCGGA AGCAACACTT CCTATATCAA CACCATGTCG TCAGCAAGTT CCGGTGCAAT TGCGAAAGTT AAAGTAACCG GATTAACAAC TGCTCAGGGA GATATTCCTT TAGGAAATGG CGGAAATTAT CTGCCTGTAC GGATTACCCC ACCTGCAGAA ACTGCATTTA ACTTTACAAT GAGTGTACTT ACAGGCCTTA CAGCTAATGG TTTGCCAGAT GGCGGTGTAG TTGCAGACAA GAGCCAGTTT GTAAATGCGT CATACCATGT TATCCGTACA AGCGGAACAG GAGATTATAC GTTTCGTGTT GGCTTTCCTG CAAGTCTGAA AGGCAGTGCT TTTACCCCAT CATCTCCTTT CGGTATTTCA AAATATAATG GAAACAGTTG GTTACCTGTT ATAGGCAGTG GTAATTATGC ACTAAATACC GCTACCGCTA CTTTTAATAC CAATGGCTTA CGCCACCGGA TACAGATTGG TGGGGCTCAG CCATTGAATT TAACAGGTAC CAATGCCAGC GGCTATACAG AACGCAAGGC GATTAACTTA ATCGAGCCGG AAGATTTACT GAAGATCAAG GCAACCAATA TCCTATCTCC AAACGGTGAT GGGGTGAATG ACAAATGGGT GGTGGATAAT ATTGATTTTT ATCCAAATAA CGAGGTGAAG ATCTTTGAGC GTACGGGCAG ATTAATGTAT AGTAAAAAAG CTTATGACAA TAGTTGGGAA GGTACCTTAA ATGGTGTGCC GCTGGCTGAA GGAACTTATT ACTATATCAT AAATTTTGGA ACAAGCAGGC CAAGTTTAAG CGGTTTCATT ACCATTACCA GACCAGAGTA A
|
Protein sequence | MNLTFTKQLK MPLLAFMFLW MGVLHVQAQL THPGILFNAA GLARLKTYAN TERQPWAATY AKLLAYNDIN YVQEPAYAIV NREFAGATST ESRAMSAQSQ RAYRCAILWV ITGNQIYADK AKTILNSWSG TLDSITGGAA KLCAAWYGFG FVNAAEILRY TNSGWSTTDI QRAESMFRYK FYPVIEPFQG GWAGNWDTAI CKTMMGIGVF INDVAIYTRG RNYLWSTTET ASGTLNNYIY PTTGQCFESG RDQEHTQMGI GGLAEACEIG YNQGTDLYGL FSNRLLLGTE YTAKYNLGYS VPYTTNHYGS VISPDLRGEF LPFYELVYNH YVNRKGMSGE PVKFTKMVVE KIRQDNGGEN GTAILSGYGS LLFNEYVFKY VPAAGDYRTT GLSSSMGTPS QFEVFTNGDW VTATTALGLT TNLLVRNGQS ALAAGTRNLK NLIVGEGDGA VLRAQVNAGT VSALNIVEPG NLSSIPALSF VNGGQATGST SAVATITSVK VTGADIKNRG SGYTNAGVTF SGGGGSGATA TAIVSNGKIM DIVITNSGSG YTSIPTVSVT GNGTGAAVSA KVGITEISIS SGGTGYTKAP AVIAGTFLRV NDGIALGVSN DVSFQRGSSV YSGGASSAVT GTLNISGNLI AEDTVNFISV AHDNTISSLT VNFKKSTVDN IGFIGGNVTF SALGVEAANT LKLKLGAVMN VADAIGLNST GIIDATNGTI GFVNISPSIS IARTIAANTF KDATVNKMVV NSAAGVTLNQ GLTITKLDMQ KGLLNIPETS EITVSSVSGG SNTSYINTMS SASSGAIAKV KVTGLTTAQG DIPLGNGGNY LPVRITPPAE TAFNFTMSVL TGLTANGLPD GGVVADKSQF VNASYHVIRT SGTGDYTFRV GFPASLKGSA FTPSSPFGIS KYNGNSWLPV IGSGNYALNT ATATFNTNGL RHRIQIGGAQ PLNLTGTNAS GYTERKAINL IEPEDLLKIK ATNILSPNGD GVNDKWVVDN IDFYPNNEVK IFERTGRLMY SKKAYDNSWE GTLNGVPLAE GTYYYIINFG TSRPSLSGFI TITRPE
|
| |
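The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below strips the block spacing, translates a DNA string codon by codon, and computes a GC fraction. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as starts). The names `normalize`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def normalize(blocks: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    head = normalize("ATGAACTTAA CATTTACCAA GCAACTAAAA")
    print(translate(head))
```

Applied to the first three blocks of the gene sequence, this yields MNLTFTKQLK, which matches the first block of the listed protein sequence. Running `gc_fraction` on the full normalized gene sequence would be one way to compare against the listed GC content.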