Gene Information |
Locus tag | Phep_4096 |
Symbol | |
ID | 8255230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4935464 |
End bp | 4939129 |
Gene Length | 3666 bp |
Protein Length | 1221 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644937760 |
Product | hypothetical protein |
Protein accession | YP_003094349 |
Protein GI | 255533977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
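The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon. Both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(4935464, 4939129)
print(length, protein_length(length))  # prints: 3666 1221
```

Both results match the Gene Length (3666 bp) and Protein Length (1221 aa) fields.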
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATGA AAACACTTTA TATGCTTTGC GTATTGCTGA TGTTGTCAGG CGCTGCCCAA CATTCTTTTG CACAGAAAAA GAATGGCCTG CCACACGACA CAAAGTTTAG TGGCTACCCA ACAAGAAATC CGGACTTTGA TGTGCGGCCA GGCTTCGGTA ACCCACCTAA AGGCTATGGA AATGTCCCTT TCTTCTGGTG GAACGGTGAT ACGCTTTCGC GCGCACGCTT AACTGAACAG TTGGATATCC TTAAACAATC TTCTACAGAT GGTTTTGCAA TAAGCTACCT GCATACCGAT CCCAGGGGCG CAGATGCAGA ACTGAATAAA AAGCTGGGCT ATGGCCTTTA TGGACGTACC GAACATGGTA ATCCCGCAGT ATATTCTGAT GAATGGTGGA AAACCTGGAA CTGGTTTTCC GGCGAAGCCG CAAAACGGGG CCTGGCTGCA GGCCTTGACG ATTATACCAT AGGCTGGGTA GGTAACGGCT ATTCTACCGA TGAGGTACGC GACCTGGAAC GTTTCAAATC CTATAAGGGT GCGCTGGTTA TAAAAATCGA TACCATAGCA GCAGGTACTT CCTTAAAAAA GGCAGTCCCA CCAGATCTGA TTAGCCTTAC TGCCTGGCCG GTTACAGGCA GGGCCGCCTT TATAGACCTT AAAGGAAAGG TTAAAAACGG AAGTATTGAA TTTAAAGCAC CAAACAATGG CAACTGGAAA ATATATACCA TTGTGGCCAG CAATAACTAC ATGCTGCATC CCGATCATGG TAAGGAACTG GTAAAACATT ATTTTCAGCG CTTTGAAGAT AAAATGGATG CCCAGGGCCG CAAAGGAATG AACTATTTTT TCCAGGATGA GCTGTTCATC CCTTTTGACA TGAACACCTG GTCGGAAGAT TTTGCAGAGC AGTTTAGCCG TATAAAAGGC TACGACGTAC TGCCCTATCT GCCAGCCTTA AAAGAAAATA TCGGTGCAAT CACCCCAAAA ATCCGTATGG ATTATGCCGA TGTGCTGATA GAACTGGCCG AGGAAAGATA TTTTAAGCCC ATTTACAACT GGCATGCCAG CCGCGGGCTC ATTTATGGCT CAGATAACCT GGACCGGGGG CTGGAACCCC TCAATTATGT AGATTATTTT CGTATAGAAA GCTGGTATAC TGCCCCAGGA AATGATGCCC CGGCCAGAGG CTCTTCTTTT TTACAGACCA AAGTATCCAG CTCTGTTGCA CATTTATACA AGCGTCCACG CACCTGGCTG GAAGCCTTTC ACAGCATGGG CTGGGGCAGC AGTGGCGAAT GGCTGGCCGA ACAGCTGGAC CACCATTTTA TAGCAGGGGG TAATCTCATT TGTATGCATG GCCTCTATTA CACCACGCAT GGCGGATGGT GGGAGTGGGC ACCACCTGAT TTCCACTACC GCATGCCTTA CTGGCCACAT ATGAAAAAAT GGCTGGAATA TGGGGAGCGC CTGAGTTATT TGATGAGCCA GGGCATCCAT GTTAGCGATA TTGCCCTGAT GTACCCTACC GAACCCATGC AGGCTTTTCC GGGCAGCAGG CCCGATATAA GTTTTGCCAC TGCAAGAACA TTAAGCAATG CCGGCCTGGA TTATGATTTT ATGGATTACA GATCCCTTCT TAAAACAGAG ATCAGTGACA AATCCCTGAA AGTATCAGAT ATGTCTTTTA AGGTGCTGAT CCTACCTGAT ATGAAAGCGA TGCACCACGA GGCTTTGCAA CAGGCGCTAA ATTTTTATCG TAAGGGAGGA ATAGTAGTAG CCACCGGTGC ATTGCCTATG 
GCCAGTACAC GTAAAGGCAG CAACGATCCG GAGGTTGATG CCATTGTAAA AGAAATTTTC GGCTTAACGG CAAGAGAACT GGAAGCCGGT AAAAAGGCCG AGATTCATCA GAATGCTGCC GGAGGGAAGG GTTTGCTTGC CGATACGCTG AATATTGCAA CTGTATTGGC GCAACACATT ACACCTGATT TTATTCCTGG TGAAGGAGGG GGCAAAGTGC TGCACCGCAG GGTCGGAAAT AAAGAGGTAT ACATGACAAT GAATGTAAAA CCGGGCAACG AAGTTTTTTA CCGGGCCCTG GGCCGTGCCG AATTATGGGA TGCCAAAACC GGCGAGACCA AAGTTTTGCC TGTTGTTAAA CAGACTGCAG CAGGAACTTA TATCCGGTCA GATGCCGGTT ATACCAATTC ATCACTCATT GTTTTCTCGC CAGGAGAACC GCTGATAGAA AGCAAAAAAA CATCTGATGC AGTTGCTCTT GAAAAAATTC CGGTGACAGG AAACTGGAAG ATAGAATTTC TGCCTACCAT GAACAACAAA TGGGGGGATT TCCGCCTGCC CGCCTTTGAT GGTATGATCG GTACGGAAGC CCGTACATTT AAATTCAGCA CTAAGGCCAA TGCGGCTAAA GACTGGACAG CTGCTGCATT TAATGACAAC AGCTGGAAAG AAGGCATTTA CGATTACAGC TACCAGATGC AATGGTTGCT GGACTCTGCT TCCAACAATT TTGATGCCTT AATAGGCCAG GCACTTGACG GAAAGCTGGA CAGCTGGAAA CCTTACCGTT TTAGCTGGCG TTTTGGTGTT TGGGACCATC CCGGCCCGCA GGGTTACCAT GGGTTAAAAG CTAAGGTAAA CGATGGTTTC CTTATTTTGG ATGGGGCTGG CGACCACCTT TTTAAAACTT ATGTTTATGC CCCGCAGGAA CAATTGTACC ATGTAGAGCT GGGAGAAAGG GTGCCAGATC GCTTTTATGT GGATGGCAAG GCTTTAACAG GCACTGAGAT CAAACTGAAT AAAGGCTGGC ATGCTGTACT TGCCGCTTAT GCCAAGGTGC CTAAAAAAGG GTATAAAACC GGGCCAAACT CACGTGATGA ACGTCCGAGA AGTGCCATTG TCTTTTTGCC GGCAGCAAAC CCACTTCCGG AAAAGGTAGA CCCTTATTCT AAAATACTGG CTATGCGCTG GTATCAGGTG CCGCATTTAA TGTTTGATCC GGATAAGGGC GAAAACAAAG CCTATTGTTA CCGGTTCAAA TCTGCGCCGG GTTTAGAAAA AATGGAAATG GGTATTTATG GTAAAAACCT TGCTGTTTGG ATAGACGGAC AGCCGCTGGA TAAAAAGTAC ATCCAGCTGT TGAAAGCCGA ATCTGGTTTA AATACTTATC AGGTAAACCT GCCGGAGAAG AAAGCGCAGA TCGCAGAAGT GGCCATGCAG ATCGAAACCG AAACGGGCAT CCAGGATGTT GCCGCATTCC CTTTTCCTGT AAAATTATTC TGCCGGTCTG GATTGCTGGA AGCGGGTGAC TGGTCGCTAA CGGGGCAGAT GAAGCATTAT TCGGGTGGAT TGTATTACAG AAAGACCTTA AGTTTCAGTG CAGAGCAATT GAAAAATAAA GTTTCACTCG ATTTGGGTGA AGTTGTGGCT ACCTGTGCTG TTAAAATAAA TGGACAGGAT GCGGTAACGA TGATGTCCAA ACCTTATAAA ACTGAGATCA CTAAATTCCT GAAAGCAGGG ACTAATGAAG TAGAAGTATT GGTGTATAGT ACGCTGTCTA ATCACTATCA AACTATTCCA TCTGCTTATC 
GTGGGAATCC AAGGGCCGGA CTGATCGGGC CGGTTGCGAT CGAGATACAG AAATAA
|
Protein sequence | MNMKTLYMLC VLLMLSGAAQ HSFAQKKNGL PHDTKFSGYP TRNPDFDVRP GFGNPPKGYG NVPFFWWNGD TLSRARLTEQ LDILKQSSTD GFAISYLHTD PRGADAELNK KLGYGLYGRT EHGNPAVYSD EWWKTWNWFS GEAAKRGLAA GLDDYTIGWV GNGYSTDEVR DLERFKSYKG ALVIKIDTIA AGTSLKKAVP PDLISLTAWP VTGRAAFIDL KGKVKNGSIE FKAPNNGNWK IYTIVASNNY MLHPDHGKEL VKHYFQRFED KMDAQGRKGM NYFFQDELFI PFDMNTWSED FAEQFSRIKG YDVLPYLPAL KENIGAITPK IRMDYADVLI ELAEERYFKP IYNWHASRGL IYGSDNLDRG LEPLNYVDYF RIESWYTAPG NDAPARGSSF LQTKVSSSVA HLYKRPRTWL EAFHSMGWGS SGEWLAEQLD HHFIAGGNLI CMHGLYYTTH GGWWEWAPPD FHYRMPYWPH MKKWLEYGER LSYLMSQGIH VSDIALMYPT EPMQAFPGSR PDISFATART LSNAGLDYDF MDYRSLLKTE ISDKSLKVSD MSFKVLILPD MKAMHHEALQ QALNFYRKGG IVVATGALPM ASTRKGSNDP EVDAIVKEIF GLTARELEAG KKAEIHQNAA GGKGLLADTL NIATVLAQHI TPDFIPGEGG GKVLHRRVGN KEVYMTMNVK PGNEVFYRAL GRAELWDAKT GETKVLPVVK QTAAGTYIRS DAGYTNSSLI VFSPGEPLIE SKKTSDAVAL EKIPVTGNWK IEFLPTMNNK WGDFRLPAFD GMIGTEARTF KFSTKANAAK DWTAAAFNDN SWKEGIYDYS YQMQWLLDSA SNNFDALIGQ ALDGKLDSWK PYRFSWRFGV WDHPGPQGYH GLKAKVNDGF LILDGAGDHL FKTYVYAPQE QLYHVELGER VPDRFYVDGK ALTGTEIKLN KGWHAVLAAY AKVPKKGYKT GPNSRDERPR SAIVFLPAAN PLPEKVDPYS KILAMRWYQV PHLMFDPDKG ENKAYCYRFK SAPGLEKMEM GIYGKNLAVW IDGQPLDKKY IQLLKAESGL NTYQVNLPEK KAQIAEVAMQ IETETGIQDV AAFPFPVKLF CRSGLLEAGD WSLTGQMKHY SGGLYYRKTL SFSAEQLKNK VSLDLGEVVA TCAVKINGQD AVTMMSKPYK TEITKFLKAG TNEVEVLVYS TLSNHYQTIP SAYRGNPRAG LIGPVAIEIQ K
|
| |
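The gene sequence above is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The `translate` helper and its names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# of table 11 are identical to those of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAATATGA AAACACTTTA TATGCTTTGC"))  # prints: MNMKTLYMLC
print(translate("GAGATACAGAAATAA"))                   # prints: EIQK
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full 3666-base gene sequence to the same function would allow a complete comparison.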