Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0773 |
Symbol | |
ID | 8251862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 908197 |
End bp | 910128 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644934423 |
Product | Protein of unknown function DUF1800 |
Protein accession | YP_003091057 |
Protein GI | 255530685 |
COG category | [S] Function unknown |
COG ID | [COG5267] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACGC CTGCCAAAAT TGTCTGTTTG TTTGTATTCC TGTTCATCGC GACTGTATTC TGTACTGCTT TTCAAAAGGG AAGGCCAGCA GCAAAATCTG TCTTTCCTTA TAAACAGGCA GGGTTAAACA TACGGCAGGC TGCGGCACAT CTTTTAAGCC GATTTACCTA TGGGGCTACA CCCGGTCAAA TTGATGAGGT AGTAACCATG GGACTTGAAA ACTGGTTTAA CAAACAACTG GAAGGAAATT TGCCGGATGA GGGGCTGCAG GGCAGGTTAA GAAGTTATGA TGCCATTCAA CTTAGCAATA CAGAGGTGCT GAGCAAATAT CCGCCGGGCT TTGTAGTATT GCAAATGGCA CTGAAAGACA GTGTGATCAG TAAAGATTCT GTGGGAAAGG CCGTAGACAA AAAAGCTTAC AACGAGCAGA TACAGGCTTA TATGGCCAGT AAAGGGCTTA AGTCCGATCA GGAGCTGTAT AAGCAGTTTA TATCACAGCA CATATTAAGG GCTGCCTATA CCAATAACCA GTTGCAGGAA GTGATGTCGG ATTTTTGGTT TAACCATTTT AATGTCTCAT TTACCAAAGG GGAGTGTGCG CAGTTTATAC CTGCTTATGA AAGGGACGTC ATCAGACCAA ATGCGCTGGG AAAATTTGAT CAGTTGCTGA TCGCTTCTGC CAAATCTCCG GCTATGCTGT ATTTTCTCGA CAATTTTACC AGTCAGGGTG CAGCAGTTCC TGCCAGTCAG CCTGCTATGG GTATGATGAT GGCATCCGAA AGCCCGCAAA AGACAAAACC GGTAAATATG GCAGCTGTGC CGCCTAAAAA CGTGCCTGGG CTGAATGAAA ACTATGCCCG GGAAGTAATG GAACTGCATA CTCTTGGTGT TGATGGAGGT TATACCCAGT CGGATGTTAC ACAGGCCGCC CGCGTTCTGA CAGGCTGGAC CATTTATCCG ATCAGCAGTT CAGGTTATGG CAGTGCGATG AAAGGCCTGG TCGCTAAAAT CGGGGAAAAT AACCTGGCTG CGGAAGGCTT TGTACATGAG GGTGACTTTT TGTTTACACC TAACCGGCAT GATAAGGGAG AGAAGGTGGT ATTGGGAAAA CGTTTTGAGG CAAATGGTGG CTATGAAGAG GGGGTAGCAC TTTTAGAGAT GCTGGCCCAT CATCCGGCCG CTGCTAATTT TATTTCCAGG AAACTGGCTG TACGTTTTGT AAGTGACAAT CCTCCGGCCA GCATGATCAG GAAAATGGCC AAGACATTTA CCTCGGCAGA TGGGGACATC AGGCAGGTGT TGCGTACCAT GGTAAATTCG CCTGAATTCT GGGATGCCAA AGCATTGAGG CAGAAAACCA AATCGCCTTT TGAACTAGCT ATAAGTAGTG TACGGGCCGT AAATGCCGAC ATTCAGCAGC CTTATCAGTT GTTTACCTGG ATTTCAAGAA TGGGGCAGCG GATTTATTAT TATCAGGCCC CTACCGGTTT CCCCGACAAG GGGCAATACT GGATCAATAC GGGTTCCTTA CTGAGCCGGA TGAACTTTGG CCTGGCACTC AGCTCAGGAC GCATTCCGGG GGTGGTGGTA AACCTTGCTG CGCTGAACCC GCAAAAGGGT ACTGAAACTG CAGGAAAGGC ATTGGTCAGT TATAGTAAAG TAATGATACC TGAGCGAAAC CTGGATATGG CTTATCAACA GCTGATCCCC TTGTTGCAGG TGCCCGTGCC AGCGGCTAAG ATGAATAATG AAGCTGGTAA AACAACCGCT GTCTCCCAGC CTGTGTCTAA TAAGTTTTCC GGCATGGATG CCGTATCAGG AGCAGAAGAG CTAAATGAGA AGAAGAAAGA AGTTGTTCCG GTAGTGAATA ACAACATGTT GGCCCAGGTG GTGGGTATTA TCATTGGTTC TCCGGAATTT CAAAGGAGAT AA
|
Protein sequence | MRTPAKIVCL FVFLFIATVF CTAFQKGRPA AKSVFPYKQA GLNIRQAAAH LLSRFTYGAT PGQIDEVVTM GLENWFNKQL EGNLPDEGLQ GRLRSYDAIQ LSNTEVLSKY PPGFVVLQMA LKDSVISKDS VGKAVDKKAY NEQIQAYMAS KGLKSDQELY KQFISQHILR AAYTNNQLQE VMSDFWFNHF NVSFTKGECA QFIPAYERDV IRPNALGKFD QLLIASAKSP AMLYFLDNFT SQGAAVPASQ PAMGMMMASE SPQKTKPVNM AAVPPKNVPG LNENYAREVM ELHTLGVDGG YTQSDVTQAA RVLTGWTIYP ISSSGYGSAM KGLVAKIGEN NLAAEGFVHE GDFLFTPNRH DKGEKVVLGK RFEANGGYEE GVALLEMLAH HPAAANFISR KLAVRFVSDN PPASMIRKMA KTFTSADGDI RQVLRTMVNS PEFWDAKALR QKTKSPFELA ISSVRAVNAD IQQPYQLFTW ISRMGQRIYY YQAPTGFPDK GQYWINTGSL LSRMNFGLAL SSGRIPGVVV NLAALNPQKG TETAGKALVS YSKVMIPERN LDMAYQQLIP LLQVPVPAAK MNNEAGKTTA VSQPVSNKFS GMDAVSGAEE LNEKKKEVVP VVNNNMLAQV VGIIIGSPEF QRR
|
| |