Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4200 |
Symbol | |
ID | 8255336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 5077730 |
End bp | 5079046 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644937866 |
Product | hypothetical protein |
Protein accession | YP_003094453 |
Protein GI | 255534081 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0156411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.730059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA TGATCAAATT ACCGGCAATT GTACTTGGGC TGCTGCTCAT GTTCACCGGT GCAGCACAAA AGGTTATGGC GCAGGGAGAT GATGTTTCGC TCCAATCGTT TTATGATGAA CTGTCGCCTT ATGGTACCTG GATACAGGAT CCGCAATACG GGTACGTTTG GAGACCTGAT GTAGAACAAG GCGATTTCAG GCCTTATTAT ACCAATGGCC GCTGGGCAAT GACCGAGTAC GGTAACACCT GGGTTTCCAA TTACGACTGG GGCTGGGCGC CTTTTCACTA TGGAAGATGG GTTTACAACC GTTACGGTCA ATGGATATGG ATTCCTGATA CCACATGGGG ACCTGCATGG GTAAGCTGGA GAAGTGGTGG TGGCTATTAT GGCTGGGCGC CGATGGGCCC TGGAATGAAC ATCAATATCA ACCTTAACAT CCCCGATCTG TGGTGGGTAT TTATCCCCCA AAGGAATATC TATTACGATA GCTTTCCACG ATATTATTCC CGCAGAAATG TGACCATCAT CCATAACACG ACCATCATTA ACAATACTTA CGTAAATAAC CGCCGTACTT ATTACACTGG CCCAAGGGCC GATGACATCA GACGTGCAAC CGGAAGAGAT GTAAGAGTAT ATAATGTAAA TACCACAGGC AGGCCAAGCC GCAGTAACAT AAATGGCAAC AGTGTTGACA TCTATACGCC AAGGCCAAGC AGGGGTAGTT CCAATGTAAA TGCCAAACCA CGGGAAGCCA TCAGAGGAGA AGGGTATACC ACACCAAGAG GAGACCGCGG AACGGCAAGC AACGGATCAT CGTCCGGGCG CCCTTCCAGA ATTGACAACC AGGGTAACAG ACCTGATAAC CGTGAAAACG GCGTAACTAC ACCTCAAAAC AGGGGCGAAA GACCTATTTA TGAAAATAAT GGCAGGCCAT CAAGAACAGG AAGCCCGGAA AACAACGGCT CAACAAGACC ACAGCGAATA GAAAGACAAA ATCCTTCAGG AGAAACTCCT GCACAAAGGC CACAGGAAGT TCAGCCTGCC CCACAACAAC GTCAGGAGCG TCAGGAGCGT CCTCAACCAC AGGCACGACC TCAGCGTCAG GAAAACAGGC CGGAAGCCCC GCGTCAGCAG GAAAGACAGC AACAACCACA ACGTCAGGAA AGCAGACCGC AGCAGCCACA GGCCCAGCCT CAGTACCAAA GACCGGAACG GACCCAACAA AGTGCACCTC CGGCCAGAAG TTCTGAAAGC CGCGGCGAAC AAGGTGGCAG AGGAGCTGAA CGACCAAGCC GCGGAGGCAG GAGTTAA
|
Protein sequence | MKNMIKLPAI VLGLLLMFTG AAQKVMAQGD DVSLQSFYDE LSPYGTWIQD PQYGYVWRPD VEQGDFRPYY TNGRWAMTEY GNTWVSNYDW GWAPFHYGRW VYNRYGQWIW IPDTTWGPAW VSWRSGGGYY GWAPMGPGMN ININLNIPDL WWVFIPQRNI YYDSFPRYYS RRNVTIIHNT TIINNTYVNN RRTYYTGPRA DDIRRATGRD VRVYNVNTTG RPSRSNINGN SVDIYTPRPS RGSSNVNAKP REAIRGEGYT TPRGDRGTAS NGSSSGRPSR IDNQGNRPDN RENGVTTPQN RGERPIYENN GRPSRTGSPE NNGSTRPQRI ERQNPSGETP AQRPQEVQPA PQQRQERQER PQPQARPQRQ ENRPEAPRQQ ERQQQPQRQE SRPQQPQAQP QYQRPERTQQ SAPPARSSES RGEQGGRGAE RPSRGGRS
|
| |