Gene Information |
Locus tag | Phep_1967 |
Symbol | |
ID | 8253071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2269971 |
End bp | 2271647 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644935618 |
Product | hypothetical protein |
Protein accession | YP_003092237 |
Protein GI | 255531865 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
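The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2269971, 2271647)
print(length, protein_length_aa(length))  # prints "1677 558"
```

Under those assumptions, the result agrees with the Gene Length (1677 bp) and Protein Length (558 aa) rows.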
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.929106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000146847 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAATT TAAATTTTTA TGATGCGATT GTAATTGGGT CTGGTATCAG CGGGGGGTGG GCAGCAATGG AATTATGTAA AAAGGGATTG AAAACGTTAC TTCTTGAACG TGGAAGAGAT GTCAAACATA TACAAGATTA TCCCACAGCT AATTTGAACC CGTGGGATTT TGAGTTGGGC TTTAATAATA CACTAAAAGA TCAGGAGAAT GATCCCATAC AAAGTATGGC ATATACTCCT GCAGACAAGC ATTTTTATGT GAGTGATAAA GACCAGCCTT ATGTTCAGGA AAAGCCATTT AATTGGTTTC GCGGCTATCA AGTAGGCGGC AGGTCGTTAC TATGGGGGCG CCAATGTTAT CGTCTTAGCG ATTTAGATTT TGAGGCAAAT TTGAAAGAAG GCGTAGCCAT AGATTGGCCA ATAAGATACA AAGACATTGC CAGCTGGTAT AGTTACGTGG AATCATTTAT CGGCGTTAGC GGAAAACATG AAAATCTATC ACAGTTGCCC GATGGAGAGT TTTTGCCGCC ATTCGAATTG AATTGCATAG AGGAACATTT AGCTGATTCT ATCCTCAAAA CTGAAGAAAA CAGGTTACTT ACCCCAGCGC GCGTAGCCAA CCTAACGAAA GGCTGGGATA ATAGAGGACC ATGCCAAAAT AGAAATCTTT GTACCAGGGG CTGTCCATTT GGTGGTTATT TTAGCAGTAA CAGTTCTACA ATCCCTTCTG CAATGGCTAC AGGTAATTTA ACATTGAGGC CTTTTTCAAT CGTTGTTGAG TTAATATATG ATGAACAAGA ACAGAAAGCA AAAGGAGTTA AAGTAATTGA TAGCGTAAGT AATGAAGTTC ACCTATTTTA TGCAAACATC ATCTTTATCA ATGCATCTAC TATACCAACT ACAGCATTAT TACTTAACTC CGTTTCCTCA AGGTTTCCAA ATGGGTTTGG AAATGATAGT GGCCAGGTAG GACATAACCT AATGGATCAT CATTCTTCTG CGGGAGCATT TGGAATGCAT GATAGTTTTA AAAATCAGTA TTATAAAGGT AGACGGCCTT GTGGATTTTT AATTCCAAGA TATAGAAACC TGAATAATAA TGAAAATCTT GGTTTTAGCA GAGGGTACAA CATCCAGGGT CGGGGCCAGC GTCAGGAGTG GGTAGATCTT TCTTCTTCAA ATGGATATGG AAGTCAGTTT AAAAAAGAAA TCACAACACC TGGTAAGTGG ATGGTATGGA TGGCCGGATG GGGAGAGTGT TTGCCATATT TTGAAAATCG AGTCAGTCTA GTTCCCGATA AAGTAGATAA ATGGGGGCAA AAGTTAATAG CTATTGATTT TGAATTTAAG GATAATGAAA GAAAAATGAT GGATGATATC AAAGACACGG CTACTGAAAT GCTTATAAAA GCTGGTTTTA ATAATATTGA TAGTTTTAAT TACAATAAAC CAGGAGGTTC CACTGTACAT GAAATGGGCA CGGCAAGAAT GGGGAACGAT CCAAAAACGT CGGTATTGAA TAGATTTAAC CAAATGCATT CTGTAAAAAA TGTTTTTATA ACTGATGGAA GCTGTATGAC CTCTTCGGGA TGTCAAAATC CTTCATTAAC TTATATGGCT TTAACTGCCA GAGCTTGCGA TTACGCTGTT AAGCAATTAA AGCTGGGTAA TCTTTGA
Protein sequence | MKNLNFYDAI VIGSGISGGW AAMELCKKGL KTLLLERGRD VKHIQDYPTA NLNPWDFELG FNNTLKDQEN DPIQSMAYTP ADKHFYVSDK DQPYVQEKPF NWFRGYQVGG RSLLWGRQCY RLSDLDFEAN LKEGVAIDWP IRYKDIASWY SYVESFIGVS GKHENLSQLP DGEFLPPFEL NCIEEHLADS ILKTEENRLL TPARVANLTK GWDNRGPCQN RNLCTRGCPF GGYFSSNSST IPSAMATGNL TLRPFSIVVE LIYDEQEQKA KGVKVIDSVS NEVHLFYANI IFINASTIPT TALLLNSVSS RFPNGFGNDS GQVGHNLMDH HSSAGAFGMH DSFKNQYYKG RRPCGFLIPR YRNLNNNENL GFSRGYNIQG RGQRQEWVDL SSSNGYGSQF KKEITTPGKW MVWMAGWGEC LPYFENRVSL VPDKVDKWGQ KLIAIDFEFK DNERKMMDDI KDTATEMLIK AGFNNIDSFN YNKPGGSTVH EMGTARMGND PKTSVLNRFN QMHSVKNVFI TDGSCMTSSG CQNPSLTYMA LTARACDYAV KQLKLGNL
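The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a G+C fraction. It assumes the GC content field reports the G+C fraction of the gene sequence; the helper names are hypothetical, and only the first three blocks of the gene sequence are used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGAAGAATT TAAATTTTTA TGATGCGATT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # prints "30 0.2"
```

Applying the same two functions to the full Gene sequence field would give a value that can be compared with the reported GC content, subject to the assumption stated above.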