Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3919 |
Symbol | |
ID | 8255053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4717467 |
End bp | 4719425 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644937583 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_003094172 |
Protein GI | 255533800 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0124304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACAA AACTACAAAT TGTATCTCGC TGGGTTATTT TTACGATTGA TATTTGTTTA AGTGTCATTG CATTGTTATT TGCGATCATT TTGCAAAATA ATTTCATTAT CGATACAATA GATTTTCTTG CTTTTTATAA AGCTGTAGTT TTAGTGATTA TAGTGAACTC TTTTGTATTT TACAGCGTTA AAACATTTGC AGGCATTGTT AGATATACTT CTGCACAGGA TTCTTTCAGG ATTTTATTTG CTGTTGTTCT GAGTTCATTG ATCCTGTTTT TTACCCATGC ATTGGCAATA GTGATTACAG GTGAGCGTGT GATTAGTAGT GTAGCCATCA TCACCTATAC TTTATTTAAT TTCTTAATGC TGATTACTTA CCGGATCATT GTAAAGTACT TTTTTATGTA CATAAAGAAT GCTAATCTGG ACAAAAGAAG AATTATCATT TATGGTGCTG GAGAGGCAGG TGTAGCTACC AAAAGAACTT TTGACCATGA TCCTAAAATA AATAAGACCA TTATTGCCTT TGTGGATGAT GATCTGCGGA AAGTAGGTAA AACCATTGAT GGCGTGAGAA TTCTTGATGC AGCACAATTG GAAGATCTGA TTACAAAACA TGAAGTGGAT GAAGTGATCT TTGCCTCTTA TACCATTCCA TCGGAGCGCA AGAACCAGGT GGTGGATATA TGTTTAGAAA ATGATGTTAA AATATTAAAT ATACCTTCAC CCGAGGTTTG GGCTAAGGGA CATGTGACCA CGGCACAAAT CCAGAATATC AATATTGAGG ACTTGCTAAA CAGGAAGACA ATCGACATTG ATATAGAAGG TATTCAGAAC CAATTGAAAG GGAAAAGAAT ACTGATAACT GGAGCAGCAG GCTCTATAGG CAGTGAGATT GTAAGACAGT TGTTAAAGTT TGAAACAGGG CTGATTATTT TATGTGACCA GAGCGAAACG GCTTTGCATA ACATTTACCT GGAACTGGAA GAGACTCATG TAAATACCAA TTTCCATGCT TTTATCGGGG ATGTGAAAGA CCAGAAACGG ATGGAGACTT TGTTCAATAC TTATAAACCA CATTACGTAT ACCACGCAGC GGCCTACAAG CACGTGCCTT TGATGGAAGA TAACCCTGCA GAGGCAATTA AAACAAATGT AATGGGCACA AAAACCATTG CTGATCTTTC GGTTAAGCAT GGGGTGCAAA AATTTGTAAT GATCTCTACA GATAAGGCCG TAAATCCTAC CAATGTAATG GGTGCTTCAA AACGGATTGC GGAGATTTAT GTACAGTCAC TAAATAATTC CCTGAATAAC CCCGATCTGA TCTTCTCCAA TGGATTGAGT TATTTAAATG ATTCCAATAT CAAACCCATC ACAAAGTTCA TCACTACTCG GTTTGGGAAT GTATTGGGTT CAAATGGTTC TGTCATTCCT AGATTCAAAC ACCAGATCGA AAATGGTGGT CCGGTTACCG TTACCCATCC AGAAATTACC CGCTATTTTA TGACCATACC CGAAGCTTGT CGTTTGGTAT TGGAAGCGGG TTGCATGGGT AAAGGAGGCG AGATTTACGT ATTCGACATG GGTAAATCAG TAAAGATTGT GGAATTGGCC AAGAAAATGA TTCGCTTAGC GGGTTTAGTT CCTAACCAGG ACATCAAGAT TTCCTATTCA GGTTTGAGGC CTGGTGAAAA GCTATTTGAA GAGCTTTTAA ACGATAGTGA AATCACTAAG CCTACACACC ATGAAAAAAT TATGATTGGG CAGGTTAGAG AGTATATTTT TAATGAAATA GAAACCCAGA TTTATCAGTT GCTAAATCAC GCAAGTTCTG GCAATACGAG GCAGGTAGTA AGGCAAATGA AGGTCATTGT TCCTGAATTT ATTAGTAAGA ATTCTGTATT TGAAGAATTA GACGCAGAGG TTCCAGTAGA AGAAAGCCCC CTAAAATGA
|
Protein sequence | MFTKLQIVSR WVIFTIDICL SVIALLFAII LQNNFIIDTI DFLAFYKAVV LVIIVNSFVF YSVKTFAGIV RYTSAQDSFR ILFAVVLSSL ILFFTHALAI VITGERVISS VAIITYTLFN FLMLITYRII VKYFFMYIKN ANLDKRRIII YGAGEAGVAT KRTFDHDPKI NKTIIAFVDD DLRKVGKTID GVRILDAAQL EDLITKHEVD EVIFASYTIP SERKNQVVDI CLENDVKILN IPSPEVWAKG HVTTAQIQNI NIEDLLNRKT IDIDIEGIQN QLKGKRILIT GAAGSIGSEI VRQLLKFETG LIILCDQSET ALHNIYLELE ETHVNTNFHA FIGDVKDQKR METLFNTYKP HYVYHAAAYK HVPLMEDNPA EAIKTNVMGT KTIADLSVKH GVQKFVMIST DKAVNPTNVM GASKRIAEIY VQSLNNSLNN PDLIFSNGLS YLNDSNIKPI TKFITTRFGN VLGSNGSVIP RFKHQIENGG PVTVTHPEIT RYFMTIPEAC RLVLEAGCMG KGGEIYVFDM GKSVKIVELA KKMIRLAGLV PNQDIKISYS GLRPGEKLFE ELLNDSEITK PTHHEKIMIG QVREYIFNEI ETQIYQLLNH ASSGNTRQVV RQMKVIVPEF ISKNSVFEEL DAEVPVEESP LK
|
| |