Gene Phep_3919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3919 
Symbol 
ID8255053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4717467 
End bp4719425 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content38% 
IMG OID644937583 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_003094172 
Protein GI255533800 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0124304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACAA AACTACAAAT TGTATCTCGC TGGGTTATTT TTACGATTGA TATTTGTTTA 
AGTGTCATTG CATTGTTATT TGCGATCATT TTGCAAAATA ATTTCATTAT CGATACAATA
GATTTTCTTG CTTTTTATAA AGCTGTAGTT TTAGTGATTA TAGTGAACTC TTTTGTATTT
TACAGCGTTA AAACATTTGC AGGCATTGTT AGATATACTT CTGCACAGGA TTCTTTCAGG
ATTTTATTTG CTGTTGTTCT GAGTTCATTG ATCCTGTTTT TTACCCATGC ATTGGCAATA
GTGATTACAG GTGAGCGTGT GATTAGTAGT GTAGCCATCA TCACCTATAC TTTATTTAAT
TTCTTAATGC TGATTACTTA CCGGATCATT GTAAAGTACT TTTTTATGTA CATAAAGAAT
GCTAATCTGG ACAAAAGAAG AATTATCATT TATGGTGCTG GAGAGGCAGG TGTAGCTACC
AAAAGAACTT TTGACCATGA TCCTAAAATA AATAAGACCA TTATTGCCTT TGTGGATGAT
GATCTGCGGA AAGTAGGTAA AACCATTGAT GGCGTGAGAA TTCTTGATGC AGCACAATTG
GAAGATCTGA TTACAAAACA TGAAGTGGAT GAAGTGATCT TTGCCTCTTA TACCATTCCA
TCGGAGCGCA AGAACCAGGT GGTGGATATA TGTTTAGAAA ATGATGTTAA AATATTAAAT
ATACCTTCAC CCGAGGTTTG GGCTAAGGGA CATGTGACCA CGGCACAAAT CCAGAATATC
AATATTGAGG ACTTGCTAAA CAGGAAGACA ATCGACATTG ATATAGAAGG TATTCAGAAC
CAATTGAAAG GGAAAAGAAT ACTGATAACT GGAGCAGCAG GCTCTATAGG CAGTGAGATT
GTAAGACAGT TGTTAAAGTT TGAAACAGGG CTGATTATTT TATGTGACCA GAGCGAAACG
GCTTTGCATA ACATTTACCT GGAACTGGAA GAGACTCATG TAAATACCAA TTTCCATGCT
TTTATCGGGG ATGTGAAAGA CCAGAAACGG ATGGAGACTT TGTTCAATAC TTATAAACCA
CATTACGTAT ACCACGCAGC GGCCTACAAG CACGTGCCTT TGATGGAAGA TAACCCTGCA
GAGGCAATTA AAACAAATGT AATGGGCACA AAAACCATTG CTGATCTTTC GGTTAAGCAT
GGGGTGCAAA AATTTGTAAT GATCTCTACA GATAAGGCCG TAAATCCTAC CAATGTAATG
GGTGCTTCAA AACGGATTGC GGAGATTTAT GTACAGTCAC TAAATAATTC CCTGAATAAC
CCCGATCTGA TCTTCTCCAA TGGATTGAGT TATTTAAATG ATTCCAATAT CAAACCCATC
ACAAAGTTCA TCACTACTCG GTTTGGGAAT GTATTGGGTT CAAATGGTTC TGTCATTCCT
AGATTCAAAC ACCAGATCGA AAATGGTGGT CCGGTTACCG TTACCCATCC AGAAATTACC
CGCTATTTTA TGACCATACC CGAAGCTTGT CGTTTGGTAT TGGAAGCGGG TTGCATGGGT
AAAGGAGGCG AGATTTACGT ATTCGACATG GGTAAATCAG TAAAGATTGT GGAATTGGCC
AAGAAAATGA TTCGCTTAGC GGGTTTAGTT CCTAACCAGG ACATCAAGAT TTCCTATTCA
GGTTTGAGGC CTGGTGAAAA GCTATTTGAA GAGCTTTTAA ACGATAGTGA AATCACTAAG
CCTACACACC ATGAAAAAAT TATGATTGGG CAGGTTAGAG AGTATATTTT TAATGAAATA
GAAACCCAGA TTTATCAGTT GCTAAATCAC GCAAGTTCTG GCAATACGAG GCAGGTAGTA
AGGCAAATGA AGGTCATTGT TCCTGAATTT ATTAGTAAGA ATTCTGTATT TGAAGAATTA
GACGCAGAGG TTCCAGTAGA AGAAAGCCCC CTAAAATGA
 
Protein sequence
MFTKLQIVSR WVIFTIDICL SVIALLFAII LQNNFIIDTI DFLAFYKAVV LVIIVNSFVF 
YSVKTFAGIV RYTSAQDSFR ILFAVVLSSL ILFFTHALAI VITGERVISS VAIITYTLFN
FLMLITYRII VKYFFMYIKN ANLDKRRIII YGAGEAGVAT KRTFDHDPKI NKTIIAFVDD
DLRKVGKTID GVRILDAAQL EDLITKHEVD EVIFASYTIP SERKNQVVDI CLENDVKILN
IPSPEVWAKG HVTTAQIQNI NIEDLLNRKT IDIDIEGIQN QLKGKRILIT GAAGSIGSEI
VRQLLKFETG LIILCDQSET ALHNIYLELE ETHVNTNFHA FIGDVKDQKR METLFNTYKP
HYVYHAAAYK HVPLMEDNPA EAIKTNVMGT KTIADLSVKH GVQKFVMIST DKAVNPTNVM
GASKRIAEIY VQSLNNSLNN PDLIFSNGLS YLNDSNIKPI TKFITTRFGN VLGSNGSVIP
RFKHQIENGG PVTVTHPEIT RYFMTIPEAC RLVLEAGCMG KGGEIYVFDM GKSVKIVELA
KKMIRLAGLV PNQDIKISYS GLRPGEKLFE ELLNDSEITK PTHHEKIMIG QVREYIFNEI
ETQIYQLLNH ASSGNTRQVV RQMKVIVPEF ISKNSVFEEL DAEVPVEESP LK