Gene Phep_4128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4128 
Symbol 
ID8255263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4989467 
End bp4990453 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content39% 
IMG OID644937793 
Producthypothetical protein 
Protein accessionYP_003094381 
Protein GI255534009 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0321104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.774041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TATTTAATTT AAAAAATGTT AAGGTTGAAG TGAAAGTTTT CACTTTTAAA 
TGCACGTTAT TGGTATGGGC ATTAACAACT GTGCTTTACG GGTGTAAAAA AGATAAAAAA
GAAGATGGGG GTACAGATGT ACCAGAAGAA GTAGTGGATA ATGTGATTGA ACCTCTTACC
AAGAAGATTA TGGACAATAC CACGGTAATT GGAACATTCA TTTCGGATGA GACAGGTTCT
GTTACTGCTG GAATAAATAT TACCAGGCTG GCTTTTCTCA GGAAAGATAA ATTGCCTGTA
AGAATCTTTA TTATGGAAGT TGATATGAAA ACGCCAAAAC TTGAAATTCA GGCAATGGCG
CCTTATAATG ACTACATTAA TGGTTTGCAG AGGCTTTCCG AAATGTGCAG GGACAATGAA
CTTCCGGGAA CAAATATTGT TGCTGCTGTT AATGGTGATA CCTTTAGTAC AACAGGTGCC
CCAACCAGTT TGTTTTATAT AAATAATCGG GTTTACTATG GTACTGTTGC TACAGGGAGA
ACCTTTTTTG CTGCAATGAA GGATGGGACA ATAGTTATTG GGGGAAAGGA TACAAAGGGG
GTAGAAAGAC CTGTTGATAA AGCCCAGATT AAGAATGCAG TTGGGGGGAA TCAGTGGCTG
GTAGACAACA ATATAAAAGC CACTTTGACT GACGCTACGA TTAGTGCCCG GACAGCAATT
GGTTATAATG CCAATAAGGT AATTTATGCA ATTGTAGTGG ATGGATCACA AGCTACTTAT
TCAAATGGTT TAACGCTTGT TGACTTAAGA GATATTATGG CTGCGCTTGG TACAAAAGAC
GCAGTCAACC TTGACGGAGC TTCGTCCTCA ACTTTAGTGG CTAAGGATTT GACTAAAGGA
ACGTGGAATG TTTTAAATAA GCCTGCATTG GCACTTAATG CAGAAAGGTT AATTGGAAAC
GGGCTTGGCT TTATCCTTAA AAACTAA
 
Protein sequence
MKKLFNLKNV KVEVKVFTFK CTLLVWALTT VLYGCKKDKK EDGGTDVPEE VVDNVIEPLT 
KKIMDNTTVI GTFISDETGS VTAGINITRL AFLRKDKLPV RIFIMEVDMK TPKLEIQAMA
PYNDYINGLQ RLSEMCRDNE LPGTNIVAAV NGDTFSTTGA PTSLFYINNR VYYGTVATGR
TFFAAMKDGT IVIGGKDTKG VERPVDKAQI KNAVGGNQWL VDNNIKATLT DATISARTAI
GYNANKVIYA IVVDGSQATY SNGLTLVDLR DIMAALGTKD AVNLDGASSS TLVAKDLTKG
TWNVLNKPAL ALNAERLIGN GLGFILKN