Gene Phep_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3947 
Symbol 
ID8255081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4746749 
End bp4747771 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content36% 
IMG OID644937611 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_003094200 
Protein GI255533828 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.689942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.177131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGATT TAAAGAATAA AAGTATTTTA ATAACTGGAG GGACAGGTTC TCTGGGAAAA 
GCATTAACAA AAGAGATTTT GTCAAAATAT CCTGAGGTAA GAAGACTTAT CATTTTTTCT
AGGGATGAGC AAAAGCAATT TCAGATGGCT CAGGAATATC CAGAAAAACA ATATCCACAG
ATAAGATTCT TTATTGGAGA CGTAAGAGAT TCACAAAGAT TGACCAGAGC TTTTAAAGGT
GTAGATTACG TTATACATGC GGCTGCAATG AAACATGTCC CGATAGCTGA ATATAATCCT
GACGAGTGTA TTAAGACAAA TATAAATGGC GCACAAAACG TTATTCATGC TTGTTTTGAG
ACTAATGTGC AAAAGGTAGT TGCTTTGTCT ACTGACAAGG CTTGTGCGCC AATCAATTTA
TATGGTGCAA CAAAATTAAC ATCAGATAAA CTTTTTGTCG CGGCAAATAA TATAAAAGGG
AATAGTCCGA TTATCTTCTC GGTAGTGCGA TATGGTAATG TAATGGGATC AAATGGATCT
GTAATTCCAT TTTTTTTAAA TAAAAGAACA GAAGGAAAAT TGCCTATTAC CGATGCTGAA
ATGACACGCT TTAATATTTC ACTGCAGGGC GGAGTAGATA TGGTTATGCA TGCACTGGGT
AACGCTTGGG GTGGAGAGAT ATTTATCCCA AAGATACCAT CTTATAAAAT AACTGATGTT
GCTACTGCTA TTGGTCCAAA TTGTGAACAG GTTCTGGTAG GAATCAGACC AGGGGAAAAG
GTTCATGAAG AGATGATTAC ACCTTCTGAT TCTTTTTACA CCTATGATTT GGGGAAATAT
TATACGATTT TACCGGCAAC TCACCATTGG AGCATTGAAG ATTTCAAAAG TACTTTTAAT
GCTCAGCTGG TAAAGCCAGG TTTCGCTTAC AATTCTGGTG AAAACACGGA ATGGGAAACG
GTAGATACAT TGCGATCTTT AATTAAAGAA CATGTTGACG TAAGTTTTAA CTACAATGAA
TAA
 
Protein sequence
MLDLKNKSIL ITGGTGSLGK ALTKEILSKY PEVRRLIIFS RDEQKQFQMA QEYPEKQYPQ 
IRFFIGDVRD SQRLTRAFKG VDYVIHAAAM KHVPIAEYNP DECIKTNING AQNVIHACFE
TNVQKVVALS TDKACAPINL YGATKLTSDK LFVAANNIKG NSPIIFSVVR YGNVMGSNGS
VIPFFLNKRT EGKLPITDAE MTRFNISLQG GVDMVMHALG NAWGGEIFIP KIPSYKITDV
ATAIGPNCEQ VLVGIRPGEK VHEEMITPSD SFYTYDLGKY YTILPATHHW SIEDFKSTFN
AQLVKPGFAY NSGENTEWET VDTLRSLIKE HVDVSFNYNE