Gene Phep_3923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3923 
Symbol 
ID8255057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4722020 
End bp4723261 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content33% 
IMG OID644937587 
Productglycosyl transferase group 1 
Protein accessionYP_003094176 
Protein GI255533804 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.192357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0157049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCT GGATAATAAA CCCTTATGGA AGTCTTCCTG ACGAGAGTTG GCGCACTTAC 
CGAAGCACTA TGATTGCAAA TGCATTGGTG GCTTCAGGGC ATCGAGTTAC GCAATTTATT
TCAAATTTTG AGCATCGTAG TAAAACCTTT AGAAATGATT GCTACGAAAG TAGATCATAT
GGGCCCAACT ATTCAATCAG TATCATTCCA AGCACATCCT ATAAATCACA TATCTCGCTG
TCTCGCGTTA AATATGAAAG AAGTTATGCA AAAAATCTAA TAAAGTATGT AGCAGATTAT
GAAAAGCCAG ATATTGTTAT ACTTGCTGAA CCCGCTCTTT GTTATTATAA TATAATTCTC
AGATGGATCA AAAAAGATTT GAAAGCTAAA ATTATTATCG ACTTAATAGA TATTTGGCCT
GAACTGTTTC ATCTTTTGTT TCCAAAATCA CTTAATTTTT TGGCTAATTT AATATTTTTT
CCTTTGTATA TGTGGAGGAG AAGACTTTAT AGAAGTGTAG ATGGACTGGT AGCTGTTTCT
AAAAACTACC AAGATATAGC TTTAAAGATT AAACCATTCA AACCATCTCA CACTGATGTG
GTATATTGGA GCATTCCTGA AAATGAAATT AATAAAGAAG AGCTGATATC TAATAAAGAT
GTGATAAACT TAATAGAATC AAAGGGTTTA GAAGAAGTTT GGTGTATATA TGCAGGTACT
TTGGGCGAAA ATTATGACGT CAAGTCCATT ATAGATGCAG GAGAAGAATT GAAAAAGATA
TATTCACATA ATAGGTTTAA ACTTATAATA GCTGGTGACG GACCTCTCGC AGATTTTTGC
ATTGAAAATT CAGATCAAAA GAGTATTGTT TTTCTTGGAC GCTTAAGTTC AACTGACCTT
GCGGGGTTAT TTCGACATTG CGATATAGCT TTAAGTACTT ATAGACAGCG ATCAACGGTT
TCAATGCCTA TAAAAGCATT TGACTATTTT GCTTTTGGAT TGCCACTTGT TAATTCACTG
AAGCGTGACT TGGGATATTT TATCAATAAA ATGGAAATTG GCATTAACTA TGAAGCCGAA
TCAGTATCAG CTTTAACTGG GGCTATTAAA ATATTAGTGG ATGATAAAGA TAAAAGATTG
CAGATGAAAG AAAATGCTTT AATAGCTTCA ACTATCTTTT CTGAAAATTT ACAGTACTCA
AAATTTATTT ATGTTATAGA AAATGTTTAC TATGGTAAAT AG
 
Protein sequence
MNIWIINPYG SLPDESWRTY RSTMIANALV ASGHRVTQFI SNFEHRSKTF RNDCYESRSY 
GPNYSISIIP STSYKSHISL SRVKYERSYA KNLIKYVADY EKPDIVILAE PALCYYNIIL
RWIKKDLKAK IIIDLIDIWP ELFHLLFPKS LNFLANLIFF PLYMWRRRLY RSVDGLVAVS
KNYQDIALKI KPFKPSHTDV VYWSIPENEI NKEELISNKD VINLIESKGL EEVWCIYAGT
LGENYDVKSI IDAGEELKKI YSHNRFKLII AGDGPLADFC IENSDQKSIV FLGRLSSTDL
AGLFRHCDIA LSTYRQRSTV SMPIKAFDYF AFGLPLVNSL KRDLGYFINK MEIGINYEAE
SVSALTGAIK ILVDDKDKRL QMKENALIAS TIFSENLQYS KFIYVIENVY YGK