Gene Phep_3930 details


Gene Information

Locus tag: Phep_3930
Symbol:
ID: 8255064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4729934
End bp: 4731172
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 30%
IMG OID: 644937594
Product: glycosyl transferase group 1
Protein accession: YP_003094183
Protein GI: 255533811
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
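
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that does not contribute to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4729934
end_bp = 4731172

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which is not part of the protein product.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1239
print(protein_length)  # 412
```

Both results agree with the Gene Length (1239 bp) and Protein Length (412 aa) fields.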


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0476116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAATC ATAAAATCAA GGTATTGGTG TTTACGGACT GCTATATTTA TGGAGGCAGT 
GAAAGACTAA TGTCTTTTCT TATAAACAAT AAAAATCTAC AGAATAAATT TGATCTTAAA
CTAAGCTACA GGGCTTATAA GGATTATGAA AGAGGAATGT CTAGAGATTA TCAGGACTTA
CCAAAGTCAT TAATTCTTCC TCAACAACTT TTATCTAATG AATCTTTATT TTATAAAATA
AATTGTTTAA AGCTTTCAAA GATTATAAAA TTACCCTTTT TTTTCATCCA AAAAATCCAG
GTTTACTCAA TTTGGAATCT GATGGTTTTC TATTTCTTGT TGAAAAAGGA GAGGCCAGAC
ATTTTGCATA TTAACAATGG TGGGTATCCT GGAGCAAAGA GTTGTAATAT GTTGGTTTTA
GCAAATAAAT TATTTTGTAA CGCAAAGGTT ATTTATCAAG TGAATAATCA GGCTACAATC
TCAAAAAAAA TTGACCTGCC TACAGATCAA TTTATTTGTA GAAATGTAGA TGTTTTTTTA
ACAGCGTCAA ATTTGGCGAA GCAAATGTTG ATAAAGAATA GACAATTCCC TAATGATAAG
ATAAAGATTA TAAATAATTG TATAATTCCT GGTTTAAAAG AAAAAAACCG AGAAGAAATT
TGCCACGAAC TTTCCCTACC TATTGAAAGT TTTATAATCG TACAGGTTGG TTTTTTGACC
GAACGTAAGG GGCAAAGAAT GCTTATATTA GCAATGGATT TGCTATTAAA AATACATCCA
GAACTTGAAA CAAAAATTTC TGCCTTGTTT ATTGGAAATG GTGAAGACGA GATAAAACTT
ACGCAGTTAG TAAATGAATT AGGCCTGGAA AAAAATATAA CTTTTCTCGG ATACAAGTCA
AATAGTCATG ATTATATAAA TGCGTGCGAT TTATTTGTAT TACCCTCTAT TAGTAATGAA
GATATGCCAT TAGTATTGTT GACGGCGCTA GAATTAGGAA AGCCTATAAT TGCAAGCAAT
TTTGCGGGAA TATCGCAGGT TATAACATCT GAAGTCAATG GTATGCTTAT TGATATCAAT
GATGTGAGTT TTCATCAGGA ATTGTATGAA ATTATATATA AGCTTTATAT GAATTTAGCT
TTGCGAACAC TTCTCGGCAA TAATGCGAGA AAATCTTTTA TAGAATATTC GCCTGAAAAA
TATGGACAAA ATATCGAAAA TTTATATCGT CAACTATGA
 
Protein sequence
MINHKIKVLV FTDCYIYGGS ERLMSFLINN KNLQNKFDLK LSYRAYKDYE RGMSRDYQDL 
PKSLILPQQL LSNESLFYKI NCLKLSKIIK LPFFFIQKIQ VYSIWNLMVF YFLLKKERPD
ILHINNGGYP GAKSCNMLVL ANKLFCNAKV IYQVNNQATI SKKIDLPTDQ FICRNVDVFL
TASNLAKQML IKNRQFPNDK IKIINNCIIP GLKEKNREEI CHELSLPIES FIIVQVGFLT
ERKGQRMLIL AMDLLLKIHP ELETKISALF IGNGEDEIKL TQLVNELGLE KNITFLGYKS
NSHDYINACD LFVLPSISNE DMPLVLLTAL ELGKPIIASN FAGISQVITS EVNGMLIDIN
DVSFHQELYE IIYKLYMNLA LRTLLGNNAR KSFIEYSPEK YGQNIENLYR QL
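
The protein sequence can be spot-checked against the gene sequence by translating codons. The following is a minimal sketch rather than the tool that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The names `translate`, `first_block`, and `last_block` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Both start on a codon boundary (positions 1 and 1201).
first_block = "ATGATTAATC ATAAAATCAA GGTATTGGTG TTTACGGACT GCTATATTTA TGGAGGCAGT"
last_block = "TATGGACAAA ATATCGAAAA TTTATATCGT CAACTATGA"

print(translate(first_block))  # MINHKIKVLVFTDCYIYGGS
print(translate(last_block))   # YGQNIENLYRQL
```

The two outputs match the start and end of the protein sequence listed above, and the final TGA codon accounts for the one-codon difference between 1239 bp / 3 and the 412 aa protein length.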