Gene Phep_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3893 
Symbol 
ID8255027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4685133 
End bp4686257 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content44% 
IMG OID644937557 
Productglycosyl transferase group 1 
Protein accessionYP_003094146 
Protein GI255533774 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAA AAATTCTGCT GCTCACACTG GAAACATTTT GTGCCACGGG CGGTATACAG 
AAAATGGGCC GGATACTGGC TTACGGCTTA CAGCAGTTGG GTGCAAAACA TAAATGGGAG
GCTGAGCTGT ATTCCTTATG CGACCGGAAA ACCGACTTAA TGCCCGAATA TTTAGCGGAA
GAAAAGTTTA AAGCTTTTCG CAAAAACAGG CTGAAATTTA TGTGGGAGAG CATAAAGGCA
GGAAAGAAGG CAGACCTGGT TATACTAAGC CACATCAATT TATCGGTACT CGGCTGGGCA
ATATACCTGC TTAATCCAAA TTGCCAGATC TGGCTTATTG CACATGGAAT AGAGGTTTGG
CGCCCCTTAA GGTTATGGAA AAAGTCGGTT TGGAAAATTT GCAGTAAGGT AATCTGTGTA
AGCAGGTATA CACAGGAGAA AGTTATTGCC TTACACCAGG TTGCACCCGA ACAGTGTACA
GTGGTCAACA ACGCAGTCGA CCCCTTTATC ACCTTTCCTG AACATTTCCA TAAACCCGGG
TATTTACTGG AACGGTACGA ATTAAATACA GATCAGAAAA TTGTATTTAC GCTGGCCCGC
ATTTCCGTTA CAGAACAGTA TAAAGGTTAT GATCAGGTGA TAAAAGCCCT TGGCAATCTC
GGTCAGAACA ATATACAGTA TGTGCTTGCA GGACCTTATG ATGAAGCTGA AAAGCTACGC
CTTACACAAT TGGCAAGCCA GTACGGCCTG GGCAATAATT TTATACTTCC AGGTTATATC
AAAGCTGAAG AACTGGCCGA TCATTTTTTA CTGGCTGACC TGTTTGTATT GCCCAGCAAG
AAAGAAGGCT TTGGGATTGT GTTTATAGAA GCTATGGCCT TCGGCTTACC CATCATCTGC
GGCAATGCTG ATGGCAGTGT GGATGCAGTG AAAAACCAGG AGATGGGTAC AGCCATTGAT
CCGGATGATA TCGGGGCCCT GGAACAGGCC ATCCTCCGGA ACCTTGGCCG CACCTTAAGC
ATTGGGGCAC GCAAAAGCAT TCAGCAACAA TGTTTAAAAT ATTTTAGTCA GCAGCATTAC
CTGCAAACCT TAGAACGGTT AATCAAAAAT GAAGCCTGTA ACTGA
 
Protein sequence
MSKKILLLTL ETFCATGGIQ KMGRILAYGL QQLGAKHKWE AELYSLCDRK TDLMPEYLAE 
EKFKAFRKNR LKFMWESIKA GKKADLVILS HINLSVLGWA IYLLNPNCQI WLIAHGIEVW
RPLRLWKKSV WKICSKVICV SRYTQEKVIA LHQVAPEQCT VVNNAVDPFI TFPEHFHKPG
YLLERYELNT DQKIVFTLAR ISVTEQYKGY DQVIKALGNL GQNNIQYVLA GPYDEAEKLR
LTQLASQYGL GNNFILPGYI KAEELADHFL LADLFVLPSK KEGFGIVFIE AMAFGLPIIC
GNADGSVDAV KNQEMGTAID PDDIGALEQA ILRNLGRTLS IGARKSIQQQ CLKYFSQQHY
LQTLERLIKN EACN