Gene Phep_2031 details


Gene Information

Locus tag: Phep_2031
Symbol:
ID: 8253135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2340082
End bp: 2341263
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 38%
IMG OID: 644935679
Product: glycosyl transferase family 2
Protein accession: YP_003092298
Protein GI: 255531926
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
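
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2340082
END_BP = 2341263
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2341263 - 2340082 + 1 gives 1182 bp, and 1182 / 3 - 1 gives 393 aa, matching the listed gene and protein lengths.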


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.000685673
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTATAG CTTTCTGGAT TTGCCTCTTC ATTATCATTT ACACTTTTAT TGGATATGGT 
CTTGTTTTGT TCTTTCTGGT TAAAATTAAA AGAATATTCA CAAAACCACA GGTTTTTATA
CCTGTACCAG ATGATCTTCC TAATGTTACC TTGTTAATAG CCGCCTATAA TGAGGAGGAC
ATTATTGCCG ACAAGGTAAA TAATACATTG GAACTGAATT ATCCAAAAGA CAGATTACAG
ATTGTCTTTA TTACGGACGG TTCCAGTGAC CGCACGGTTG AACGGTTAAG AAATAGGGAG
GGCATTACTT TGTTGCATGA AGATACACGC GCTGGAAAAA TGGCAGCCAT TAAACGGGCC
ATACCTTTTA TCAATGGAGA CATCACTGTA TTTACAGATG CAAATACCTT TTTAAATAAA
GATGCCATCC TTGAGTTGGT AAAACACTAT CAGAACAATA AAGTTGGTGC AGTGGCTGGC
GAAAAAAGAA TTTTGGTGGA AGATAAAGCC GATGCCAGTT CGGCAGGAGA AGGCTTTTAC
TGGAAATACG AATCAGCACT TAAAAAGTGG GACTATGAGC TATATTCTAA TGTAGGAGCT
GCCGGAGAAT TATTTAGCAT CAGAACAGCA TTGTATCAGC CTGTTGAATC GGATACCATT
ATTGACGACC ATATGATTGC CATGCGAATT GCTGAAAAAG GTTATGTTAT TGCCTACGAA
CCCAATGCTT ATGCCATGGA AACAGCCTCG GCAAATACCA AAGAAGAATT AAAAAGAAAA
ATAAGAATAG CAGCGGGAGG CATTCAGTCC ATCTTAAGAC TAAAGAAAGC AGCAAATCCG
CTTTATTATC CTGTGCTCAC ATTTCAATAT ATCAGTCACA GGGTTTTAAG ATGGACGGTT
ACCCCAATTT TGCTTGTAGT CACTTTTCTG TTGAATGGTT TAATTGTGTT AAATGGTGAC
AGAGGGATTT ATCTGGTTAT CTTTGGTGCC CAGGTTGTGT TTTACGTCCT GGGCCTGACG
GGGATGATCT TTGAAAGAAG GAACATTAGA ATCAAAAGTT TCTTCATCCC ATATTATTTT
TGTGTAATGA ATTATGCAGT AATTGCTGGA GCCATCAGAT ATTTTAAAAG ACAACAAAGC
GCGGCATGGG AAAAATCTGA AAGAAAAACA GCTCAAACCT GA
 
Protein sequence
MIIAFWICLF IIIYTFIGYG LVLFFLVKIK RIFTKPQVFI PVPDDLPNVT LLIAAYNEED 
IIADKVNNTL ELNYPKDRLQ IVFITDGSSD RTVERLRNRE GITLLHEDTR AGKMAAIKRA
IPFINGDITV FTDANTFLNK DAILELVKHY QNNKVGAVAG EKRILVEDKA DASSAGEGFY
WKYESALKKW DYELYSNVGA AGELFSIRTA LYQPVESDTI IDDHMIAMRI AEKGYVIAYE
PNAYAMETAS ANTKEELKRK IRIAAGGIQS ILRLKKAANP LYYPVLTFQY ISHRVLRWTV
TPILLVVTFL LNGLIVLNGD RGIYLVIFGA QVVFYVLGLT GMIFERRNIR IKSFFIPYYF
CVMNYAVIAG AIRYFKRQQS AAWEKSERKT AQT