Gene Phep_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3902 
Symbol 
ID8255036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4694681 
End bp4695709 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content41% 
IMG OID644937566 
Productglycosyl transferase group 1 
Protein accessionYP_003094155 
Protein GI255533783 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.661278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG CCATCCTGGT TAATCCACTG ATTCCCGTAC CACCCGAACA ATATGGGGGC 
ATTGAACGGA TCGTTTACCT GCTGATCAAA GAACTCCAGA GAAATGGCCA TGAGGTTATA
CTATATGCGC ACAAAAACTC ACAGGCCGGT TGTAAGTTAA TCGCCTATCA GGAATCCGTA
AATTATGGTG CAAAAGATTT TATAAAGATT AATGCCTTAA CTGCAAAAAT TGCTTTTCAG
GATTTTGATG TGTTGCACAC CTTTGGACGT ATGAACAATA TCGCTTTGAT GATGTGGAGC
AAGATACCAA AGGTGGTATC CTATCAATTG CCCCCTACTA TTTCACAGGT AAAAAAAGCC
ACAAAAATAG CCTTCAAAAA TACTTTGTAT TTTACTGCCT GCAGTAATTT CATAGCCAGG
CAGATCAATA AATTTGCAAA TGTTACTACC ATTTACAATG GGGTAAACAT CAACGAATAT
CAGTTTAACG CAACAGTATC CGCTGATGCC CCACTTGTAT TTTTAGGAAG GATACAGGAA
GAAAAAGGTA CATCCATTGC CATACAGGTA GCAAGGACAA CAGGCCGGAA ACTAATTATT
GCCGGTAATA TCCCTGCAGA AGAAACCCAC AAGCAATATT TTAGCACCAA AGTAAAACCA
TTTATAGACG ATGTGCAGAT CAGCTATATT GGCCCGGTAA ACAATTTTCA AAAAAACGAG
TTACTTGGAA ACAGTTATGC TCTGTTAATG CCGGTAACCT GGGACGAACC TTTTGGTATT
GTAATGGCCG AAGCTTTGGC TTGCGGGACA CCGGTAATTG GTTTTAACAG GGGCGCTATA
CCCGAAGTGG TCATTAATGG ATTAAATGGT TTTGTATGCA ATACCCTTAC CGAAATGATT
GCCGCGGTTG GCCACATCCC AGAGGTCAGC AGGCTTACAT GTCGTGGTAC TGCTGAAGAC
AGGTTTAATG CCGTTGTGCT GGGCAAACAA TATGAAAACC TTTACAGAAA GGCGATAAAC
AGGCGTTGA
 
Protein sequence
MKIAILVNPL IPVPPEQYGG IERIVYLLIK ELQRNGHEVI LYAHKNSQAG CKLIAYQESV 
NYGAKDFIKI NALTAKIAFQ DFDVLHTFGR MNNIALMMWS KIPKVVSYQL PPTISQVKKA
TKIAFKNTLY FTACSNFIAR QINKFANVTT IYNGVNINEY QFNATVSADA PLVFLGRIQE
EKGTSIAIQV ARTTGRKLII AGNIPAEETH KQYFSTKVKP FIDDVQISYI GPVNNFQKNE
LLGNSYALLM PVTWDEPFGI VMAEALACGT PVIGFNRGAI PEVVINGLNG FVCNTLTEMI
AAVGHIPEVS RLTCRGTAED RFNAVVLGKQ YENLYRKAIN RR