Gene Phep_3897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3897 
Symbol 
ID8255031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4688897 
End bp4690402 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content39% 
IMG OID644937561 
Productglycosyl transferase family 2 
Protein accessionYP_003094150 
Protein GI255533778 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTTAA TGTACCATAA AGTTGACCTC GAAAGCCCAA CAATGTGGTG GGTGACTGCC 
GATGGGTTTT ACAGACAAAT GTTTGAACTG CAAAACAGGA AGGCTGTTTA TCTGGATGAT
TACGATCCGG AAGATCCGGA TCAGGTGGTC ATTACATTTG ACGGGGTTTA TAAGAATGTT
CTGACTTACG CAGCTCCAAT TTTACATAAA TTTTCCTACC CATTTGAATT ATTTGTAACC
AGCGATTACA TAGGAAAAAC CAATGAATTT GACAGTGTAG AACCACAGGC TGATTTTGCA
TCTGTTGATG ACCTGCAAAT TCTGCAAAAA ATGGGGGGAA GGATACAATG GCATACGAAA
AGCCATCCTG ATCTTAAAGC AACTTACGAC GAAGATCTGA TTCATCAGGA ACTGACAGTG
CCTGATGAAC TAATACCCTT TGAAAAACAT GGCTTAAAAT GGTTTGCATA TCCTTATGGA
AATTTTAACG ACAAGGTTGT TGCTGAGGTT AGCAAACGTT TTAAAGGGGC TGTGTCCTGC
CACCAGGGGT CTGACACGAA CATCTACACC TTGAACCGGG TTACTGTAAC CCAAAAGCAT
CGCTTTTCTG ACCAAACCAT ATCTTGTATT ATTCCCTGTT ATAATTACGG GCACTTCCTG
GCAGAGGCCA TTGAATCTGT ACTCAGGCAA ACTATCCCGG CAGATGAAAT TATCATTGCC
GACGATTGCT CAACAGATAT GACAGCCGAG ATCTCCCTTT TTTTTCAAAA GAAACATCCG
GACCGGATCA AATACATAAA GAACCCTGAA AACCTTGGAA TTATCAGAAA CTTCAATAAA
GCAGTTAGCT TATCAACCGG CTCATATCTT GTTTTCTTAG GGGCAGACAA TCGTTTTCAA
TCAAATTATA TAGAAGAATG TGCAGGAATA CTCCATCAAC ATGCGTGTGT TGGCATTGCT
TATACAGATT TTTTATTTTT TGGCAGCAGG GCAAAGAAAA TGTATACCGA TTCTAAAAAG
GAATACCAAT CTACTGTACA CAACGGCTTT TTCAAAATAC AGTTCCCGGA AGCAGAAGAT
ATAGATGTAG CCGAGAGGCT AAAACATGAA AACTTTATAC ATGGATCTTC TATGTTTAGG
CGTATATGCT ATGAACAGGT TGGAGGTTAT CAGTCAAACC CAACGGTACC GGAAGATTAT
AACCTTTTTT TTAGCATAGT TAAAAGCGGT TATCAAATTA AAAAGGCAAA TAAGACGGTA
TTACATTACA GACAACATTC CCCTGACCAG GCAAATCATG TTTTTGGGGC CCAGGTACTG
GTCAACATTT ATATGATTAA GATAAAAGAG CTGGAGACTG AATTACGGTT CTTAAGAAAG
TACAAAATTG TGCAGGCCAT TTCATTTGCG TTCAGGATAA AGAACAATAC CAGAAAACTA
ATCCGTTATT CAAATCAGCA TGGGATGGGC AAGACCATCC GAAAAGTATT TAGCAGGTTA
TTTTGA
 
Protein sequence
MILMYHKVDL ESPTMWWVTA DGFYRQMFEL QNRKAVYLDD YDPEDPDQVV ITFDGVYKNV 
LTYAAPILHK FSYPFELFVT SDYIGKTNEF DSVEPQADFA SVDDLQILQK MGGRIQWHTK
SHPDLKATYD EDLIHQELTV PDELIPFEKH GLKWFAYPYG NFNDKVVAEV SKRFKGAVSC
HQGSDTNIYT LNRVTVTQKH RFSDQTISCI IPCYNYGHFL AEAIESVLRQ TIPADEIIIA
DDCSTDMTAE ISLFFQKKHP DRIKYIKNPE NLGIIRNFNK AVSLSTGSYL VFLGADNRFQ
SNYIEECAGI LHQHACVGIA YTDFLFFGSR AKKMYTDSKK EYQSTVHNGF FKIQFPEAED
IDVAERLKHE NFIHGSSMFR RICYEQVGGY QSNPTVPEDY NLFFSIVKSG YQIKKANKTV
LHYRQHSPDQ ANHVFGAQVL VNIYMIKIKE LETELRFLRK YKIVQAISFA FRIKNNTRKL
IRYSNQHGMG KTIRKVFSRL F