Gene Phep_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1956 
Symbol 
ID8253060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2258074 
End bp2259267 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content32% 
IMG OID644935607 
Productglycosyl transferase group 1 
Protein accessionYP_003092226 
Protein GI255531854 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.454805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.000698573 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTTTAA AAAACAAGGT TATTATCTTT TTGGGAAACA CACGTTTTGA TAGTGATATT 
AAGGCTACCT CTCTTTTTAT TGCAAGGAGT TTATCAAAAG ATAATAAAGT TTTTTTCATC
GATTACCCTT TTACACTTAA AGATTATTTT CAACATAAAA AATCAGAAAG CTTAAAAAAC
CGAAAAAAGA AATTCTCTCT GTTTTCGGAT GGAATATTAG ATACTGATTT ACCCAATTTA
AAAATTGTAA TTACTCCCCC GGTTTTACCG ATTAACTTTT TGCCGGAAAG TTTTTTATTT
AGATCTCTAC TTAAGGTTAA TGAATCAATT ATATCCAGGA GAATAAAAAA AATTCTTAGG
ACAAGGAAAA TTGACCAATT CATTTACATT AACTCATTTA ACTTCCATTA CCCTAATATA
TCAAGTTATA TTAAGCCAGC TTTGAGTGTT TATCATTGTG TTGATCCCAT GATTGTACCA
TACGACATGA AACATGGGAT AATATCTGAA AAGCAATTGG TTGAAGGAAG TGATTTGGTA
ATCTGTACAA GCAGGGCGCT TTATGAAGAA AAAAAAGCAC AAAATAAAAA TACCTTCTTT
GTACCAAATG GAACTGATTT GAGCAATAAC CCTGTTGTTT GCGAAACTGA ACATCCAAAG
TTAAAATCAT ATCCAAAACC CGTAGTGGGA TATTTAGGTA CGATTGAAAG ACGAATTGAC
TACGAACTGC TGAAGGAAGT TATTGAGGCA AACCAAGATA AAAGTTTTGT ACTTGTTGGA
CCTGTTTACA GGAACTTTGT TCCAGATGAA TATTATAAGT TTAAAAATGT ACATATCCTT
GGTCCGATAC CTTACGAAGA AGCAGCCCAA ATAATCAGCA GTTTCGACAT AGCAATTATT
CCATTTAAGC TTGATGAAGT GAGTAAGACC ATCTTTCCCA TTAAACTTTT TGAGTATTTA
AGTATTGGCA AGCCGGTTGT ACTTACTGAT TTTAACCCCG ACCTCAAAGA ATTTACGTCG
CAAGAATTAG TGAGTTACTG TAATAACGCC AAATCTTTTT CAATGGCTAT TAACAACGAG
CTGGCAACTA ATAATCAAGT AAAATTAGAA TTACGAAAAA AATTAGCTTT AGAAAATACT
TGGGATAAAC GTGCGGAACA GATTAAAGAA ATAATAGACA GCTTTATAAA ATAA
 
Protein sequence
MALKNKVIIF LGNTRFDSDI KATSLFIARS LSKDNKVFFI DYPFTLKDYF QHKKSESLKN 
RKKKFSLFSD GILDTDLPNL KIVITPPVLP INFLPESFLF RSLLKVNESI ISRRIKKILR
TRKIDQFIYI NSFNFHYPNI SSYIKPALSV YHCVDPMIVP YDMKHGIISE KQLVEGSDLV
ICTSRALYEE KKAQNKNTFF VPNGTDLSNN PVVCETEHPK LKSYPKPVVG YLGTIERRID
YELLKEVIEA NQDKSFVLVG PVYRNFVPDE YYKFKNVHIL GPIPYEEAAQ IISSFDIAII
PFKLDEVSKT IFPIKLFEYL SIGKPVVLTD FNPDLKEFTS QELVSYCNNA KSFSMAINNE
LATNNQVKLE LRKKLALENT WDKRAEQIKE IIDSFIK