Gene Phep_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4047 
Symbol 
ID8255181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4890369 
End bp4891637 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content50% 
IMG OID644937711 
Productglycosyl transferase group 1 
Protein accessionYP_003094300 
Protein GI255533928 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAA CCTTGATCAA TACATCCGAT GCAGGCGGGG GTGCTCCCGT AGCCAGTATG 
CGCCTGTTAA AAGCGCTGGC ATCAGAAAAC ATCGAGGTAA ACTTCCTGGC CGGCGATAAA
AAGACCAGCG AGGCCCGCGT TCAGGCCGTT CAAAATAACT TATGGCAGCG GTTAAAGGCC
CGTTTTAATT TTTTATATGA ACGCCTGCCC TTTATCCTTT TTTATGAAAA GGACAAATCT
GTACGCTTCG CCTTTTCAAC GGCCAATGCC GGAACCAGCA TTGCCGGTCA TCAGCTGGTT
AAAGAGGCCG ACATTTTACA CCTGCACTGG ACCAACTCCG GCTTTCTTTC TATCCGAGAT
TTAACAGAAC TGATGAAACT GGGCAAACCG GTAGTGTGGA CCCTCCACGA TATGTGGGCA
TTTACAGGGG GCTGCCATTA TGCGGGCACC TGCGACCATT TTACAGGGGA ATGCGGCGAT
TGTTACTTTT TAAGAGACCC TGCCGGGAAC GACATTTCCC ATACCGGCTG GCTGCACAAG
CAGGCCATGT ATGCCAGCAG CCCAAACCTC ACTTTTGTAA GCTGCAGCAA CTGGCTGGGC
AAGGTGGCCC GTCAAAGTTC ATTAGTAAAG GATTTTCCTG TCCGGGCCAT TGCCAACCCC
ATCGATATAG AGGTATATGC CCCTAAAGCC AGGATGGCTG CCCGTACCAA ATGGAACATC
AGCAACAACA GCAGGATCAT TTTATTTGGT GCAGCCAATA TAGGCGACCG CCGTAAAGGG
ATTGGCTATC TGGTACAGGC ACTGCAGCAT TTAAAGCAAC ATTACGAATT GCCCCTTCCG
GTTGAGGTAC TGATTTTCGG GAAAAACAAA CACTTTGATG TCAGCCAGCT CCCCTTTCCT
GTTCATGAGC TGAATACCAT TAGCGCAGCA CAGGACCTGG CCGAACTGTA CAGCCTGGCC
GATGTATTTG TAATGCCTTC TGTTGAAGAC AACCTGCCCA ATACCGTTAT GGAATCTATG
GCCTGCGGTA CCCCCGTAGT GGCTTTTGAT ACAGGTGGCC TGCCGGAAAT GATAGACCAC
CAGTTAAATG GCTACCTGGC CAGTTTCAGG TCGGCCACAG ATCTGGCCAC AGGCATCCAT
GAAATGCTCT TTACCCCCCG GGCAGAAGAA ATCGCTATAC AGGCCCGGAA CAAAGTGCTC
CAAAATTTTA CCAATCAACA CATCGCCAGA CAATATACAG ACCTCTACCA ATCCCTACTT
AGCAAATGA
 
Protein sequence
MKITLINTSD AGGGAPVASM RLLKALASEN IEVNFLAGDK KTSEARVQAV QNNLWQRLKA 
RFNFLYERLP FILFYEKDKS VRFAFSTANA GTSIAGHQLV KEADILHLHW TNSGFLSIRD
LTELMKLGKP VVWTLHDMWA FTGGCHYAGT CDHFTGECGD CYFLRDPAGN DISHTGWLHK
QAMYASSPNL TFVSCSNWLG KVARQSSLVK DFPVRAIANP IDIEVYAPKA RMAARTKWNI
SNNSRIILFG AANIGDRRKG IGYLVQALQH LKQHYELPLP VEVLIFGKNK HFDVSQLPFP
VHELNTISAA QDLAELYSLA DVFVMPSVED NLPNTVMESM ACGTPVVAFD TGGLPEMIDH
QLNGYLASFR SATDLATGIH EMLFTPRAEE IAIQARNKVL QNFTNQHIAR QYTDLYQSLL
SK