Gene Phep_1358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1358 
Symbol 
ID8252458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1614874 
End bp1616814 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content46% 
IMG OID644935012 
ProductNHL repeat containing protein 
Protein accessionYP_003091635 
Protein GI255531263 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.967304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATT TTAATCTTAC CATAAGCAGA ACAGCACTGG TTTTTCTTAT TGTTTCTGTC 
ATACATTTCA GTTGTACAAA ACAGATTGAT CAGGTCCTGT TAAAAGAGAA ACCGATTCGC
AGTGGATCGG CAGCTGTAAC TACACTGTCC TACTCCAGCC TGACACAAAA ACTGATAGAT
GAAACGAGTC TTATCGGCAC AGTGATTTCA GATGATGAAA TCACTGTTGC GCCAGGTGTA
ACAGAAACAG ATATACATTA TACCGATACC GCCGGCAAAG CCATGCACCT GTTTATTCTG
AAAGTTAACC TCAACGAACC CCAGGTATTC ATGGAAGTAG CTACACCTTT TAATCTTCCG
GCATATGCCA GGCAAACCGT ACCTGCACAG GCGGCCGAAA TTGATACCGC TACCCATATG
GTGATAGCAG GCATAAACGG CGATTTTTTT GATACCAGCA CCGGTATTCC TATGGGTATT
GTACATAAAA ATGGTAGCAT CGTCAAAAGC ACTTTTAATG ACAATACCCT GAAACCCCAG
CAGGCAGTCA GTTTCTTTGG CGTAACAGAA AATAATGTTC CGATTATCGA TTTTAAAAGT
GGTTATGCCG CCCTCAGCAG CCAGCTTTAC AACAGTACTG GCAGCGGAGT AATGCTGGTA
AACAATCATC TTCCTGTTTC ACAACCTTAC ACGGCAATCG ATCCCAGAAC ATCAGTTGGA
TATGATGACA ACGGCATCGT ATATTTTGTA GTAATAGATG GACGAGACGC CCCCTATTCC
AATGGAATGA ACTATGCACA GCTTACCAGT GCTTTTATGG CTTTTAATGT AAAAAATGCT
GTAAACCTGG ACGGCGGCGG CTCCTCTACT TTTATGACCA GAAACCCGGT AACCAATTTA
TTACAGGTAA GGAATCAGCC TTCAGACGGT ACCGCACGTG CCGTTGCCAA TGCCTGGCTG
GTCTATATCA GTAAGGTGCT GGTCAGCAAT TACGCTGGTA CAGGAACTGC CGGCTTAGTG
AACGGAGCTA AAGCCAGTGC CCGTTTCGAT AGCCCCGAAG GTCTTGCTAT TGATGCATCA
GGTAATATGT ACATTGCAGA TAAAAACAAC AATGTGATCC GGAAAATCAC TTCTACCGGA
ACAGTGAGTA CCTTTGCAGG TACCGGAGTG GCGGGCTTTG CAGATGGGGC CGGCAGTATA
GCTAAATTTA ACGGACCATG GAAAGTTGCT GTCGATGCAA CAGGCAATGT ATACGTCGCA
GACAGGGACA ACTTTAAGAT CAGGAAGATC ACTCCGGCAG GTATCGTAAG CACACTTGCC
GGAAGTACAG CCGGTTATGC AGATGGAACA GGGAGTGCCG CTAAGTTTAT GCAACCGCTT
GATGTGGCCA TTGACCCCTC TGGCAACGTA ATTGTGGCCG ACAATACCAG CCACCGCATC
CGCAAAATAA CAGCAGCAGG TGTGGTAACT ACAATTGCCG GAAACGGAAC AGCAGGTTAT
ACCAATGGAA CAGGCACAGC TGCACAATTT AAAAACCCAT CAGGTGTAGA TGTCGACGCA
TCCGGAAATA TTTATGTTGC CGATCGTTTA AACCATCGGA TCAGAAAGAT CACCACATCG
GGAGTAGTCA GTTCCTTAGC TGGCACCGGA ACTTCGGGCA CTACAGATGG CGCGGCTGGT
TCAGCCAAAT TTTCAGACCC TTATGGTGTT ACTGTCGATG TATCCGGAAA TGTGTATGTA
GCAGACCTGA TCAGTTCAAG GATCAGAAAA ATATCATCCG GCCAGGTTAG CACCTTAGCC
GGAACTATAC CTGGTTATCA AAATGGAACA AGCACAATAG CCAAATTCAA TCAGCCTACC
GATCTGGTTA TCCAGGGGTC GAACATCTAT ATAGCAGACC ATTCCAACAA CAGCATCCGT
CTGGTCAAAT TAATCAATTA A
 
Protein sequence
MKYFNLTISR TALVFLIVSV IHFSCTKQID QVLLKEKPIR SGSAAVTTLS YSSLTQKLID 
ETSLIGTVIS DDEITVAPGV TETDIHYTDT AGKAMHLFIL KVNLNEPQVF MEVATPFNLP
AYARQTVPAQ AAEIDTATHM VIAGINGDFF DTSTGIPMGI VHKNGSIVKS TFNDNTLKPQ
QAVSFFGVTE NNVPIIDFKS GYAALSSQLY NSTGSGVMLV NNHLPVSQPY TAIDPRTSVG
YDDNGIVYFV VIDGRDAPYS NGMNYAQLTS AFMAFNVKNA VNLDGGGSST FMTRNPVTNL
LQVRNQPSDG TARAVANAWL VYISKVLVSN YAGTGTAGLV NGAKASARFD SPEGLAIDAS
GNMYIADKNN NVIRKITSTG TVSTFAGTGV AGFADGAGSI AKFNGPWKVA VDATGNVYVA
DRDNFKIRKI TPAGIVSTLA GSTAGYADGT GSAAKFMQPL DVAIDPSGNV IVADNTSHRI
RKITAAGVVT TIAGNGTAGY TNGTGTAAQF KNPSGVDVDA SGNIYVADRL NHRIRKITTS
GVVSSLAGTG TSGTTDGAAG SAKFSDPYGV TVDVSGNVYV ADLISSRIRK ISSGQVSTLA
GTIPGYQNGT STIAKFNQPT DLVIQGSNIY IADHSNNSIR LVKLIN