Gene Phep_3983 details

Gene Information

Locus tag: Phep_3983
Symbol:
ID: 8255117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4805951
End bp: 4807144
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 40%
IMG OID: 644937647
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_003094236
Protein GI: 255533864
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
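
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4805951
end_bp = 4807144

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1194, matches "Gene Length"
print(protein_length)   # 397, matches "Protein Length"
```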


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.192648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAA TTCTTATACA AGAGTTTGAA AAGGAACTGG CCGGTATGCT GAACTACTGG 
GCTACATATA CAGTAGACAA AGTGCATACA GGCTTTTATG GACAAGTTGA TAATGACAAT
AATGCTATTG AAAAAGCAGA TAAAGGTGCT GTGTTAAACT CCCGCATCCT TTGGTTTTTC
TCTGCTGCCT ATAATTACAA GAGCTGGCCC GAAAGTTTAC ATCTGGCCAG AAGAAGCTAT
GATTATATAA GAGACCACTT TGTGGACAAG CAATTTGGAG GAGTATATTG GTCTGTAGAT
TTTCAGGGCA AACCAGCAGA TACCAAAAAA CAGATGTATG CACTAGCGTT TGCCCTGTAT
GGGATAACAG AGTTTTATAA GGCTAGCAGA GAGGAAGAGG CATTGGCCCT GGCTAAAATT
CTTTATGCTG ATATAGAGAA ACATAGCTTC GACCCCATAA ATAATGGATA TTTTGAAGCT
TTTTCCTGCC AGTGGTCAGA ATTGACAGAC CAACGTTTAA GTGATAAGGA TGCCAACGAA
AAAAAGACTA TGAATACACA CCTTCATGTT TTAGAGGCAT ATACCAGTCT TTACACTGTT
TGGCCCGATG AGGGATTGGG CAGGCAGATC CGTAATTTGC TTGGTGTTTT TACAGATAAG
ATTATTGACA GAGATACCCA TCATCTGATG CTTTTTTTTG ACGAAAGCTG GCATTCAAAA
TCCCGGGCAA TTTCTTTTGG ACACGATATA GAGGCTTCCT GGCTTTTGCT GGAAGCAGCG
GAATCTTTGG GAGATGAAAA CCTGATCTGG CAATTTAAAG ATGTAGCAGT AAAGATGGCC
ATGGCATCTA TTCAAGGGCT AGATGAAAAT GGAGGGCTAA ACTATGAGTT TGAACCATCC
AATTGGAGCA GAGAAAAACA TTGGTGGGTA CAAGCTGAAG CCATGGTAGG TTTTTTTAAT
GCTTTTCAGC TTACGAAAGA ACAGACCTAC TACGATAAAT TTTTAAAATG CTGGGAGTTT
ACAAAAGCGC ACATTATCAA TACACAAAAA GGAGAGTGGT TTTGGGGTGT AAATGAAGAT
CTTTCCTTGA TGCCTGAACA ATATAAAGTT GGTTTATGGA AGTGCCCATA TCATAATGGC
AGGGCCTGCT TGGAGATGAT ACGCCGCCTT GGTGTCAATT TCGATTTTTC CTGA
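
The GC content field is the percentage of bases in the gene sequence that are G or C. A minimal sketch of that calculation follows; the helper name `gc_percent` is hypothetical, and how the source database rounds the value is not stated in this record. Pasting the full gene sequence into `gene_sequence` gives a figure to compare against the reported 40%; only the first row is used here for brevity.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First row of the gene sequence, in the 10-base groups used by this record.
gene_sequence = "ATGAGCGAAA TTCTTATACA AGAGTTTGAA AAGGAACTGG CCGGTATGCT GAACTACTGG"
print(f"{gc_percent(gene_sequence):.1f}% GC")
```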
 
Protein sequence
MSEILIQEFE KELAGMLNYW ATYTVDKVHT GFYGQVDNDN NAIEKADKGA VLNSRILWFF 
SAAYNYKSWP ESLHLARRSY DYIRDHFVDK QFGGVYWSVD FQGKPADTKK QMYALAFALY
GITEFYKASR EEEALALAKI LYADIEKHSF DPINNGYFEA FSCQWSELTD QRLSDKDANE
KKTMNTHLHV LEAYTSLYTV WPDEGLGRQI RNLLGVFTDK IIDRDTHHLM LFFDESWHSK
SRAISFGHDI EASWLLLEAA ESLGDENLIW QFKDVAVKMA MASIQGLDEN GGLNYEFEPS
NWSREKHWWV QAEAMVGFFN AFQLTKEQTY YDKFLKCWEF TKAHIINTQK GEWFWGVNED
LSLMPEQYKV GLWKCPYHNG RACLEMIRRL GVNFDFS
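
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the standard codon table by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted, so the standard assignments are sufficient here. The function names are hypothetical, and only the first and last rows of the gene sequence are translated (both rows begin on a codon boundary because every full row holds 60 bases).

```python
# Standard codon assignments in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used to group bases."""
    return "".join(sequence.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

first_row = "ATGAGCGAAA TTCTTATACA AGAGTTTGAA AAGGAACTGG CCGGTATGCT GAACTACTGG"
last_row = "AGGGCCTGCT TGGAGATGAT ACGCCGCCTT GGTGTCAATT TCGATTTTTC CTGA"

print(translate(first_row))  # MSEILIQEFEKELAGMLNYW
print(translate(last_row))   # RACLEMIRRLGVNFDFS
```

The two outputs match the first 20 and last 17 residues of the protein sequence listed above. Passing the complete gene sequence to `translate` in the same way should yield the full protein.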