Gene Phep_3251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3251 
Symbol 
ID8254370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3856289 
End bp3857500 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content45% 
IMG OID644936904 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_003093508 
Protein GI255533136 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.947715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0046378 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGTTATAG AATATACATT AGAAAAATTA AAAGACCTTC AAGGGTTTTA TCAAAAACAA 
TTATTGGATG ATACTGTGCC ATTTTGGTTC CCAAGATCTA TAGATAGGGA ATTTGGAGGG
TATTTACTTA TGCGTGACCA GGACGGAAGC CTGATAGACG ACGATAAAGC TGTATGGATA
CAAGGGCGTG CCGCCTGGTT GCTGTCAACT TTATACAATA CTGTTGAACA AAAGCAGGAA
TGGCTGGACG GTGCGAAATC CGGAATAGAT TTTTTGAACC GGCATTGTTT TGATACCGAC
GGGCAGATGT TTTTCCATGT TACCCGCGAT GGACAGCCCA TCCGTAAGCG CCGCTATTAT
TTTTCGGAAA CCTTTGCGGT AATTGCCAAT GCAGCGTATG CCAAAGCCAG CGGGGATGAG
GCTGCAGCTA AACAAGCCCG TTACCTCTTT GGCAAATGTA TTGAATATTC CACCAATCCG
GGATTATTAC CTCCAAAATA TACCGGTACC AGGCCCGCTA AAGGGATTGG GGTGCCCATG
ATCATGATGA ATACGGCACA GCAACTCCGT GAAACGATAG GTGATCCGCG TTGTGATGAA
TGGATCGATA AATGGATCAA TGAAATTGAA ACCTATTTTG TGAAGGATGA CATCAGGTGT
GTAATGGAAC AGGTTGCTCC CGATGGTAGT ATCATTGACC ACATCGATGG CCGTACCTTA
AACCCCGGAC ATGCGATTGA AGGGGCCTGG TTTATCCTTC ACGAAGCAAA ATACAGGAAC
AATGATCCCA GGTTAATTAA ACTCGGCTGC AAAATGCTGG ATTACATGTG GGACCGCGGC
TGGGACAAAG AGCACGGCGG GATTTTATAT TTCCGGGATG TGTACAACAA GCCTGTGCAG
GAGTACTGGC AGGATATGAA ATTCTGGTGG CCCCATAATG AAGTCATAAT CGCAACGCTA
CTGGCTTATA CCATAACAGG AGAGGAGAAA TATGCACAAT GGCACAAACT GGTACACGAG
TATGCTTACC AGCATTTTCA CGACGCAGCA AACGGAGAGT GGTTTGGTTA TCTGCATAAA
GACGGGACCC TGGCCCAAAC TGCAAAAGGA AATTTGTTTA AAGGCCCTTT TCATTTGCCA
AGACAGGAAT GGTATTGCAT GACTTTGTTA AATGAATATC TGCAGCAATC TGCTTCCTAT
ACGGCTCAAT AA
 
Protein sequence
MVIEYTLEKL KDLQGFYQKQ LLDDTVPFWF PRSIDREFGG YLLMRDQDGS LIDDDKAVWI 
QGRAAWLLST LYNTVEQKQE WLDGAKSGID FLNRHCFDTD GQMFFHVTRD GQPIRKRRYY
FSETFAVIAN AAYAKASGDE AAAKQARYLF GKCIEYSTNP GLLPPKYTGT RPAKGIGVPM
IMMNTAQQLR ETIGDPRCDE WIDKWINEIE TYFVKDDIRC VMEQVAPDGS IIDHIDGRTL
NPGHAIEGAW FILHEAKYRN NDPRLIKLGC KMLDYMWDRG WDKEHGGILY FRDVYNKPVQ
EYWQDMKFWW PHNEVIIATL LAYTITGEEK YAQWHKLVHE YAYQHFHDAA NGEWFGYLHK
DGTLAQTAKG NLFKGPFHLP RQEWYCMTLL NEYLQQSASY TAQ