Gene Phep_1567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1567 
Symbol 
ID8252669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1850599 
End bp1851774 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content43% 
IMG OID644935221 
Producthypothetical protein 
Protein accessionYP_003091842 
Protein GI255531470 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.405342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.475535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATCA ATTTATTTGA TGATAATGCC TGGTTATCCC TGCGTCCATT ATCTTTTACC 
AGACCAGTTG CAGATCTTCG TGTAGGAATC CTCACGATTG CTGAAAAGTG GAAGAAACAT
CTGAATGTAG CATCTGGCTT CATTACCGCC GAACACCTGG CCGTTAAATA TCCTCCGCTT
AATGGAGTCC AGCTCTATAT CAATGGCTCA ATTTGTCCTG ATGAGGCACT GCTGGAAGCG
ATCTCAGCAC TGCAAACCGG TGAGGCCTTA AAAAAAGAGG GTATTTTAAT TGCCTGTAAA
ATTGATGCCG GTACAGCTTT TATACCTGAT ATCGACGCGC AGTTGGAAAT CAAAATATAT
CAGGGAAAAT TTATCAGGAT CTCTTTACCC GAAGATATAT TCAGGAATAA TGATGCTGAA
CTGAAAAAAG ACTTTGCTTT ACTGACCCAG GGGCGGGCTT CAGCTAAACT GAGCAGTACA
AATGTTTTTT TAGGTGATGA ATTTTTTGCA GAAGAGGGGG CACAAGCCGA ATGTTCTACT
TTTAACAGCC TGAACGGGCC CATTTATATA GGAGAGAATT CGCAAGTGTG GGAAGGCTGT
CACATTCGCG GATCTTTTGC ACTTTGCAAC AATTCGCAGG TAAAAATGGG AGCTAAAATC
TACGGACAAA CTACCATAGG CCCCTATAGT CGGGTAGGTG GCGAAATTAA CAATGCCATC
ATCTGGGGCT ATTCTTCCAA AGGACATGAA GGCTACCTGG GTAATGCTGT ACTGGGGCAA
TGGTGTAACA TTGGTGCCGA CAGTAACAAT TCTAACTTAA AGAACAACTA TGCTGAGGTT
AGGTTATGGG AGTATGCAAC AGAAAGTTTC CGTAATACCG GTTTACAATT TTGCGGACTG
ATCATGGCCG ATCATGCCAA ATGCGGCATC AATACGATGT TTAATACCGG AACAGTTGCC
GGTGTGAGTG CCAATATCTT TGGCTCCGGC TTTCCCAGAA ACTTTATTCC CGATTTCGCC
TGGGGTGGGG CACATGGATT TGATGTGTAT AGCCTGAATA AAATGTTTGA AACATCAGAG
AAAGTATACG AACGTAGAGA TATCTCGTTT GACCAGACAG AGCAGGATAT TTTATCTGCT
GTTTTTGAAA TGACCAAAAG CTACAGGCAC TTTTAA
 
Protein sequence
MMINLFDDNA WLSLRPLSFT RPVADLRVGI LTIAEKWKKH LNVASGFITA EHLAVKYPPL 
NGVQLYINGS ICPDEALLEA ISALQTGEAL KKEGILIACK IDAGTAFIPD IDAQLEIKIY
QGKFIRISLP EDIFRNNDAE LKKDFALLTQ GRASAKLSST NVFLGDEFFA EEGAQAECST
FNSLNGPIYI GENSQVWEGC HIRGSFALCN NSQVKMGAKI YGQTTIGPYS RVGGEINNAI
IWGYSSKGHE GYLGNAVLGQ WCNIGADSNN SNLKNNYAEV RLWEYATESF RNTGLQFCGL
IMADHAKCGI NTMFNTGTVA GVSANIFGSG FPRNFIPDFA WGGAHGFDVY SLNKMFETSE
KVYERRDISF DQTEQDILSA VFEMTKSYRH F