Gene Phep_1514 details


Gene Information

Locus tag: Phep_1514
Symbol:
ID: 8252615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1797205
End bp: 1798269
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 40%
IMG OID: 644935168
Product: 3-dehydroquinate synthase
Protein accession: YP_003091790
Protein GI: 255531418
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
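
The gene and protein lengths listed above agree with each other once the stop codon is accounted for: 1065 bp is 355 codons, and the final codon of the gene sequence (TAA) is a terminator, leaving 354 residues. A minimal Python sketch of that check follows. The function name is hypothetical, and it assumes an unspliced CDS whose stated length includes the stop codon.

```python
def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


print(expected_protein_length(1065))  # 354
```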


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.628425
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT TGGAGAGCGC AGGTCATAGC ATTTATTTTG AAACAGAACT GGCACCTTTA 
ATGAAGGTAA TTGAAGCAGA AAAATACAGT AAAATCTTTG TTTTTGCGGA TACACATACG
TCAGAACTGT GTTTACCATT GTTCAGGGAA ATGATGGACG ATTTTAATGG TTTTGACCTG
ATAGAAACTG ATCCTGGGGA AGAAAACAAG AATATTGATT TTTGCATCGG CATCTGGAAG
ACCTTACTTG ATTTTGGTGC CGACCGTAAA TGCCTGATGG TTAACCTGGG TGGTGGCGTA
ATTACCGATA TGGGAGGCTT TGTGGCCTCA ACCTATAAAA GAGGGATAGA CTTTATCAAT
ATCCCTACTA CCCTATTGTC GCAGGTAGAT GCTTCTGTTG GCGGTAAAAC CGGGATTGAT
GTAGATAATG TGAAGAACAT GGTGGGTACT TTTACCCTAC CGCAGTCGGT TTTTATTGAA
ACTAAATTTT TAAAAACCCT GCCACAACGG GAGCTGCTTT CCGGCTTTGC CGAAATGATC
AAACATGGTT TAATTGTAGA CCGTACCTAT TATAATGATT TGAAGAGTAG CAATTACCTC
CAGATTTCGG CTCAGGCCAT TTACCGTTCT GTAGAGATCA AGAATGAAGT AGTTACGGAA
GATCCACATG AAAAAGGACT AAGAAAGATC CTGAATTACG GACATACCAT CGGACATGCA
GTTGAAACCT ATTCCCTGAT CAACGATACC CAGCCACTTA CACATGGCGA GGCCATTGCT
GTGGGTATGA TCTGTGAGGC TTTCCTTTCT TCAAACAACA ATACCTTATC AGCTGATGAT
TTAAAAGACA TTACTGATTA TATCAGTACG CTTTATCCGG CTTACAGGAT TAAAGAAGAC
AGTTTTAAGC AATTGCTGGA GTTTATGCAA AGTGACAAGA AAAACGAGAA CGGACAGATC
ATGTTCTCTT TGCTCAGTAC GATTGGCAAA TGCGATTACA ATTGCAGGGT ATCGGAAAAA
GACATTCTGG AAAGCTTTGC TTACTTTAAC CGCATTTATA GTTAA
 
Protein sequence
MKKLESAGHS IYFETELAPL MKVIEAEKYS KIFVFADTHT SELCLPLFRE MMDDFNGFDL 
IETDPGEENK NIDFCIGIWK TLLDFGADRK CLMVNLGGGV ITDMGGFVAS TYKRGIDFIN
IPTTLLSQVD ASVGGKTGID VDNVKNMVGT FTLPQSVFIE TKFLKTLPQR ELLSGFAEMI
KHGLIVDRTY YNDLKSSNYL QISAQAIYRS VEIKNEVVTE DPHEKGLRKI LNYGHTIGHA
VETYSLINDT QPLTHGEAIA VGMICEAFLS SNNNTLSADD LKDITDYIST LYPAYRIKED
SFKQLLEFMQ SDKKNENGQI MFSLLSTIGK CDYNCRVSEK DILESFAYFN RIYS
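
The sequences above are printed in blocks of ten characters, so spaces and line breaks have to be stripped before any computation. The Python sketch below shows one way to do that, then computes GC content and translates the reading frame. The helper names (`clean_sequence`, `gc_content`, `translate`) are illustrative and do not belong to any database tool. The codon table is the standard one; this rests on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 of a cleaned sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence shown above.
first_line = clean_sequence(
    "ATGAAAAAAT TGGAGAGCGC AGGTCATAGC ATTTATTTTG AAACAGAACT GGCACCTTTA"
)
print(translate(first_line))  # MKKLESAGHSIYFETELAPL
```

Passing the full gene sequence through `clean_sequence` and `translate` should give a string that can be compared against the cleaned protein sequence, and `gc_content` can be compared against the GC content listed in the gene information.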