Gene Phep_1512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1512 
Symbol 
ID8252613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1794842 
End bp1795981 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content44% 
IMG OID644935166 
ProductDAHP synthetase I/KDSA 
Protein accessionYP_003091788 
Protein GI255531416 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.659236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAC AATTGAACAT CCAGCCATTA AATACATGGC TTAACATCAA CAATGAACCG 
CTCATCATTT CAGGGCCCTG CAGTGCAGAA ACTGAAGAAC AACTGTTAAC CACCGCACAC
TTACTTGCAG CAACAGGAAA GGTATCTGTA TTAAGGGCCG GTATCTGGAA ACCACGCACC
CGTCCGGGAG AATTTGAAGG TATAGGCAGC ATTGGTCTGG AATGGCTGAA AAGAGCAAAA
GCAGAAACCG GCCTGCCTAC TGCCGTAGAA GTTGCAAATG CTAAACACGT GGAAGAGGCC
CTGGCTGCAG GAGTAGATAT TTTATGGATC GGCGCACGTT CTACTGTTAA CCCCTTTACT
GTTCAGGAAA TTGCTGATGC TTTAAAAGGA GTGGATATCC CTGTGTTGGT AAAAAACCCG
GTAAACCCTG ACCTGCAATT GTGGATTGGT GCTTTAGAGC GCATCAACGG TGCTGGTATT
ACTAAATTAG GTGCCATTCA CCGCGGCTTC TCTTCCTTCG AAAAGAGTTC TTTCCGTAAC
GAACCTATGT GGGAGCTTGC CATTCAATTG AAAACTTTAT GTCCAGAACT GCCGATCATC
AACGATCCAA GCCATATCTG CGGTAACCGT GAACTGATCC CTTACATCTC TCAAAAAGCA
TTGGACCTGG ATATGCAAGG CTTAATGATC GAGTCGCACG TAGATCCTTC AGTGGCATGG
ACAGATGCAA AACAACAGGT TACCCCCGCT GCTTTAGCTG AACTGGCTGA CCGTTTAACT
GTTCGTGAAC CAGAATCTAA AAATGAAGCG ATTACAGATC AACTGGCCGA ATTCCGTAAA
CAAATTGACA AAATTGACGA CCTGTTGTTA CAGAAACTGG GTGAGCGTAT GGCCATAGTA
GGCAAAATCG GTGAATACAA ACGCGATAAC CAGGTAACCA TTTTACAGGT TAACCGTTGG
GATGCCATTA TTAAAAAAGG TGCCTCATTT GCAAAAGCCT TAAAACTGGA TTTAAACTTT
ACAGAAAAAT TCCTGGAACT CGTTCATGGA GAATCTATCC GCAAACAAAC TGAGATCATG
AATGCTGGTA AAGCAGAAAA AGGCATTGCA GCAGAAGCAC ATACAGAAGT TAAATCTTAA
 
Protein sequence
MKLQLNIQPL NTWLNINNEP LIISGPCSAE TEEQLLTTAH LLAATGKVSV LRAGIWKPRT 
RPGEFEGIGS IGLEWLKRAK AETGLPTAVE VANAKHVEEA LAAGVDILWI GARSTVNPFT
VQEIADALKG VDIPVLVKNP VNPDLQLWIG ALERINGAGI TKLGAIHRGF SSFEKSSFRN
EPMWELAIQL KTLCPELPII NDPSHICGNR ELIPYISQKA LDLDMQGLMI ESHVDPSVAW
TDAKQQVTPA ALAELADRLT VREPESKNEA ITDQLAEFRK QIDKIDDLLL QKLGERMAIV
GKIGEYKRDN QVTILQVNRW DAIIKKGASF AKALKLDLNF TEKFLELVHG ESIRKQTEIM
NAGKAEKGIA AEAHTEVKS