Gene Phep_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3804 
SymbolthrA 
ID8254938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4570585 
End bp4573032 
Gene Length2448 bp 
Protein Length815 aa 
Translation table11 
GC content45% 
IMG OID644937468 
Productbifunctional aspartokinase I/homeserine dehydrogenase I 
Protein accessionYP_003094057 
Protein GI255533685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTT TAAAATTTGG AGGTACATCT GTCGGTTCAG CTCAAAGCAT AAGTGCATTG 
ATGGATATAT TGATCAAGGG GAAGCACGCT GAGCACCCTA TCGTCGTTTT ATCTGCTATG
GGTGGTGTAA CCAACACTTT GCTGGATATG GCAGAAAGTG CCAGAAAGGG AGAGGACTAT
GCCGAAACGC TGAAGAAGGT AGAAGACAAA CACTTCGAGG TGATCCGTGC ATTGCTGCCT
GCAAGTGCTC AAAATCCGGT ACTTACCAAA CTAAAGATCT ATTTAAATGA ATTGGAAGAC
ATTCTGCAGG CAGTTTATAA CCTGAAAGAG CTGAGTTTGC AGACCAAAGA CCTGATCTTA
AGCTATGGCG AACGCTGCTC AACCATGATG GTGAGCCACA TTGCGAGACA ATATTTTGCA
AATGCCCTGT TTGTAGATGG CTCGGAACTG ATTAAAACAG ACCATAACTT CGGACAGGCG
AAAGTGAACA CTGAGCTGAC TGAAAACCTG ATCAGGGATT TTTATGCAGG CAATAACGAT
AAACTGCTTT TTGTTACCGG CTTCATTTCC AGTAACGACG AAGGGCGTAT CACTACTTTA
GGGCGTGGGG GCAGTGATTA TACAGCCGCC ATCTGGGGGG CAGCCCTTGG TGCCGATGAA
ATTGAGATCT GGACCGATGT AGATGGCATG TTAACAGCTG ATCCGAGGAT CGTTAAAAAG
GCTTTTTCAT TGCCTGAACT GAGTTATACC GAGGCTATGG AACTTTCTTA TTTTGGTGCC
AAAGTGATCT ATCCACCTAC CATGATCCCT GCCTTTTTAA AGAAAATACC TATAGTCATT
AAAAATACTT TTAATGTTGA TTTCCCTGGT ACCTATATCA GGCACAATGT GCATGCCTCC
AGCCTGCCGA TCAAAGGGAT CTCTTCGATT GATGAGATCA GTATCCTTAA CTTATCCGGC
AGCGGTATGG TTGGTAAGGC CGGATTCAGC GGAAGGCTGT TCTCTATGTT GTCCAGAGAG
CAGGTCAATG TAGTCCTCAT CACACAATCC TCATCCGAAC ACAGCATTAC CTTTGCCGTT
AAACCTGCAG ATGCCTTAAA AGCACTGGCA CTCATCAATA AAGAATTTGA ACTGGAACTT
CAGGCCCGTA AACTGGAATA CCCGGAGGTA GAAAACGGAC TTTCGGTTTT AGCGATAGTG
GGTGAAAACA TGAAGCGCAC ACCGGGTATT TCCGGAAGGT TATTCAGTGC GCTGGGCAGG
AATGGGGTAA ACATCCGCGC CATTGCCCAG GGTTCGTCAG AATATAACAT TTCCGTAATC
CTGTCACGCA GTGATCTTTC CAAAGCGGTG AACGCTGTTC ATGATGCTTT CTATGCCGAC
CTGAAAAAAA CGCTGAACGT GTTTTGTCTG GGTACCGGAA ACATTGGTAA AACCTTATTT
AAGCAATTAC AACACCAGAT GCCTTTTCTG GCCAAAAATA ACGATCTGCA GGTTAAGGTA
ATGGGGGTAA GCAATACCCG TAAAATGTAC CTGGATGCAG AGGGAATAGA TCTGAACAAT
TGGGAAGATA CCCTGAACGA AAAGGGTGAA CAGGCAGACC TGGCTGAGTT TATCAAAAAG
ATGAAGTCCA TGAACCTGGC CAATTGTGTA TTTGTAGACA ATACTGCCAG CCATAACCCG
ATCCGGCATT ACCTGGATGT ATTGCAATCG AGCATCTCGG TGGTTACCTG TAATAAAATA
GGTAATTCTG CCGAATATGA TCAGTATGTG GCTTTTAAAG AGGCGGCAAG AAAACACGGG
GTAGAGTTTT ATTATGAAAC CAATGTGGGG GCAGGACTGC CCATCATCCG TACCTTAAAG
GACCTGATGC TGAGTGGCGA CAGGATCAAC CGCATTGAAG CCATCTTGTC GGGTACCATC
TCTTATATCT TTAACAACTT TAAGGGAGAC AGGCTTTTCA GTGAAGTTGT AAAAGAGGCA
CAGGATATGG GTTATACTGA ACCTGATCCG AGGGACGACC TGAACGGTAA AGATTTTATG
CGTAAAATGC TGATCCTTGC CCGCGATGCA GGTTATGCGC TTGAAGAAAA AGATGTGGCC
ATAGAAAGTA TGCTGCCACC GGCCTGTCTG GCGGCCAGCA GTGTGGCCGA GTTTTACCAG
GAACTTGAAA ACAATGCGGC GCATTTTGAG AACTTAAAAA ATGAAGCAGC TAAAAGCAAT
AAAGTATTGA GGTATATCGG TAAGCTGGAG GATGGTAAAG TGGCCATTAC CCTGCAAATG
GTTGATGACT CACACCCTTT CTATATGCTT TCCGGCAGCG ACAACATTAT CTCCTTTACT
ACTGATCGTT ATAAGGAGCG TCCGCTGGTA GTAAAAGGAC CTGGAGCCGG TGCCGAGGTA
ACTGCTGCCG GTGTTTTTGC TGACATCATT AACATAGGTA AAAGATAA
 
Protein sequence
MNILKFGGTS VGSAQSISAL MDILIKGKHA EHPIVVLSAM GGVTNTLLDM AESARKGEDY 
AETLKKVEDK HFEVIRALLP ASAQNPVLTK LKIYLNELED ILQAVYNLKE LSLQTKDLIL
SYGERCSTMM VSHIARQYFA NALFVDGSEL IKTDHNFGQA KVNTELTENL IRDFYAGNND
KLLFVTGFIS SNDEGRITTL GRGGSDYTAA IWGAALGADE IEIWTDVDGM LTADPRIVKK
AFSLPELSYT EAMELSYFGA KVIYPPTMIP AFLKKIPIVI KNTFNVDFPG TYIRHNVHAS
SLPIKGISSI DEISILNLSG SGMVGKAGFS GRLFSMLSRE QVNVVLITQS SSEHSITFAV
KPADALKALA LINKEFELEL QARKLEYPEV ENGLSVLAIV GENMKRTPGI SGRLFSALGR
NGVNIRAIAQ GSSEYNISVI LSRSDLSKAV NAVHDAFYAD LKKTLNVFCL GTGNIGKTLF
KQLQHQMPFL AKNNDLQVKV MGVSNTRKMY LDAEGIDLNN WEDTLNEKGE QADLAEFIKK
MKSMNLANCV FVDNTASHNP IRHYLDVLQS SISVVTCNKI GNSAEYDQYV AFKEAARKHG
VEFYYETNVG AGLPIIRTLK DLMLSGDRIN RIEAILSGTI SYIFNNFKGD RLFSEVVKEA
QDMGYTEPDP RDDLNGKDFM RKMLILARDA GYALEEKDVA IESMLPPACL AASSVAEFYQ
ELENNAAHFE NLKNEAAKSN KVLRYIGKLE DGKVAITLQM VDDSHPFYML SGSDNIISFT
TDRYKERPLV VKGPGAGAEV TAAGVFADII NIGKR