Gene Phep_3824 details


Gene Information

Locus tag: Phep_3824
Symbol:
ID: 8254958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4588403
End bp: 4590382
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 45%
IMG OID: 644937488
Product: dehydrogenase E1 component
Protein accession: YP_003094077
Protein GI: 255533705
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
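
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4588403, 4590382)
print(length)                     # 1980
print(protein_length_aa(length))  # 659
```

Both values agree with the Gene Length and Protein Length fields of this record.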


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.200241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.786023
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCG ACAGAAAAGA TAAGGACAAT GTCACGCTGT TAAATTTTTA TTTACAACTC 
CTATATCCCA GGATGGTGGA GGAAAAGATG TTGATCTTAT TGCGTCAGGG CCGTATTGGA
AAATGGTTTT CTGGTATTGG ACAGGAAGCC ATAGCAGTGG GCAGTACCAT GGCGATGAAA
GCCTCGGAGT ACATCCTTCC CATGCACAGA AATTTAGGGG TCTTTACTTC AAGGAATGTA
TCCTTAACCA AACTGATGGC CCAATGGCAG GGCAAAGCTA CTGGTTTTAC AAAAGGCCGC
GACCGTTCCT TTCATTTCGG AACCCAGGAA CACAAAATCA TTGGAATGAT ATCCCATTTA
GGGCCTCAAA TGGCACTTGC CGACGGTATT GCCCTTGCAG ATGTACTGAC GGGCAAGCAA
CAAGTTACGC TGGTATTCAC AGGAGAAGGC GCGACCAGTG AGGGCGATTT TCATGAGGCC
GTTAATGTAG CCGCAGTATG GGACCTGCCA GTTATTTTTT TAATTGAAAA CAATGGCTAT
GGCCTATCAA CGCCTGTTAA CGAACAATTT AGATGTAAGA ACCTGATAGA TAAAGCCATA
GGTTATGGCA TCGAAGGATT TAAAGTGGAT GGAAATAACA TCCTTGAAGT ATATGACCTG
ATTGACAGGG TTGCCTGCAG AATGCGGGAA AACCCTAAAC CTGTACTGAT AGAATGCCTT
ACTTTTAGAA TGCGTGGTCA TGAAGAGGCC TCGGGCACAA AGTATGTTCC CCAGGAGCTT
TTAGATGAGT GGGAAAAAAA AGATCCTGTT AAAAATTTCG AACAGTATCT GCTCGATCAA
AAAATACTTA ATCCGGATTC AATTGAGGAA ATAAAAGCCA GCCTGAAAAC TGAGATTGAT
AATGAGGTTG AAAATGCTTT TAATGAAGCC GATCCGCTTG CTGATTCTGA AAGGGAATTG
AAAGCCATGT ATTTCCCTTA TGCTGGTACA AGCACAGCAC CGGATAGTAC TATTGCAAAA
GATATCCGTT ACATTGATGC CATTTCTGAT GGTCTCCGTG TAGCCATGCG AAGGCACAAC
AACCTGGTAC TAATGGGCCA GGATATTGCG GAATATGGTG GTGCCTTTAA AATAACTGAT
GGTTTTGCCG AGGAATTTGG CAAAGCGCGG GTACGGAACA CCCCTATCTG CGAATCGGCC
ATTGTAGGCG CAGCGCTTGG CCTTTCCATA AACGGGTATA AGGCCATGGT GGAAATGCAG
TTTGCCGACT TTGTGACCTG TGGTTTTAAC CAGATTGTAA ATAATCTGGC TAAAACACAT
TACCGCTGGG GAGAAAAAGC AGATGTGCTG ATCCGTATGC CTACCGGTGC TGGTACAGGT
GCTGGTCCTT TTCATTCACA AAGCAATGAA GCCTGGTTTA CTAAAACACC TGGGTTAAAG
GTGGTGTACC CGGCCTTTCC CGCAGATGCC AAAGGCCTGC TCCTTGCCGC CATAGAAGAT
CCAAACCCGG TCATGTATTT TGAACATAAA TACCTGTACC GTTCTCTGCA CGGACTTGTA
CCGGAAGGCT TTTATACCCT GGAAATAGGG AAAGCAAACG TCTTGAGACG TGGTGAACAA
TGCTGCATCA TTACTTATGG TTTAGGTGTT CATTGGGCAA TGTCCTACCT GGACCAAAAT
CCGGACCTTT CTGTTACACT GGTAGATTTG CGAAGTTTGC AGCCCTGGGA CAAAGAAACT
GTGGCCAGTG CCGTTAAAAC AACCGGACGG GTGCTTATTT TACATGAAGA CACACTGTGT
TCTGGTTTTG GTGCTGAACT TGCCGCCTGG ATTTCCGAGC ATTGCTTTAA ATACCTGGAT
GCTCCGGTAA TGCGCTGCGC CAGTAGTGAT ACAGCAATTC CCATGAATAA GGTACTGGAA
GATAGCTTTC TTGCAAAAAG TCGTTTAAAG GGTTCGATCA GACGTTTACT CGCCTATTAA
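
The GC content field of this record can be recomputed from the gene sequence as displayed. The following is a minimal sketch, assuming the sequence is pasted with the 10-base spacing and line breaks shown above; `clean_sequence` and `gc_content_percent` are hypothetical helper names.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence, used here only as a small example.
print(gc_content_percent("ATGACCTTCG"))  # 50.0
```

Passing the full gene sequence to `gc_content_percent` should give a value that rounds to the percentage listed in the record.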
 
Protein sequence
MTFDRKDKDN VTLLNFYLQL LYPRMVEEKM LILLRQGRIG KWFSGIGQEA IAVGSTMAMK 
ASEYILPMHR NLGVFTSRNV SLTKLMAQWQ GKATGFTKGR DRSFHFGTQE HKIIGMISHL
GPQMALADGI ALADVLTGKQ QVTLVFTGEG ATSEGDFHEA VNVAAVWDLP VIFLIENNGY
GLSTPVNEQF RCKNLIDKAI GYGIEGFKVD GNNILEVYDL IDRVACRMRE NPKPVLIECL
TFRMRGHEEA SGTKYVPQEL LDEWEKKDPV KNFEQYLLDQ KILNPDSIEE IKASLKTEID
NEVENAFNEA DPLADSEREL KAMYFPYAGT STAPDSTIAK DIRYIDAISD GLRVAMRRHN
NLVLMGQDIA EYGGAFKITD GFAEEFGKAR VRNTPICESA IVGAALGLSI NGYKAMVEMQ
FADFVTCGFN QIVNNLAKTH YRWGEKADVL IRMPTGAGTG AGPFHSQSNE AWFTKTPGLK
VVYPAFPADA KGLLLAAIED PNPVMYFEHK YLYRSLHGLV PEGFYTLEIG KANVLRRGEQ
CCIITYGLGV HWAMSYLDQN PDLSVTLVDL RSLQPWDKET VASAVKTTGR VLILHEDTLC
SGFGAELAAW ISEHCFKYLD APVMRCASSD TAIPMNKVLE DSFLAKSRLK GSIRRLLAY
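
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the sequence is given 5' to 3' in the coding frame. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; this gene begins with ATG, so alternative start codons are not handled here. The `translate` function is illustrative and is not a library call.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCTTCG ACAGAAAAGA TAAGGACAAT"))  # MTFDRKDKDN
```

The output matches the first ten residues of the protein sequence listed above.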