Gene Phep_2474 details


Gene Information

Locus tag: Phep_2474
Symbol:
ID: 8253581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2868476
End bp: 2870176
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 46%
IMG OID: 644936124
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_003092740
Protein GI: 255532368
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
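
The length fields above follow from the coordinates. As a minimal sketch, assuming 1-based inclusive coordinates (a common convention for such records, not stated on this page) and a CDS whose final codon is a stop codon, the arithmetic can be checked as follows. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2868476, 2870176)
print(length_bp, protein_length(length_bp))  # 1701 566
```

Both values agree with the Gene Length and Protein Length fields of this record.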


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.826754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCA ATACAGATTT AAAGAAAGAA AATACATACG ATGCTATCGT CATAGGATCG 
GGGATCAGTG GCGGCTGGGC TGCAAAAGAA CTTACAGAAA AAGGCCTCAG GGTGCTGATG
CTGGAACGGG GAATGAACAT AGAGCATGTG AAGGACTATG ATTCTGCCAT GAAAAACCCA
TGGGAGTTCC AGCATGCAGG CAAACTCACC GAAGAGCAAA AAAGAACTCA CCCTGTTCAG
AAAAGGGACT ATCCTTATCA GGAAGCAAAT GAAAAATGGT GGGTAAACGA CCTGGAATGC
CCTTATACTG AAGACAAGCG TTTCGACTGG TACCGCGGGT TCCATGTGGG TGGCAAATCC
TTAATGTGGG GCCGTCAGAG CTACCGGTTC AGCGACCATA ATTTTGAGGA CAATGCCAGG
GATGGGCATG GAAACGACTG GCCCATCCGC TATAAAGATA TTGCCCCATG GTACGATTAT
GCCGAACGTT TTGCCGGTAT CAGCGGTCAG GCCGAAAACT GGCCATTATT ACCCGACGGA
CAGTTCCTGC CACCTATGGA CCTGAACTGT GTAGAGAAAT CTGTTAAAAA ACGCATTGAA
GAGCATTACA AAAGAACACG GATCATGACC ATTGGCCGTG TAGCAAATTT AACTGTTCCG
CATAAAGGGC GTGGCAATTG CCAGTACCGT AACCTCTGCA GTCGTGGCTG TCCTTTTGGT
GCTTATTTCA GTACACAATC TTCTACCTTA CCGGCAGCCC AGGCTACAGG CAGGCTCACC
CTAAGGCCTT ATTCCATCGT TAACCATATC ATTTACGATA AAAACACCAA AAAGGCCAAA
GGTGTAATGG TGATAGATGC GGAAACGCAG AAAACAATGG AATTCTATGC AAAAATTGTT
TTTGTAAATG GTTCTACCTT AGGTTCCACA TTTATCCTTT TGAACTCTAC TTCCGAAGCA
CACCCTAACG GTCTGGGCAA TGGAAGCGGA CAACTGGGCC ATAACCTGAT GGACCACCAT
TTCCGTTGCG GTGCATCGGC TGAAGCGCTT GGTTTTGAAG ATAAATATAC CTTCGGTCGC
CGTGCAAACG GGATTTATGT GCCCCGGTAC CGCAATGTCG GCAGCGATAA ACGTGATTAT
TTACGTGGCT TTGGCTACCA GGGTGGTGCC AGCCGGAAAA ACTGGCAAAG TGATGTGGCG
GAACTGGCCA TAGGTGCTGA TTTTAAAGAT AAAATGAACA AACCAGGGGC CTGGACTATG
GGCCTGGGCG GCTTTGGAGA AATGTTGCCT TACTATGAAA ACCGGGTTTA CATCGATAAA
ACCAAAAAAG ACAAATGGGG ACAGCCGGTA CTGGCTATTG ATTGTGAGTA CAAGGAGAAC
GAGAAAAAGA TGCGTATCGA TATGATGAAC GATGCCGCCG AAATGCTGGA AGCAGCAGGC
ATGAAAAACA TCAAAACCTA TGACAATGGT TGTTATCCGG GTATGGCCAT CCATGAAATG
GGAACCGCCA GAATGGGGAA TGACCCGAAA ACATCGGTGC TCAACAAATG GAACCAAATG
CATGAAGTCA GCAATGTATT CGTTACTGAT GGTTCCTGTA TGCCATCTAT TGCCTGTCAG
AACCCATCTT TAACCTTTAT GGCATTAACT GCGCGTGCAT CCGATTATGC GGTTAAAGAA
TTGAAGAAAG GGAATATTTA A
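
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be normalized before any analysis. The sketch below is one hedged way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short string in the example is a made-up input, not part of this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Hypothetical toy input, used only to show the calls.
toy = clean_sequence("ATGC gg\ncc")
print(toy, gc_percent(toy))  # ATGCGGCC 75.0
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the reported Gene Length, and `gc_percent(...)` with the reported GC content.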
 
Protein sequence
MNINTDLKKE NTYDAIVIGS GISGGWAAKE LTEKGLRVLM LERGMNIEHV KDYDSAMKNP 
WEFQHAGKLT EEQKRTHPVQ KRDYPYQEAN EKWWVNDLEC PYTEDKRFDW YRGFHVGGKS
LMWGRQSYRF SDHNFEDNAR DGHGNDWPIR YKDIAPWYDY AERFAGISGQ AENWPLLPDG
QFLPPMDLNC VEKSVKKRIE EHYKRTRIMT IGRVANLTVP HKGRGNCQYR NLCSRGCPFG
AYFSTQSSTL PAAQATGRLT LRPYSIVNHI IYDKNTKKAK GVMVIDAETQ KTMEFYAKIV
FVNGSTLGST FILLNSTSEA HPNGLGNGSG QLGHNLMDHH FRCGASAEAL GFEDKYTFGR
RANGIYVPRY RNVGSDKRDY LRGFGYQGGA SRKNWQSDVA ELAIGADFKD KMNKPGAWTM
GLGGFGEMLP YYENRVYIDK TKKDKWGQPV LAIDCEYKEN EKKMRIDMMN DAAEMLEAAG
MKNIKTYDNG CYPGMAIHEM GTARMGNDPK TSVLNKWNQM HEVSNVFVTD GSCMPSIACQ
NPSLTFMALT ARASDYAVKE LKKGNI