Gene Phep_3645 details


Gene Information

Locus tag: Phep_3645
Symbol:
ID: 8254776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4358333
End bp: 4360036
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 46%
IMG OID: 644937306
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_003093898
Protein GI: 255533526
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
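
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4358333
end_bp = 4360036

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1704
print(protein_length)  # 567
```

Under these assumptions the computed values agree with the listed Gene Length (1704 bp) and Protein Length (567 aa).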


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.948951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATT TAAATACTAA AGCAACTGCG CAGAACACTT ATGATGCGAT TGTTGTAGGA 
TCTGGGATTA GCGGCGGCTG GGCAGCAAAA GAACTGACAG AAAAAGGTTT AAAAGTTTTA
CTGCTGGAAA GAGGCAGAAA CATTGAACAT ATAAAAGATT ATACCACGGC CATGAAAAAG
CCCTGGGAAT TTGAACACAG GGGTAAACTG ACTGAAGAAC AGAAGGCCAC GCACCCTGTA
CAGAAAAGAG ACTATCCTTA TACAGAATTT AACGAAAGCT TTTGGGTGAA CGACCTGGAA
TGCCCTTATA CAGAGGTAAA GCGTTTCGAC TGGTACCGTG GCTTTCATGT GGGTGGCAAA
TCACTGATGT GGGGCCGTCA GAGCTACCGC TTCAGCGACC TGAACTTTGA GGATAACCTG
AAAGACGGAC ATGGGGTAGA CTGGCCGATC CGCTACAAGG ACATTGCGCC ATGGTACGAT
TACGTGGAGA AATTTGCCGG GATAAGCGGA CAGGCAGAAG GCTGGCCGGC ATTACCTGAC
GGACAGTTTT TACCGCCCAT GGAAATGAAC TGCGTAGAGA AAGAGGTAAA AAAACGAATT
GAAGCCAAAT GGAACAAATC AAGGATCATG ACCATTGGCC GCACGGCGAA CCTTACGGTA
CCCCACGGCG GACGTGGCAG TTGCCAATAC CGCAACCTGT GTAGCAGGGG CTGTCCTTTT
GGTGCTTACT TTAGTACCCA ATCATCTACC CTTCCGGCGG CTGTGGCTAC CGGAAACTTA
ACACTTCGTC CTTTTTCGCT GGTAAATACC ATCATTTACG ATAAAGAAAC ACAAAAAGCA
AAAGGCGTGG TTGTGATCGA TTCCGAAACC AAAGAAACGA CAGAATACTT TGCAAAGATT
GTATTTGTAA ACGGCTCTAC ACTTGCCTCG ACCTTCATTT TATTGAATTC TGTATCTGAT
GAACATCCCA ACGGATTGGG CAATGGCAGC GGACAGCTGG GCCATAATTT AATGGACCAC
CATTTCCGTT GCGGTGCCAA CGGGCGGATT GAAGGTTTTG ACGATAAATA TACTTACGGC
AAACGGGCCA ATGGCATTTA CATTCCGAGG TATCGGAACA TGGGCAATGA TAAACGTGAC
TATGTGAGGG GCTTTGGTTA CCAGGGCGGC GGAAGTAGAG GTAACTGGCA TGCCGATGTA
GCAGAAATGG CTTTCGGTGG CGAGTTTAAA GACCAGATGA CCGTACCTGG CTCCTGGAAT
ATGGGCCTGG GTGGTTTTGG TGAATCGCTC CCTTATTTTG AAAACAGGGT GTATATCGAT
AAGAGCAAGA AAGACAAATG GGGACAACCT GTACTGGCCA TTGATTGCGA GTTTAAAGAT
AACGAACGTA AAATGCGGGT AGACATGATG AACGATGCCG CTGAAATGCT GGAGGCATCT
GGTGCAAAAG ATGTTAAAAC TTTTGACAAT GGCTCGGCAC CGGGCATGGC CATTCATGAA
ATGGGTACGG CACGTATGGG CCACGATCCT AAAACATCTA TACTGAACAA ATGGAACCAG
ATGCACGAAG TAAAAAATGT TTTTGTAACA GATGGTTCAT GTATGGCCTC TGCTGCCTGT
CAGAACCCAT CATTAACATA TATGGCTTTA ACTGCAAGGG CGTGTGATTT TGCAGTTAGT
GAATTAAAAA AAGGTAACCT TTAA
 
Protein sequence
MSNLNTKATA QNTYDAIVVG SGISGGWAAK ELTEKGLKVL LLERGRNIEH IKDYTTAMKK 
PWEFEHRGKL TEEQKATHPV QKRDYPYTEF NESFWVNDLE CPYTEVKRFD WYRGFHVGGK
SLMWGRQSYR FSDLNFEDNL KDGHGVDWPI RYKDIAPWYD YVEKFAGISG QAEGWPALPD
GQFLPPMEMN CVEKEVKKRI EAKWNKSRIM TIGRTANLTV PHGGRGSCQY RNLCSRGCPF
GAYFSTQSST LPAAVATGNL TLRPFSLVNT IIYDKETQKA KGVVVIDSET KETTEYFAKI
VFVNGSTLAS TFILLNSVSD EHPNGLGNGS GQLGHNLMDH HFRCGANGRI EGFDDKYTYG
KRANGIYIPR YRNMGNDKRD YVRGFGYQGG GSRGNWHADV AEMAFGGEFK DQMTVPGSWN
MGLGGFGESL PYFENRVYID KSKKDKWGQP VLAIDCEFKD NERKMRVDMM NDAAEMLEAS
GAKDVKTFDN GSAPGMAIHE MGTARMGHDP KTSILNKWNQ MHEVKNVFVT DGSCMASAAC
QNPSLTYMAL TARACDFAVS ELKKGNL
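
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch in Python. It is not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs mainly in the permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and introduced only for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (NCBI compact notation).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display rows of the gene sequence above.
first_row = "ATGAGCAATT TAAATACTAA AGCAACTGCG CAGAACACTT ATGATGCGAT TGTTGTAGGA"
last_row = "GAATTAAAAA AAGGTAACCT TTAA"

print(translate(first_row))  # MSNLNTKATAQNTYDAIVVG
print(translate(last_row))   # ELKKGNL
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the complete gene sequence to `translate` and `gc_percent` allows the result to be compared with the listed Protein Length and GC content fields.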