Gene Phep_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2271 
Symbol 
ID8253377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2638045 
End bp2639670 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content40% 
IMG OID644935920 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003092537 
Protein GI255532165 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0315969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCAA AAAATCAGCT CGATTCTATT TTCGTAGCAG AAAATCAGAT CCCTGAAGCG 
TTTAAATTAT CCGAAGAAAT ACACCAGCGT GAATACCTGA CTAATGGTGA AATGCGCGCC
TGGAATGGTG AGGTACATGA AGTTCTATCT CCTGTTTGTA TTAGAACAGA AAAGGGACTG
GAAAGAAAGT TAATAGGAAC ATATCCTTTA TGTAGTGAAA AAGAGGCTGA TGAAGCACTT
CAGGCTGCAG TAGCCGCATA CAACAACGGA AGAGGAGAGT GGCCAACTAT GAGTGTAGCA
GACAGGATCC ATTGCGTAGA ACAGTTTACG CATAAAATCA TTGAAAAGAA GGCCATCGTT
GTCAAATTGT TAATGTGGGA AATAGGTAAA TCATATGCCG ATTCTGTCAA AGAATTTGAC
CGTACAGTGG AGTATATTTA TGCTACAATT GATGCGCTGA AAGACTTAGA CAGACAATCT
TCTAAATTCA GCATAGAGCA GGGAATTGTG GCACAGATCA GACGTTCACC CTTAGGTGTG
GTATTGTGCA TGGGGCCATT TAATTATCCC TTAAATGAAA CTTTTACCAC ACTGATCCCT
GCGTTGATCA TGGGAAATAC CTTGCTGTTT AAACCACCTA AGCATGGTAC GCTGTTACAT
TACCCACTGT TGGAAGCATT TAGGGATTGT TTTCCAAAAG GAGTAGTAAA CACCATATAC
GGTCGGGGAA ATAAAATTAT TCCTGATCTC ATGAAATCTG GTCAGATTAA TGTTTTAACA
TTAATTGGAT CAAGCAAGGT AGCCAATGAA CTTAAAAAGT TACACCCTAA AGTTAACAGG
TTAAGGGCAA TCCTTGGTTT AGATGCCAAA AATGCCGCAA TAATTACCGC AAAAGCTGAT
ATTAACCTGG CAGTACAGGA AACTGTATTG GGGTCATTGT CTTTTAACGG ACAAAGGTGT
ACTGCCCTTA AGATCGTTTT TATACACCGA AGCCTGGCCG ACGTATTTCT GAAAGAGCTT
TCCGCTGCAG TCGCTGAACT TAAATTTGGT ATGCCATGGG AAACAGGTGT TTCTTTAACT
CCTTTACCGG AGCCACAAAA ACCGGCTTAC CTTAAAGACT GTATAGCTGA TGCAGTAGCT
AAGGGTGCTA AGATTGTGAA CGACAATGGG GGAGATAGCT GTGAATCATT TGTATATCCT
GCAATTGTTT ATCCTGTAAA TAAGCACATG AAGTTGTATA CAGAGGAGCA ATTTGGACCT
GTAATACCAG TTGTACCATT TGATGATCTG GAAGAAACCA TTCAATATCT TATTGATTCT
ACACATGGAC AACAGGTGAG TATTTTTAGC AATGATGATG AAGAAATCGC TGCGCTTATT
GATCCGCTGG TTAATCAGGT AAGCAGGGTT AATATCAATT GCCAATGCCA ACGCGGACCG
GATGTATTTC CGTTTACAGG CAGGAAAGAT AGCGCAGAAG GAACCTTATC TGTAATTGAT
GCCTTAAGGT CGTTTTCTAT CCGCTCTTTA GTGGCTACTA AATTAAATGA GAGTAACAAA
CACCTGATCA ATGAAATTGT AGACAGCAAT AGTTCCAATT TCCTGAGTAC AAAATATTTG
TTTTAA
 
Protein sequence
MFSKNQLDSI FVAENQIPEA FKLSEEIHQR EYLTNGEMRA WNGEVHEVLS PVCIRTEKGL 
ERKLIGTYPL CSEKEADEAL QAAVAAYNNG RGEWPTMSVA DRIHCVEQFT HKIIEKKAIV
VKLLMWEIGK SYADSVKEFD RTVEYIYATI DALKDLDRQS SKFSIEQGIV AQIRRSPLGV
VLCMGPFNYP LNETFTTLIP ALIMGNTLLF KPPKHGTLLH YPLLEAFRDC FPKGVVNTIY
GRGNKIIPDL MKSGQINVLT LIGSSKVANE LKKLHPKVNR LRAILGLDAK NAAIITAKAD
INLAVQETVL GSLSFNGQRC TALKIVFIHR SLADVFLKEL SAAVAELKFG MPWETGVSLT
PLPEPQKPAY LKDCIADAVA KGAKIVNDNG GDSCESFVYP AIVYPVNKHM KLYTEEQFGP
VIPVVPFDDL EETIQYLIDS THGQQVSIFS NDDEEIAALI DPLVNQVSRV NINCQCQRGP
DVFPFTGRKD SAEGTLSVID ALRSFSIRSL VATKLNESNK HLINEIVDSN SSNFLSTKYL
F