Gene Phep_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2100 
Symbol 
ID8253205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2421367 
End bp2422740 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content48% 
IMG OID644935749 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003092367 
Protein GI255531995 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.773732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.123203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATAA AATCTATTGA CCCGACAAAT GGCAAGGTAA TTAAATCTTA TCCTGAAACC 
ACCAAAGCAC AAGTTGTTAA AAAGATTGAA CAGGGACATA AAGCCTGGAC AGAATGGAGA
AAAAGCAGTA TTAAAGAAAG GGCTGCCCTG CTAAGGGGTC TCGCTGATCA GCTGCACATA
CAAAGAGCGG AGCTGGCAAG GCTCATGGCT TTGGAAATGG GCAAACCCCT GAACGATGGC
CTGGCCGAGA TAGACAAATG TGGCGCTGTA TGTAAATACT ACGCAGAAAA AGGGGCAGAT
TTTTTGCAGG ACCAGCGGAT TGAGACTGAG GCTTCAAAAA GCTACGTCAG CTTTCAGCCT
CTGGGTGTAG TGCTGGCCGT AATGCCCTGG AATTTTCCTT ACTGGCAGGT ATTCAGGTTT
CTTGCCCCCG CCCTGATGGC GGGCAATTGC GGGGTACTGA AACACGCTTC AAATGTACCG
GGATGCGCCC TATCTATAGA AAAGCTGGTA AAGGATGCGG GTTACCCTGC CCATGTATTT
CAAACCCTGA TGATTGGCAG TAACGTGGTA AACGAAGTGA TTGCCCATCC GCTCATAAAG
GCGGTAACCC TTACCGGAAG CACGCAGGCA GGAATGAAAG TTGCCGCACA GGCAGGCATG
CTGCTGAAAA AAACAGTACT TGAACTGGGG GGCAGTGATC CTTATTTGGT ACTGGAAGAT
GCCGACCTGG AATTTGCAGC CGAAACCTGC GTAAACAGCA GGCTGATCAA CAACGGGCAG
AGCTGTATTG CAGCAAAAAG ATTTATTGTT GTAAAAAAGA TAGAAAAGGA ATTTACCAGG
CTTTTTGTGC AAAAAATGAA ACAGAAAAAA CTGGGTAACC CTTTGGAAGC GGATATCAAC
CTGGGTCCTA TGGCCCGTGC AGATTTACGT GACGAGCTGC ACCAGCAGGT ACTGAAGAAT
ATAGAAATGG GTGCAAAATG CCTGCTTGGC GGCCGGATTC CTGCGTTTAA AGGCCAGCAT
GCCTACTATG AACCTACTGT ACTTAGCGGA ATAAAAAAAG GGATGCCTGC TTACAGCGAG
GAAATGTTTG GCCCGGTGGC GGCCATACTG ACGGCCAGAG ATGTGGAACA AGCCATTGAG
CTGGCCAACG ATACTTCATT CGGACTTGGA GCTGCCGTAT TTACAGCTAA TGAAAAACTG
GGTGAAGAAA TAGCAAGGAC CCGCCTTCAG GCTGGTTCCT GCTTTGTAAA TTCGCTGGTA
AAATCCGATC CCCGCCTACC CTTTGGGGGC ATTAACCAAA GCGGCTACGG GCGCGAACTG
GGCCTGTTTG GTATTCATGA ATTTGTAAAC ATTAAAACGG TTTATGTGAA ATGA
 
Protein sequence
MSIKSIDPTN GKVIKSYPET TKAQVVKKIE QGHKAWTEWR KSSIKERAAL LRGLADQLHI 
QRAELARLMA LEMGKPLNDG LAEIDKCGAV CKYYAEKGAD FLQDQRIETE ASKSYVSFQP
LGVVLAVMPW NFPYWQVFRF LAPALMAGNC GVLKHASNVP GCALSIEKLV KDAGYPAHVF
QTLMIGSNVV NEVIAHPLIK AVTLTGSTQA GMKVAAQAGM LLKKTVLELG GSDPYLVLED
ADLEFAAETC VNSRLINNGQ SCIAAKRFIV VKKIEKEFTR LFVQKMKQKK LGNPLEADIN
LGPMARADLR DELHQQVLKN IEMGAKCLLG GRIPAFKGQH AYYEPTVLSG IKKGMPAYSE
EMFGPVAAIL TARDVEQAIE LANDTSFGLG AAVFTANEKL GEEIARTRLQ AGSCFVNSLV
KSDPRLPFGG INQSGYGREL GLFGIHEFVN IKTVYVK