Gene Phep_4268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4268 
Symbol 
ID8255404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5146056 
End bp5147444 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content48% 
IMG OID644937934 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003094521 
Protein GI255534149 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCACT CCGCTTTTGA AGCCCAGCAA AAACATAAAT ATACACTTAG AAACAGCAAT 
GCTGCACAAA GAATTGACAA ATTAAAAACC TTAAAGGCCT GTATAGAAAG CTATGAGGAA
AAAATTTATG CTGCCCTGCA AAGTGATCTG CGAAAAAGCC GGTTTGAAAG TGCGCTTACA
GAACTGATCT TTATTTACAG TGAGATCGGC TTTGCCATCC ATAACCTGAA CAGCTGGATG
AAACCGAAAA GGGCTGGCAA AACCATCAGT AATCTTTTTG CGAAAAACAG GATCTGTTAC
GAGCCAAAGG GCTGCTGCCT GATCATTGCG CCCTGGAACT ATCCTTTTCA GCTCCTCATG
AGCCCCCTCA TCTCGGCCAT AGCTGCGGGC AATTGCGCGA TACTGAAGCC ATCTGAACTG
AGCCCGGCTA CCAGTTCGGT GATTGCAGCG CTGATCAGGG ATTGTTTTGA TGAACGTGAG
GTCTGTTGTT TTGAAGGGGA CGCCGGCATT TCCACTGCGT TGCTTGAGCT GCCTTTCGAC
CATATCTTTT TTACCGGCAG TACCGCAATT GGCAAACTGG TTATGCAGGC CGCCGCCAAA
AACCTCAGCT CGGTTACCCT GGAGCTTGGC GGCAAATCGC CTGTTATTAT AGAGGAAACG
GCCAACCTGA AAAAAGCAGC CGAAAAAATT GCCTGGGGCA AACTGATCAA TGCGGGCCAG
ACCTGTATTG CCCCCGATTA TGTGCTGATC CCCCGCGATC TGCAACAGTC TTTTATCGAA
TATTATAAAG AGGCCGTTAA CCGCTTGTTT TTTAAGAACG GAAAGCTCAA TACCGAAGTT
TATGCAAAGC TGATCAGTAA AAAACACTTT GAAAGGCTGT CAGACTTAAT AACAGATGCC
CTGGATAAAG GGGCCATCAC GGTGCTGGGT GGCGAAAAAG ATGAATCCAG CCAGACCATA
TCCCCAACTG TACTGGCCCG GATCCCCGTA GGAACGACCA TCATGAAGGA AGAGATCTTT
GGGCCTGTCC TGCCCCTTAT CGCTTACCAG ACCCTCAGCG AAGCGGTTGC TTATGTAAAC
CATAAAAGCA AGCCGCTGGC TTTATATGTT TTTAGTGCCA ACCGCAAAAA CATTCAATAC
ATCCTCAAAA ACACCTCTTC CGGAGGGGCC TGTATCAACG ATGTCCTCAT CCACATTTCC
AATCCGAAAC TCCCTTTTGG TGGGGTAAAC GGGAGCGGAA CGGGCAGCTG CCACGGCTTT
TTTGGCTTCA AGGCCTTTTC TCATGAAAGA GCAGTAGTCT ACCAGTCGCC CATCAATACC
ACAGCGCTCA TTTACCCGCC TTATGAAAAC AAGTCCCGGC TGCTGAAATG GTTAAAAAAA
CTGCTGTAA
 
Protein sequence
MMHSAFEAQQ KHKYTLRNSN AAQRIDKLKT LKACIESYEE KIYAALQSDL RKSRFESALT 
ELIFIYSEIG FAIHNLNSWM KPKRAGKTIS NLFAKNRICY EPKGCCLIIA PWNYPFQLLM
SPLISAIAAG NCAILKPSEL SPATSSVIAA LIRDCFDERE VCCFEGDAGI STALLELPFD
HIFFTGSTAI GKLVMQAAAK NLSSVTLELG GKSPVIIEET ANLKKAAEKI AWGKLINAGQ
TCIAPDYVLI PRDLQQSFIE YYKEAVNRLF FKNGKLNTEV YAKLISKKHF ERLSDLITDA
LDKGAITVLG GEKDESSQTI SPTVLARIPV GTTIMKEEIF GPVLPLIAYQ TLSEAVAYVN
HKSKPLALYV FSANRKNIQY ILKNTSSGGA CINDVLIHIS NPKLPFGGVN GSGTGSCHGF
FGFKAFSHER AVVYQSPINT TALIYPPYEN KSRLLKWLKK LL