Gene Phep_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4075 
Symbol 
ID8255209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4917335 
End bp4918372 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content44% 
IMG OID644937739 
ProductNADH dehydrogenase (quinone) 
Protein accessionYP_003094328 
Protein GI255533956 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.329571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATAG CTTTTGTAAT CGAAAAATTT GTACTTGTAG CTATCATTTT TGGTATCAGT 
TTGCTGATTG CCATGTATTC TACGTATGCA GAAAGAAAAG TGGCAGCCTT TTTACAGGAC
AGACTTGGAC CAGACAGAGC CGGTCCTGCA GGAATGTTCC AGCCTTTGGC CGATGGTTTA
AAGATGTTTA TGAAGGAAGA AATCATTCCT TCAAATGCGA GCAAATGGTT GTTCATGGTT
GGGCCTGGCC TGGCGATGCT TACTGCTTGC ATTGGTACTG CCGTGATCCC ATGGGGAAGT
CCGGTTACCA TTGACGACAG GGTGGTCCCT TTACAGGTAA CCGATATCAA TGTGGGCCTG
CTGTATATCT TTGGTGTAGT TTCACTGGGG GTATATGGGG TTATGATTGG TGGCTGGGCT
TCAAACAACA AATATTCTTT GCTGAGTGCC ATCAGGGCCG CTTCGCAGAA CATCAGTTAT
GAAATTGCCA TGGGCTTGTC TATCATAGCC CTGTTATTGG TAACCAATAC GCTGAGCTTA
AAAGAAATTG TGGAGCAGCA GCATGGCTGG CACTGGAATG TACTGTATCA GCCACTGGGC
TTTATCCTGT TTATGGTGTG TTCATTTGCT GAGACCAACA GGGCACCTTT CGATTTGCCT
GAATGTGAAA CGGAACTGAT CGGGGGCTAC CATACTGAAT ATTCTTCCAT GAAACTGGGT
TTCTATCTGT TTGCAGAGTA CATCAATATG TTTGTTTCGG CAGCAGTAAT GGCCACCTTA
TATTTTGGTG GATATAATTA TCCCGGAATG GATTGGATGG CCACATTATT GGGGCCAACC
TGGGCGCCAC TTTTTGGTAC CTTGGTGTTC TTCGTTAAAA TATTTGTATT TATATTTTTC
TTCATGTGGG TACGCTGGAC CATTCCGCGT TTCCGCTATG ATCAACTGAT GCATTTAGGC
TGGAAAGGAC TGATCCCTCT GGCGATAGCG AACATCGTGA TCACAGGTAT TGTGATCGCA
ATAATTGAAA AGTTTTAA
 
Protein sequence
MDIAFVIEKF VLVAIIFGIS LLIAMYSTYA ERKVAAFLQD RLGPDRAGPA GMFQPLADGL 
KMFMKEEIIP SNASKWLFMV GPGLAMLTAC IGTAVIPWGS PVTIDDRVVP LQVTDINVGL
LYIFGVVSLG VYGVMIGGWA SNNKYSLLSA IRAASQNISY EIAMGLSIIA LLLVTNTLSL
KEIVEQQHGW HWNVLYQPLG FILFMVCSFA ETNRAPFDLP ECETELIGGY HTEYSSMKLG
FYLFAEYINM FVSAAVMATL YFGGYNYPGM DWMATLLGPT WAPLFGTLVF FVKIFVFIFF
FMWVRWTIPR FRYDQLMHLG WKGLIPLAIA NIVITGIVIA IIEKF