Gene Phep_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2203 
Symbol 
ID8253309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2534436 
End bp2535794 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content47% 
IMG OID644935852 
Productoxidoreductase domain protein 
Protein accessionYP_003092469 
Protein GI255532097 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGATT CAAGAAGAAA ATTTATCAAA CAATCTGCCA TAGCCGCGGC AGGAACTTAT 
TTGGGAACAA TGGGTTTGAG CGCGAAGAGT TATGGAAATA TTATTGGGGC CAACGACCGG
GTAAGGGTTG GTGTGGTCGG TTTTTCTGAC CGCTTTAAGA GTTCCCTCCT TCCCTCTTTT
TTAAACCACA ACAAAGAACT GAATTTTGAC ATTGTAGCGG TTTCTGACCT TTGGAATTAC
CGCAGGGGTT TGGGTGTAGA GCATTTGAAA TCGAAATTTG GCCATGACAT TACGGCCTGC
CGCAACAATG ATGAACTGTA TGGTTTAAAG GATATTGATG CGGTGATTGT GAGTACTGCA
GATTTTCAGC ATGCTACCCA CTGTGCCGAA GGCGTAAACA ACAAATGTGA TGTGTATTGC
GAAAAGCCTT TTGCGGAGAC GATGGAAGAT GCACGTATGG CATTGAAGGC CGTTAAAGCT
TCTAAACAGA TTGTCCAGAT TGGTTCTCAG CGGAGGAGCG GCAACAATTA CAAGGCTGCC
GAGCGCTTTA TTAAGGATGG CAAGTTTGGC GACATTACCA TGGTGGAGCT GAGCTGGAAT
GTGAACCAGC CGGGACGCTG GCGCAGACCA GAGCTTGTGG CCATGCTGAA ACAGGAGGAT
ACCGACTGGA AGCGCTTTTT GATTAACCGC CCTTTTGAAG AATGGGATCC GCGTAAGTAT
CTGGAGTATC GTCTTTTCTG GCCGTATTCT TCGGGTATGC CCGGACAGTG GATGTCGCAC
CAGATTGATA CTGTGCATTG GTTTACCGAC CTGAAGCACC CAAGAAGTGT GGTGGCCAAC
GGGGGTATTT ACCAGTGGAA AGATGGCCGC AGGAACTGGG ACACCACCAC AGCTGTATTT
GATTATGGTA AGCCGAATGA TCCTAACAAT GGTTTCCAGG TGATATTTAC TTCAAGGATG
CACAATGGTG ATGAGAACCC GGCAGAGATC TATTACTCGA ACGGCGGTGA ACTGAACCTG
AACACGAATA TGGTTTCACC TAAAGGTGGT TTAACCGCAA AAGCTGCTGC AGCCATGAAC
ATGAAGCCAA ACCTGTTGCC TGAGTTGAAG CTGAGTGACA TGACGGAGAA AGTTGCTGCA
TCGGCCGATA CCGGTGGCGA TAAGCTGACC TCTGCACATA TGCGCAACTG GATGGAATGT
GTGAGGAGCA GAAAGCAGAC CAATGCGCCT GTTGAGGCTG GATATTATCA TTCTATTGCG
AACATTATGA CGAATGCTGC AGTGAGGACG GGTAAGAAAG CAGTGTTTGA TGAGAAAACG
CAGGAAGTAA TGGTAGATGG GAAGGTGTTT AAGTACTAA
 
Protein sequence
MLDSRRKFIK QSAIAAAGTY LGTMGLSAKS YGNIIGANDR VRVGVVGFSD RFKSSLLPSF 
LNHNKELNFD IVAVSDLWNY RRGLGVEHLK SKFGHDITAC RNNDELYGLK DIDAVIVSTA
DFQHATHCAE GVNNKCDVYC EKPFAETMED ARMALKAVKA SKQIVQIGSQ RRSGNNYKAA
ERFIKDGKFG DITMVELSWN VNQPGRWRRP ELVAMLKQED TDWKRFLINR PFEEWDPRKY
LEYRLFWPYS SGMPGQWMSH QIDTVHWFTD LKHPRSVVAN GGIYQWKDGR RNWDTTTAVF
DYGKPNDPNN GFQVIFTSRM HNGDENPAEI YYSNGGELNL NTNMVSPKGG LTAKAAAAMN
MKPNLLPELK LSDMTEKVAA SADTGGDKLT SAHMRNWMEC VRSRKQTNAP VEAGYYHSIA
NIMTNAAVRT GKKAVFDEKT QEVMVDGKVF KY