Gene Phep_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3357 
Symbol 
ID8254476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3979766 
End bp3981037 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content38% 
IMG OID644937009 
Producthypothetical protein 
Protein accessionYP_003093613 
Protein GI255533241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0185934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TATTATTGGC GTTTTTATGC CCTGCATTGT TATTGAGTAT GAATTGCCGT 
TCTGTTGGTA AACAAAACCA ATCTAAAACC CGTAAATACA TGAGCTATAA AGGTTTGGTG
ATGGCAGGTT ATCAGGGTTG GTTTAATGCC GATGGTGACG GAGCAGACAG GGGTTGGAAC
CATTATAAAA ACAGGGACAA CAGGTTTGAA CCGGGTAACT GTAAAATTGA TATGTGGCCA
GATGTTACAG ACTACACTGC AAAGTATAAA ACATCTTTTA CATATGCCAA CGGGGGAGCT
GCGTATGTTT TTAGTTCATA TGATGAGAGT ACTGTGGACC TGCATTTTAG ATGGATGAGG
GACTATGGTA TAGATGGGGT TTTCATGCAG CGCTTTGTGA CGACGTTAAA AGATGAAAAA
GGAAACAAAC ATTATCAGAA AGTATTTCAA TCTGCAGTAA ATGCAGCTAA AAAATACGAC
AGGGCTTTAG CTGTAATGTA TGATTTATCC GGGATGAATG CATCAGATTA TACAAAGGTG
ATTGCCGATT GGAAAAGTTT AGTGGATACG TATAAATTAA ACAATAAAGA TTTAAATGAC
AATTATTTAT TTCACAATAA TAAGCCTTTA GTGGCCATAT GGGGTGTTGG CTTTAATGAT
GGAAGAAAAT ATGGACTGTC TGAAATAGAT AAGTTAATTA CATTTTTTAA AAGCGACCCG
GTATATGGAA GTTGTTCCTT ACTTCTTGGT GTACCTACCT GGTGGAGAGA ATTAAAGTTT
GATACGCAAA GTGATCCCCA ACTCCACCAG ACGATAAAGA GAGCAGATAT TGTGCATCCC
TGGTTTGTAG GTAGGTATAA TGAAGAGACC TATCCGCAAT TTCAGGAACG TATTAAAACA
GATATGGCCT GGTGTAAGCA AAATAAACTG GACTATGTAC CTGTAGTTTA TCCAGGTTTC
AGCTGGAAAA ATATGCGCCC CAATGATCCC TTTGACGCCA TTCCAAGGAA TAAAGGGAGT
TTTTTTTGGA AGCAATTATC GGGTGCTTTA GAAATTGGAT GTGAGATGAT TTATGTGGCC
ATGTTTGATG AAATAGATGA GGCAACTGCT ATTTTTAAAG TTGGGCACGA TACCCCTGTT
GGAGCCAGTA AATTTGTCCC TTACGAAAAA GAGATACCGT CGGATCATTA TTTATGGTTA
ACGGGGCAGG CTGCCGGTAT GTTGAAAAAG GAAATTCCCT TTCAAAAACC TATGCCATAT
AGAACCTATT GA
 
Protein sequence
MKKILLAFLC PALLLSMNCR SVGKQNQSKT RKYMSYKGLV MAGYQGWFNA DGDGADRGWN 
HYKNRDNRFE PGNCKIDMWP DVTDYTAKYK TSFTYANGGA AYVFSSYDES TVDLHFRWMR
DYGIDGVFMQ RFVTTLKDEK GNKHYQKVFQ SAVNAAKKYD RALAVMYDLS GMNASDYTKV
IADWKSLVDT YKLNNKDLND NYLFHNNKPL VAIWGVGFND GRKYGLSEID KLITFFKSDP
VYGSCSLLLG VPTWWRELKF DTQSDPQLHQ TIKRADIVHP WFVGRYNEET YPQFQERIKT
DMAWCKQNKL DYVPVVYPGF SWKNMRPNDP FDAIPRNKGS FFWKQLSGAL EIGCEMIYVA
MFDEIDEATA IFKVGHDTPV GASKFVPYEK EIPSDHYLWL TGQAAGMLKK EIPFQKPMPY
RTY