Gene Phep_4116 details


Gene Information

Locus tag: Phep_4116
Symbol:
ID: 8255250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4972339
End bp: 4974129
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 43%
IMG OID: 644937780
Product: hypothetical protein
Protein accession: YP_003094369
Protein GI: 255533997
COG category:
COG ID:
TIGRFAM ID:
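
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both are assumptions, though the numbers in the record are consistent with them. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 4972339
END_BP = 4974129
GENE_LENGTH_BP = 1791
PROTEIN_LENGTH_AA = 596


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1791
    print(expected_protein_length(GENE_LENGTH_BP))  # 596
```

Both results agree with the Gene Length and Protein Length fields of the record.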


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.552536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGA CCAGCAAGCT TTATCAATTA ATTCTTTTTA GTGCATTAAC CGGAATTTCC 
GGCGCCGGAT TTTGTCAGCA AACGCATTGG GACAAAGGAC AAATATTTAC AGAGGCAAAC
GATCCAGTAA AAACGGACAA CAAAAACTGG TTAGAGGTAA AACCGGGTTT ACATAGTTCT
TTTGTATCTA TTGATAAACG TTATGCTAAA TCTGAAGTGC CTAATATTAA CATAGAGCGT
TCAGTTCTTT TAACAGGTTG GAAAGGAGAG CGTTTGTCAG CCCAGGTGCT GCTCTGGACT
ACAGATTCCA TTCCCGGAGT GAAAGTAACC TTATCTGATT TTATTTCTGA AAGTGGGAGT
AAGCTCAAAT CAATCGGTTA TGCCCGGTTC GAGCGATATG TGCTGACAGA CGAATATGGA
TCCGGATGGG CTTGCGGAAA ACGGAACGCA GCAGATTTTT CCAGTTCATT ATCTGCAGAT
ATGCTGGATG ACCTTTCGTC CTTCAATCTG GAAAAGAAAA AAGTAAGACC CGTGTGGATC
ACCCTAGAAA TCCCCAGAAA GGCAGAGCAA GGCTCCTATT CGGCGAAGGT ACAAATAACT
ACCAAACAGG GAAAGCAGCA GGAACTGAAC TTATCCCTGG ATGTTATCAA TCAGCTGTTA
CCGCAACCAT CTTCGTGGAG CTTTCACCTC GACCAATGGC AACACCCCTC AGCAGTTGCC
CGTGTAAATA AATTACCTGT ATGGAGCGAG GCGCATTTCG AAGCCATGAG ACCACAAATG
CAGCTCCTGG CAAACGCCGG TCAGAAAGTA ATTACGGCTA CATTGAACAA AGACCCCTGG
AATATTCAAA CTTACGATCC TTATGAAGAT ATGATCATCT GGACAAAGGG AAAGGACGGA
AGCTGGTCGT ACGACTACCG GATTTTTGAT AAGTGGGTAT CTTTTATGAT GGGCCTTGGA
GTAAAGAAGA TGATCAACTG TTATTCTATT GTTCCCTGGA ATAATGAAAT CCATTATAAA
GATGCCATCA CAAACAAGTT TGTAAACATA GTGGCCAAGC CTGGTACTAC GGCGTTTACC
GAAATGTGGG AACCGTTCCT GAAAGATTTT GCAAAACATC TGCAGCAAAA AGGCTGGCTT
GAAATTACAA ACATTGCCCT GGATGAAAGA AATAAGGATG AGATGGGAAT GGCTTTTGCA
CTGATAGAAA AGGTAGCCCC AAAACTTGGC GTTGCCTATG CTGATAATCA AAAGACCTAT
AAGCGTTATA CCAACAGTGA TGATGTGAGT ACCGCGGTTC AACATCCTAT AGATGACAAA
GATATTGCAG AGCGTAGAAG CAAGGGATTG AACACTACCT TTTATATCTA TTGTGGAAAT
AGTTTTCCCA ATCAATTTAC TTTTTCTGAG CCCGCTGAGT CTGCTTATTT AGGCTGGTAT
ACCCTGGCTA CAGGTTATAA TGGCGTGCTG CGCTGGGCTT ATAATTCCTG GGTGGAAAAT
CCTTTGGTAG ACTCCCGATT CAGAACATGG CCTGCAGGAG ACACCTATAT TACTTATCCG
CAAGCCAGAA GTTCTATCAG ATACGAGCGT ATGCTGGAAG GTATCCAGGA CTATGAGAAA
GTGCTTGTGG TAAAGAAAAT GCTGGAACAT AAGAATGACC TGGCTACTTT AGCGAAATTG
AATGATGCCA TTGCGAAATT GAAAAGCCAT TCCCGATATG AAGGTTGGAA TAGTGATTTA
AATGCAGCAA AGCAATTGCT CAACAACATT TCAGTATCCT TATCGAAATA G
 
Protein sequence
MKVTSKLYQL ILFSALTGIS GAGFCQQTHW DKGQIFTEAN DPVKTDNKNW LEVKPGLHSS 
FVSIDKRYAK SEVPNINIER SVLLTGWKGE RLSAQVLLWT TDSIPGVKVT LSDFISESGS
KLKSIGYARF ERYVLTDEYG SGWACGKRNA ADFSSSLSAD MLDDLSSFNL EKKKVRPVWI
TLEIPRKAEQ GSYSAKVQIT TKQGKQQELN LSLDVINQLL PQPSSWSFHL DQWQHPSAVA
RVNKLPVWSE AHFEAMRPQM QLLANAGQKV ITATLNKDPW NIQTYDPYED MIIWTKGKDG
SWSYDYRIFD KWVSFMMGLG VKKMINCYSI VPWNNEIHYK DAITNKFVNI VAKPGTTAFT
EMWEPFLKDF AKHLQQKGWL EITNIALDER NKDEMGMAFA LIEKVAPKLG VAYADNQKTY
KRYTNSDDVS TAVQHPIDDK DIAERRSKGL NTTFYIYCGN SFPNQFTFSE PAESAYLGWY
TLATGYNGVL RWAYNSWVEN PLVDSRFRTW PAGDTYITYP QARSSIRYER MLEGIQDYEK
VLVVKKMLEH KNDLATLAKL NDAIAKLKSH SRYEGWNSDL NAAKQLLNNI SVSLSK
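
The sequences in this record are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how one might do that and then compute GC content and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code for amino acids (the tables differ in permitted start codons), and it assumes the CDS is given in its coding orientation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Build the codon table in the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def clean(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = (
        "ATGAAAGTGA CCAGCAAGCT TTATCAATTA ATTCTTTTTA GTGCATTAAC CGGAATTTCC"
    )
    print(translate(clean(first_line)))  # MKVTSKLYQLILFSALTGIS
```

The translated fragment matches the first twenty residues of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.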