Gene Phep_2555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2555 
Symbol 
ID8253662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2973438 
End bp2976602 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content43% 
IMG OID644936205 
Producthypothetical protein 
Protein accessionYP_003092821 
Protein GI255532449 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.291539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA AAACTTTAAC AGCAATTGTT TTCATTAGCG CCCTGGTCCT TATACTTGGT 
TTTCAACTGG GGGGCAGGAA ATTAAAGGCT GATGGTTTTG AGGTATTGAA AAAAGCCTTC
AAAAATCCTG GTAATGAATA TGGCACCACC CCATTCTGGG TTTGGAACAC TAAGGTAAGC
GGGGAAATGA TAGATTCTAT GATGCTGGAC TATAAGAAAA ATGATTTCGG AGGAGTAATC
GTCCACGCCC GGCCCGGACT GATCACTGAA TATTTATCTG ATGAATGGTT CAGTTTATTT
GAATATGCCC AGCAAAAGGG AAAACAATTG GGTCTGAAGG TCTGGATTTA TGATGAAAAT
TCTTATCCTA CCGGGTTTGG GGGTGGTTTG GTGCCCGACC AGATGCCGGA ATCGTACAAT
CAGGGCCAAA TGCTATATAT GAGTGAAGTA ACAGCTATTC CGGGTAATCT GGAAGAGGTT
TTAATAGTCC TTAGAGAAGA CAAGGGACAG TTTACGGACA TAAGTTTAAG CCTGGCCGCA
GAGAAGGGTA AAGCAGGGAA ATACCGGATT TTCAAAAAAG TGAATTATCA AAAGTCTAAC
CGTGGAACGG TGGCAGGGCC TATTGGCTCT TCTTACGTAG ACCTGATGGC CAAAGGCGTT
ACACAAAAGT TTATGGACAT TACCTTTAAA GGATATGAAA AAGTTGCAGG CCATGAATTT
GGCAAAACTG TTCCGGGTAT CTTTTCTGAT GAACCAACCA TTATCAATGA GGGTAAGGAT
TGTGTACGCT GGACACCCGA CCTGTTTGCG CATTTCAAGG AAAAATGGGG TTATGACCTG
CGTCTTCAGC TGCCATCGCT ATTTGAGGAA ACCGGTAACT GGAAAAAAGT AAGACACAAC
TATTACCAAA CGCTGTTGCA ACTGTTTATC GACAGATGGT CTAAACCTAT GTTTAATTAT
ACCGAGCAGC ACAACCTGAT CTGGACGGGG CATTACTGGG AACACGGCTG GCCAAGCCCT
TACCATGGGC CCGACAATAT GACCATGTAT GCCTGGCACC AGATGCCTGG AATTGACATG
CTGTTTAACC AGTACAATGA AGATAAACCC GTACAGTTTG GAAACATCAG GGCGGTTAAA
GAACTGGGTA GTGTAGCCAA TCAACTTGGA AAAAAAAGAA CGCTTTCCGA AACCTATGGT
GGTGGCGGCT GGGAGCTGAC TTTTAAGGAC ATGAAGCGGT TGGGCGACTG GGAGTTTGTG
CTTGGCGTGA ATTTTATGAA CCAGCACCTG TCGTTCATGA GTATTACCGG AGCAAGAAAA
TACGATTATC CGCCGAGTTT TTCTTATCAT GAGCCCTGGT GGCCTTTCTA TAAAACGCTG
AACCAATATT TTTCCCGGTT GTCGCTCTGC CTGTCAACGG GCCAGCAAAA AAATGATGTA
CTGGTTTTGG AACCCACTAC TTCTGCATGG ATGTATTATT TCAGAGGTAA AGAGAACAAA
CGGTTTTTTG AAATCGGGAA AAATTTTAAT GATTTTGTAA CTACCCTGCA AAGGGCGCAT
GTTGAATATG ACCTGGGTTG CGAGAACATT ATCAAAGATC AGGGAAAGAT TGAAAGCGGA
AAATTCGTTA TCGGCCAGCG TGCTTACAGT ACCGTTGTCA TTCCACCTGG TATGGACAAT
ATTGACCGCC CTACATTTAA TTTATTGAAA GAATATGCCC GTCAGGGCGG CAAAGTGATC
TTATTTGAAA AGCTTAGCTG TTTAGATGGG GATGCCAATG ACGCACTGGA TTTTTTTACA
GCTGCTAAAA GCAATGTTTT GCTGGCTGTT GGTGCCGCTG AACAACAGAT CATCCAAAAT
CATTTGCTGC CTGCAGACAT CCGGATATCG GCTGTAGGCA ATAATAGGAT AGGTGGTAAT
TTGTATTACC AACGCAGACA ATTGGCTGAC GGGCAATTGA TTTTCCTTTC AAATGCGAGT
ATGGAGGCCG CTTCAAAAGG TATGGTTACG GTTAATGGTA AAGACGCATT GATGATGGAC
CTGTTCAGTG GGGGGATTTA TGATTATCCG GAAAAAGAAA GCTCCGGTAA GCTTAGTTTT
AAATTTGATA TTCCAGCAGC GGGAAGCATG ATGTTTTTTG TGGCCGATAA GAAACAAACG
GGTTTCAAAG CACCTGCTTT AAATCGGGCT GAAACACTTG TAAAAACAAG CCAGTCAAAA
GTACTTCGCC CCGCAGAAAA TACGCTAATG ATTGACTTTT GTGACCTGGA ACTCAAAAAA
CCCGATACCG CTTATCGGGA CTTGCATGTT GGCATCGCTT CTAATACAGC TTTTAAACAT
TTTGGATTTA AAGATGGTGT AGGCAATCCA TGGAATAACC GTACCCAGTT TAAAGATCGG
ATTGTTGCCC GCGACACTTT TTCGGTAGGT ACCGGATATA TGGCGACCTA CCGTTTTGAA
ATAGCCCAAA ATGTTAACCT GAAAAATTTT CGTGCAGTAG TAGAACAACC AGGCTTGTGG
AATAGTGTGT CTGTAAACGG AACAAAGGTA AAGGCTTTGA AAGGTAAATG GTGGCTTGAC
CGTTCTTTTG GGGTTTTTGA AATTGGCAAT TACCTTAAGC CGGGTAAAAA CAGTATTTCA
CTGGCTGTTT CGCCAATGCG TGTTTATGCA GAAATAGAAC CTGTATATAT ATTGGGCGAC
TTTAACCTTG TTCCGGCAGT AAAGGGCTGG GAAATAACTG CAGCTGCACC GCTTACATTT
GGACCCTGGA ATAAACAGGG GCTGCCGTTA TATGGACAGG GTATCAGCTA TGTAAAGGAG
TTTCAGGTTC AGACCATGGG CAAGGAATAT GCCGTTAAAC TGAAAAAGTG GAAAGGCACA
GTGGCTGCAG TAAAAGTTAA CCATGTGCTT GCAGGAATTA TCAGTTCGGA ACCTGATGAA
CTTAATATTA CCCCTTATCT TAAAAAAGGC CTTAACCATG TTGAAATTGA GGTGATTGGA
AGTCTTAAGA ATTTACTTGG CCCACATCAC AATAAACCCT TACCTGGTTT GGTAGATCCC
GGCAAATGGT ATAATGTTAA AACATACCCT TCAGGAAATG ATTATCAGAC TTATGGTTAT
GGTTTGGAAG AAGATTTTGA TATTCATGAG ATCATCCCCC GATAA
 
Protein sequence
MKFKTLTAIV FISALVLILG FQLGGRKLKA DGFEVLKKAF KNPGNEYGTT PFWVWNTKVS 
GEMIDSMMLD YKKNDFGGVI VHARPGLITE YLSDEWFSLF EYAQQKGKQL GLKVWIYDEN
SYPTGFGGGL VPDQMPESYN QGQMLYMSEV TAIPGNLEEV LIVLREDKGQ FTDISLSLAA
EKGKAGKYRI FKKVNYQKSN RGTVAGPIGS SYVDLMAKGV TQKFMDITFK GYEKVAGHEF
GKTVPGIFSD EPTIINEGKD CVRWTPDLFA HFKEKWGYDL RLQLPSLFEE TGNWKKVRHN
YYQTLLQLFI DRWSKPMFNY TEQHNLIWTG HYWEHGWPSP YHGPDNMTMY AWHQMPGIDM
LFNQYNEDKP VQFGNIRAVK ELGSVANQLG KKRTLSETYG GGGWELTFKD MKRLGDWEFV
LGVNFMNQHL SFMSITGARK YDYPPSFSYH EPWWPFYKTL NQYFSRLSLC LSTGQQKNDV
LVLEPTTSAW MYYFRGKENK RFFEIGKNFN DFVTTLQRAH VEYDLGCENI IKDQGKIESG
KFVIGQRAYS TVVIPPGMDN IDRPTFNLLK EYARQGGKVI LFEKLSCLDG DANDALDFFT
AAKSNVLLAV GAAEQQIIQN HLLPADIRIS AVGNNRIGGN LYYQRRQLAD GQLIFLSNAS
MEAASKGMVT VNGKDALMMD LFSGGIYDYP EKESSGKLSF KFDIPAAGSM MFFVADKKQT
GFKAPALNRA ETLVKTSQSK VLRPAENTLM IDFCDLELKK PDTAYRDLHV GIASNTAFKH
FGFKDGVGNP WNNRTQFKDR IVARDTFSVG TGYMATYRFE IAQNVNLKNF RAVVEQPGLW
NSVSVNGTKV KALKGKWWLD RSFGVFEIGN YLKPGKNSIS LAVSPMRVYA EIEPVYILGD
FNLVPAVKGW EITAAAPLTF GPWNKQGLPL YGQGISYVKE FQVQTMGKEY AVKLKKWKGT
VAAVKVNHVL AGIISSEPDE LNITPYLKKG LNHVEIEVIG SLKNLLGPHH NKPLPGLVDP
GKWYNVKTYP SGNDYQTYGY GLEEDFDIHE IIPR