Gene Phep_4154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4154 
Symbol 
ID8255289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5026608 
End bp5027843 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content38% 
IMG OID644937819 
Productprotein of unknown function DUF214 
Protein accessionYP_003094407 
Protein GI255534035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00192452 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAATACAC CTTTCTACAT AGCTCGGCGG TATCTTTTTG CAAAAAAATC TACCAATGCC 
ATCAACATTA TCTCTACCAT TTCGGTAGCA GGGGTTTTTG TAGGTAGTGC AGCGCTGATC
ATTATCCTGT CGGTATTCAA TGGTTTTGAA GAGGTGGTGT TAAAAATGTT TAATACCATT
ACCCCGCAAA TAGTGATCTC TCCTTTAAAA GGGAAAACAT TTAATCCCAA TACGACTTAT
TTTAATGAAT TGCGAAAGGA TAAGGAAATC TATTCTTTTA CTGAAGTGCT TTCGGAAAAT
GCTTTATTAA GGTATAATGA TAAACAGTCT GTTGGGCTGG TAAAAGGGGT AAGTACAGAT
TACCTGAAAA ACAAGAGCCT GGACAGCATT ACCATTGACG GTAATTTTGT GCTCGAAAGC
AAGAGCCTGA ACTACGCGGT AATTGGTTCG GCCATTCAAA ATTTTTTGAT GGTAAATACA
ACAGATCCTT TTTATCCGCT GCAGATTTTT TCACCTAAAA AGAAGGCGAA CCAGGCCAGT
TCAATCAATC CGGCAGATGA TTTTACGATT CTTTCTATTC CGGTATCCGG AGTTTTTGAA
GTGCAGCAGG ATTTCGATAA TATGGCTATA GTCCCATTGA GGTTTGCCAG AAAACTGCTT
GAAGAACCGC TAAACATTTC AGCCATCGAG ATCAATCTGC ATAAAGGTGC GGATGCCGAT
CTGTTCAGGC AAAAGATAGA AGATCAGATC GGGGAAAACT TTGAAATAAA GGATAGGATA
CAACAGAATA AAGTTTTATA TAATATTCTG GGCAGTGAAA AATGGGCCGT ATACATCATT
CTCACCTTTA TATTGATCAT TGCCATATTC AATATCATTG GTTCATTAAC CATGCTGGTA
ATTGATAAGT TGAAAGACAT TGCCATATTG AGCAGTTTGG GGGCTGGTAA AAAGTTAATT
AAACGTATCT TTTTGCTGGA AGGGATGATG ATTTCCATGA CCGGTTGTAT ATGTGGTCTG
CTTATCGGAC TTGGCTTCTC TTTGCTTCAA CAGAAGTTCG GACTCATAAA AATGTCCCAG
GATAATCTCG TGATTACGAA TGCTTATCCT GTGGCCTTAA AATGGAAAGA TTTTCTGCTC
GTATTCTTTA CCGTGAGTAT TTTTTCTTTT ATGGCATCAG CTTTGTCATC TAATTTGAGT
GTAAAGAAGA TAAACCATTT AAATCAAGAT CTATAA
 
Protein sequence
MNTPFYIARR YLFAKKSTNA INIISTISVA GVFVGSAALI IILSVFNGFE EVVLKMFNTI 
TPQIVISPLK GKTFNPNTTY FNELRKDKEI YSFTEVLSEN ALLRYNDKQS VGLVKGVSTD
YLKNKSLDSI TIDGNFVLES KSLNYAVIGS AIQNFLMVNT TDPFYPLQIF SPKKKANQAS
SINPADDFTI LSIPVSGVFE VQQDFDNMAI VPLRFARKLL EEPLNISAIE INLHKGADAD
LFRQKIEDQI GENFEIKDRI QQNKVLYNIL GSEKWAVYII LTFILIIAIF NIIGSLTMLV
IDKLKDIAIL SSLGAGKKLI KRIFLLEGMM ISMTGCICGL LIGLGFSLLQ QKFGLIKMSQ
DNLVITNAYP VALKWKDFLL VFFTVSIFSF MASALSSNLS VKKINHLNQD L