Gene Phep_4145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4145 
Symbol 
ID8255280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5016240 
End bp5017799 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content44% 
IMG OID644937810 
Productprotein of unknown function DUF187 
Protein accessionYP_003094398 
Protein GI255534026 
COG category[S] Function unknown 
COG ID[COG1649] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.227733 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC TCATAATCTA TGCACTATTA ACAGTAATAA TAACCCCTAT TTCCCTAATA 
GCACAATCCC CATCAAAAAT TGCTCCAAAA AGAGAGTTTA GAGGAGTTTG GGTTGCAACT
GTTGCAAATA TAGACTGGCC GTCTAAGCCA GGTCTAAACA TTGATCAGCA AAAACAGGAA
CTCATAGGTC TGCTTGAACA ACATAAAGCA AACGGGATGA ATGCGATCAT TTTACAGGTC
AGACCTGCTG CTGATGCATT TTATTTAAAA TCAAGAGAGC CCTGGAGCCA ATGGTTGATG
GGTAAACAGG GCATGGCACC GGCCCCCGGT TATGACCCGC TTGCTTTTGC CATTAAAGAA
GCGCATTCCC GGGGTATGGA GCTCCATGCC TGGTTTAATC CTTACCGGGC AACGATGAGT
GCAAGTGCAG TAGTGAGCCC GGACCACATG ACCAGAAAAA GGCCGGACCT GTTTTTTGTA
TATGGCGGGA AAAAACAATT TGACCCTGGA ATCCCTGAGG TTAGGGAGTA CATTGTTCAG
GTAATTCTTG ATGTAGTAAA AGGCTATGAT GTAGACGGAA TTCATTTTGA CGACTATTTT
TATCCCTATA AGATTGCCGG ACAGAACATT AATGATGCTG CAACATTTAA TAAATACCCT
AATGGTTTTA GCAATATTGC CGACTGGCGC CGTAACAATG TGGATTTGCT GATCAAACAG
CTGGACGATA GCATTCATCA TTATAAAAAA TATGTGAAGT TTGGTGTGAG CCCATTTGGT
ATCTGGAAAA ACCTTTCTGA GGACAGTTTG GGCTCGGCTA CCAATGGACT ATCCAATTAC
GCAGAATTAT ATGCAGATTC CCGGAAATGG GTAAAAGAAG GCTGGGTTGA TTATATCAAC
CCCCAGGTTT ACTTTAGCTT TACCAGAAGG GCTGCGCCCT TTGCCACCAT CGCAGATTGG
TGGACAAACA ATGCTTTCGG AAGGCACGTT TATATTGGTC ATGGTGCTTA CCTGATCCAT
AATGGCAGTA CCAGAAAAGA AGCGGCCTGG GCTTTTCCTA ACCAGATCCC CAACCAGATC
AGGCATATAC GTGGGAGCAA TCTGATCCAG GGTAGTGTCT TTTTTAGTTC CAAGTCATTC
TCAACGGTTG CGCGGGCTCT TGGAGATTCA TTGAGAAACA ATTACTACAA GTATCCGGCA
CTCCCCCCTC AAATGCCCTG GCTTGATGAT GTCGTGCCGA ATCAACCGCT TAACCTTAAC
GCTGAAGCGC ATGCCGATGG GGTTCATCTG AAATGGGAAA GGCCGTTAAA AGCAAGCGAT
GGTGAAACAG CTTCGGGCTA TGTTATTTAT CGTTTTAATG AGGGAGAGAA AATCGATGTG
CTGGATGCGA GGAATATAAT GAGGATCAGC TTTGAAGATT TTCCCAGCTT TATAGATACT
AATGTAGAAC GGGGGAAAAG GTACAATTAC ATGGTTACTG CATTGGACAG GTTAAAAAAT
GAAAGTGAAC CCAGTGGTCC TGTTGGCGTG CAGACAAAAG AGCTGGCCAG TGTAGAATAA
 
Protein sequence
MFKLIIYALL TVIITPISLI AQSPSKIAPK REFRGVWVAT VANIDWPSKP GLNIDQQKQE 
LIGLLEQHKA NGMNAIILQV RPAADAFYLK SREPWSQWLM GKQGMAPAPG YDPLAFAIKE
AHSRGMELHA WFNPYRATMS ASAVVSPDHM TRKRPDLFFV YGGKKQFDPG IPEVREYIVQ
VILDVVKGYD VDGIHFDDYF YPYKIAGQNI NDAATFNKYP NGFSNIADWR RNNVDLLIKQ
LDDSIHHYKK YVKFGVSPFG IWKNLSEDSL GSATNGLSNY AELYADSRKW VKEGWVDYIN
PQVYFSFTRR AAPFATIADW WTNNAFGRHV YIGHGAYLIH NGSTRKEAAW AFPNQIPNQI
RHIRGSNLIQ GSVFFSSKSF STVARALGDS LRNNYYKYPA LPPQMPWLDD VVPNQPLNLN
AEAHADGVHL KWERPLKASD GETASGYVIY RFNEGEKIDV LDARNIMRIS FEDFPSFIDT
NVERGKRYNY MVTALDRLKN ESEPSGPVGV QTKELASVE