Gene Phep_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0831 
Symbol 
ID8251920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp982100 
End bp983599 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content40% 
IMG OID644934481 
Producthypothetical protein 
Protein accessionYP_003091115 
Protein GI255530743 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATT TACATCTAAT TAACAGTTTT GTATATAATA AATTAAAACA TATAAAAATG 
AAAAAACAGT TACTAGCTTC TATTTTTGGC CTGCTTGTAT TCTTAACAAG CTGCTCAAAA
AGCACTGATG ATGATGGAGA TATAAATCCA CCTGCCACGG GTACGGTTGA GGTTTCAGGA
GATATTACGA CAAGCACAAC ATGGAGTGCA GATAAAATCT ACCTTTTAAA AGGAAATGTT
TTTGTAACCA ATAATGCTAC ATTAACCATT GAGCCAGGGA CAATTATCAA AGGTGATAAA
GGTACTAAAG GTGCTTTAAT CATTACCAGA GGCGCTAAAA TTATGGCTGT TGGTACAGTT
GAGAAACCAA TTGTATTTAC TTCAAGCATT ACTGCTGGTG CCCGTAAAGA AGGAGATTGG
GGCGGTGTAA TTTTATTGGG TAAAGCGCAA AATAATTTGG GTACCTCAGT GCCGATTGAA
GGTATTTCTG ATGCAACTGA TAAAGGTAAA CACGGTGGTA CCGATAATAC TGATAATTCG
GGTATAATGA AATATGTACG TATAGAATTT GCAGGTATTG CGCTAAGTCC GGATAACGAA
ATCAACGGTT TAACTTTTGG GTCTGTTGGG TCTGGTACTA CAATCGATTA TATCGAGGTG
TATCGTTCGG GTGATGATGC TTACGAATGG TTCGGTGGTG CCGTTAACTG TTCTCACTTA
TTGGCTATCG ACAGCTGGGA TGATGATTTT GATACAGATA ACGGTTTCTC TGGTAAAGTT
CAATTCGGGT TAGCACAGCG TTTAGCTGTA ACTGCCGACG TTTCTGGTTC AAACGGTTTT
GAATCAGATA ACAATGCCGC TGGTGATAAC GCTACACCAC AAACATCAGC TGTGTTTTCA
AATATGACCA TTTTAGGTCC GGTAGCCTCT GGTGGCAGCA GTATTAATGC TAACTTTCAG
CATGGCGCTC AAATCCGTCG CAACTCTGCA ATGAGCTTAT TCAACTCAGT AATTGTTGGT
TACACAGAAG GTGTGTTTTA TGATGATGGA TTGCCAACTA CACCTGTAGG TGGTGTTCTT
TTGAATTCCT CCCTTAACCT TACGGCGGGT AGATCTGTAT TTGCTAACAA CTTAGTTTAT
AACAGCAATA GTAAAAACAA TCAGATTAAA GCATCTAATG CAACAGCGCT AGGTGTAATT
ACTCCATTGT TAACAGTTGC GAATACATTT GATGCAAGTG CTACAGCAGA GAGTATCTTT
ATCAGTCCTT ATAAATATTC TGCAGATTTA GTTGCAGCCG CCAGAGTTGG TACACCAGAT
TTTACTGTAA AAACTGGCTC TGCCGCAGCT TCAGGTGCTG CATTTACCAA CGCAAAATTA
GCGTCAGGAT TTACATCTGT AGCTTACAGA GGTGCTTTTG GTACTGATAA CTGGGCTGCA
GGCTGGGCTC ATTTCGATCC GCAATCTTTA CCTTACACTA CGCCTGGTGC TGTAAAATAA
 
Protein sequence
MSDLHLINSF VYNKLKHIKM KKQLLASIFG LLVFLTSCSK STDDDGDINP PATGTVEVSG 
DITTSTTWSA DKIYLLKGNV FVTNNATLTI EPGTIIKGDK GTKGALIITR GAKIMAVGTV
EKPIVFTSSI TAGARKEGDW GGVILLGKAQ NNLGTSVPIE GISDATDKGK HGGTDNTDNS
GIMKYVRIEF AGIALSPDNE INGLTFGSVG SGTTIDYIEV YRSGDDAYEW FGGAVNCSHL
LAIDSWDDDF DTDNGFSGKV QFGLAQRLAV TADVSGSNGF ESDNNAAGDN ATPQTSAVFS
NMTILGPVAS GGSSINANFQ HGAQIRRNSA MSLFNSVIVG YTEGVFYDDG LPTTPVGGVL
LNSSLNLTAG RSVFANNLVY NSNSKNNQIK ASNATALGVI TPLLTVANTF DASATAESIF
ISPYKYSADL VAAARVGTPD FTVKTGSAAA SGAAFTNAKL ASGFTSVAYR GAFGTDNWAA
GWAHFDPQSL PYTTPGAVK