Gene Phep_4100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4100 
Symbol 
ID8255234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4942804 
End bp4944720 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content42% 
IMG OID644937764 
ProductRagB/SusD domain protein 
Protein accessionYP_003094353 
Protein GI255533981 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATG CTATAAAAAG GCAGTGGTGC TTAAAAGTTG CCATCCTTAT TGTTATTTCC 
GGATTAAGTT CCTGTAAAAA ATATCTGGAC ATTATCCCTG ATAACGTCGC AACCATAGAC
AATGCATTTG CGATGCGCTC AGAAGCGGAA AAATACCTCT ATACCTGTTA TTCTTACATG
CCAAAGGATG GTAGCCTAGA CCAGGACCCT TCAATTTTAG GTGGCGATGA AATATGGGCA
ATTGACCGTC CGCCAAAACC AAATTTCAAC CATGGAATTT TTGAAATTGC AATGGGGCGT
CAGGCTACGG TTAATCCTGT CGGAGAATTT ATTTGGGTAA ACCTCTATAA AGGCTTGCGC
GACTGTAACA TCTTCCTGGA AAATGTAGAG AGGGTACCCG ACCTTAAGCC AGAAGAAAAA
AGAGAATGGA TTGCTGAGGT CAAATTCCTT AAAGCTTATT ACCATTTTTA TCTGGTACGT
ATGTATGGGC CAATTCCGCT GATCAAAGAG AATTTGCCGG TTGATGTAAA CATCTCCTCT
GTGAAAATTA CCAGAGCACC TGTAGACAGC TGTTTTGCTT ATATCAAGCA ATTGCTCGAT
GAATCAAAAG ATGTATTACA GCCTACCATC AATGATCCGG TTAAAATGCT GGGCCGTATC
ACCAAACCTA TTGTGTATTC TTTAAAGGCT AAAGTTATGG TAACGGCTGC AAGCCCATTG
TTTAATGGAA ATGCCGATCA GGCCAGTTTA AAAAACGCTG ATGGGACACA ACTATTTAAC
CAGCAGGTGG TTCCTGCTAA ATGGGATTCG GCTGTAGTAG CCTGTAAACA GGCAATAGAC
GTATGCGAGG CAGCAGGGTT AAAACTGTAT GAGTTCAATG CGGCTTCATC AACCTTCGTG
TTGTCGGATC AGATGAAATT GCAACTCAGC TTAAGAAATG CTTTCGCAGA AAAATGGAAT
TCTGAAATTA TCTGGGCAAA TACCCAAAGC CTTTCAGCTA ACATCCAGTT AAATGCAACA
CCTAATGTCA ACCCATTATA CCAGGACAAT CCCTTCATTG GATACGAACT GGCCCCGCCG
TTAAAAATCG TAGACATGTT TTACACCGAG AATGGCGTAC CTACAACGGA AGATAGGACC
TGGAACCTGA ACCAGCCAGC TACAAGGGCA GGTACGGTTG AAGATCAGCG GCTCATTAAA
TTAAATTATG AAACCTCCTC TGTCAATTTC GACCGTGAGC CACGTTTTTA TGCCAATCTG
GGCTTTGATG GGGGCATTTG GTACGGACAG GGTTATTTTA ATGATGCAGT ACCTGCAAGT
ACATATTATG TAATGGCCAA AAAGGGGCAG CAAAATGGTA AAGGAAAACC AGATTATGGT
TCGGTAACTG GCTATTTTAT TAAAAAATAC GTGCATTATC AAAATACCCA GGGTAGTGCA
ATGACCGATT ATAGCGTCAA CAATTATCCG TGGCCGTTAA TCCGTTTGTC GCAATTATAT
CTGTTATATG CAGAAGCACT GAATGAGAAA AGCGGACCTG TTGCAGAAGT ACATACTTAT
ATCAATAAAG TACGTGCCCG GGCTGGTCTC AAATCAGTTA AAGAATCCTG GGATCTGTAT
GCAAATAATA CCAAGTACAC GACTCAGGCC GGGATGAAAG ATATTATCCA CAGGGAAACC
TTAATAGAGC TTGCTTTTGA AGGTGCCCGT TTCTGGGATC TTAGAAGATG GAAAGAAGCT
CCTCAGGAGT ATATCAAGCC GATTCAGGGA TGGGACATCG AGCAGTCTAC TGCAAATTTA
TACTACCGCA GAAAACTGGT GTTTACACCC AGATTTTCAA TGAAAGACTA TTTCTGGCCG
ATTCGTGATA ACAATATCCT GAACAATAAG AATTTAATCC AAAATATTGG TTGGTAA
 
Protein sequence
MKYAIKRQWC LKVAILIVIS GLSSCKKYLD IIPDNVATID NAFAMRSEAE KYLYTCYSYM 
PKDGSLDQDP SILGGDEIWA IDRPPKPNFN HGIFEIAMGR QATVNPVGEF IWVNLYKGLR
DCNIFLENVE RVPDLKPEEK REWIAEVKFL KAYYHFYLVR MYGPIPLIKE NLPVDVNISS
VKITRAPVDS CFAYIKQLLD ESKDVLQPTI NDPVKMLGRI TKPIVYSLKA KVMVTAASPL
FNGNADQASL KNADGTQLFN QQVVPAKWDS AVVACKQAID VCEAAGLKLY EFNAASSTFV
LSDQMKLQLS LRNAFAEKWN SEIIWANTQS LSANIQLNAT PNVNPLYQDN PFIGYELAPP
LKIVDMFYTE NGVPTTEDRT WNLNQPATRA GTVEDQRLIK LNYETSSVNF DREPRFYANL
GFDGGIWYGQ GYFNDAVPAS TYYVMAKKGQ QNGKGKPDYG SVTGYFIKKY VHYQNTQGSA
MTDYSVNNYP WPLIRLSQLY LLYAEALNEK SGPVAEVHTY INKVRARAGL KSVKESWDLY
ANNTKYTTQA GMKDIIHRET LIELAFEGAR FWDLRRWKEA PQEYIKPIQG WDIEQSTANL
YYRRKLVFTP RFSMKDYFWP IRDNNILNNK NLIQNIGW