Gene Phep_3979 details


Gene Information

Locus tag: Phep_3979
Symbol:
ID: 8255113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4797174
End bp: 4799330
Gene Length: 2157 bp
Protein Length: 718 aa
Translation table: 11
GC content: 44%
IMG OID: 644937643
Product: polysaccharide lyase family 8
Protein accession: YP_003094232
Protein GI: 255533860
COG category:
COG ID:
TIGRFAM ID:
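
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon (the gene sequence in this record does end in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4797174
end_bp = 4799330
gene_length_bp = 2157
protein_length_aa = 718


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Here 4799330 - 4797174 + 1 = 2157 bp, and 2157 / 3 = 719 codons, of which 718 encode amino acids and one is the stop codon.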


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGGA GACTTTTTAT AATTATCTTA CTTATACAAT TGGTGCTTAC AGGTGTTGCA 
TCGGTTTTTG GTCAGCATAA AAGCAGCCCT CAGGTATGGG CAAAAGATAT TAAAACCATC
ACAGAACGCC TTGCTGAAGA TTATCTGGCT GAAGGGGTTG ATTACCATAA AGTTGAAACG
GTAGTTTCAT CTTTAAAACA GGATGGCAGC TGGCCTGGTA TTGATTATGT TACCGTTTCA
AATGATTTCC CTGCCGGGCT TCACCTTAAA AAGCTGATGC TCATGGCCAA TGCGTATGCC
AGGCAAGGCA GTGGTTTTTA TCGCAACAAA GAACTAAGAC AAAAAATCCT GCTCGGGTAT
AATTACTATC TGGATAAAAA GCCAACATCA AAAAACTGGT GGTACAATGA TATAGGTGCC
CAGCAGGATT ATATGGCAGG CTTGATCTTG ATGAAGGGCC AGATTGCAAA TGCAGACCTT
ATCCGTTATG CTTCTTATCT AAAGGACCTT ACAGACAATC CTGCCCACAG GGGTATGAAC
AGGATATGGG TGTCGGCCAT TACCATTGCC AAAGGATGCC TGGAAAACGA CTATTTATTA
GTCGGCAAAG GCTTCCAAAG TGTAGCTGCT ACCCTGGTAA TCGCAAACGA ACAAGGCATA
GAGGGCATAA AAACCGACAA TAGTTTTCAC CAGCACCGGC CACAATTGTA TTCAGGTGGA
TATGGGATGG GCTTTGCAGA AAGTACAGCA AAGTTAATGG CCCTTTCTGC AAATACCTCT
TTTCATGCTA CCTTCTCTGC CGAGAAAAAA AGAATATTTT CAGACCTGCT CTTGTACGGA
CACCAGCTTT TCAGTTATCG TGGGGCGGTC GACTTTGGAA CTATTGGCAG AAATATCGCC
CGTCCAAATT CAATAAGTCC CATCAGTCTG GTAACATTAG ATCAGATGAT GGTCATCGAC
AGCCAGCGGA AGTCGGCATT CCGGGATTGG AAATCCCATA TACAAGGAGC GGCTTTCCCC
AAACCTTTTC TGGGGAGCAG GTATTTCTGG AAGTCAGACA TCATGACCTA CCATGGAGAA
GACTACTATC TGTCGGCAAA AGTAATCTCT ACCCGTACCA ACGGAACGGA AATGTTAAAC
GCTGAGAACC TGAAAGGATA TAACCTTCCA TTGGGGGCAA CCAATATCAT GAAGACAGGA
GGAGAATATA AAAATATCTT TCCCATATGG GACTGGACAC GTGTTCCCGG CACTACTGCA
GTGATGAACC AATCGGCTAC AGTACTGCCC TGGTATTTAT TTGGAACAAA TGAATTTGCA
GGAGGGATCA GCAATGGCGA AGCAGGCGTA ATTGCTTATG AGCACAGCTA TAATGGCGTA
CAGGCAAAAA AGGCTTATTT TCTTGTGGAT GGCTCTATGC TTTGCCTGGG TGCAGGTATC
AATGCAATCA GAACACAGCA GGTGGTCACC TCAGTTAATC AATGTTATCA GGATGGCGAG
GTGGTTACTG GTGGAAAAGG CAATGTAGAA GGAACCCGCT TTACAGATAG CCTGAGCACC
TCTAAAGGCG TGGAACTTCA TTGGGTTTAC CATGCCGGTG TGGGCTATAT CTTTCCTGCC
GGAGGAAACC TTACCCTGAA GAACGCTGTG CAGACTGGCT CCTGGAAATC CATCAATCAA
AGTGGAAGTG AGGAGCTCAT CAGCAAACCG GTTTTTAGTC TGTGGCTTAA CCATGGCACA
GCTCCATCCG AAGACAGTTA CTGTTATATC GTGCGCCCGG AGCATTCTTT AGAAAGCTTT
AAAAGTCATG TTCATGCAAA TGGCTTTATA ATCCTTAAAA ATGACAAGAA TATACAGGCC
GTAAAATATG GACATAGCTA TTTTGTTGTA TTTTACAAAC CGGGAGCAGT TGATCTCGGA
AAAAATCTGC AGGTTTCTTC AGATTCAAAA GCAGTATTCA TGATGGAAGA GAAAGAGGAT
GGTTATCAGT TGTCGCTTGC AGATCCTACC CATCAGCAAA AAGAAGCGAA CTTATCCTTC
AGCTTATTAA AAGATGCAGC TACCCCCTCT CAGGAAAATG GTACAACAAG ACTCAACTTT
ATATTTCCAC AGGGTGACGA TCAGGGTAGT GCCGTAAACA GATTTTATAA AAAATAA
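
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below shows one way to do that. The helper name `gc_percent` is hypothetical, and the rounding used by the source database is not stated, so the result for the full sequence may differ slightly from the reported value.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a DNA string.

    Whitespace is removed first, so the 10-base blocks and line breaks
    used in the sequence listing can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first ten bases of the gene sequence.
print(gc_percent("ATGGCAAGGA"))  # 50.0
```

Pasting the complete gene sequence into `gc_percent` gives the value for the whole coding region.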
 
Protein sequence
MARRLFIIIL LIQLVLTGVA SVFGQHKSSP QVWAKDIKTI TERLAEDYLA EGVDYHKVET 
VVSSLKQDGS WPGIDYVTVS NDFPAGLHLK KLMLMANAYA RQGSGFYRNK ELRQKILLGY
NYYLDKKPTS KNWWYNDIGA QQDYMAGLIL MKGQIANADL IRYASYLKDL TDNPAHRGMN
RIWVSAITIA KGCLENDYLL VGKGFQSVAA TLVIANEQGI EGIKTDNSFH QHRPQLYSGG
YGMGFAESTA KLMALSANTS FHATFSAEKK RIFSDLLLYG HQLFSYRGAV DFGTIGRNIA
RPNSISPISL VTLDQMMVID SQRKSAFRDW KSHIQGAAFP KPFLGSRYFW KSDIMTYHGE
DYYLSAKVIS TRTNGTEMLN AENLKGYNLP LGATNIMKTG GEYKNIFPIW DWTRVPGTTA
VMNQSATVLP WYLFGTNEFA GGISNGEAGV IAYEHSYNGV QAKKAYFLVD GSMLCLGAGI
NAIRTQQVVT SVNQCYQDGE VVTGGKGNVE GTRFTDSLST SKGVELHWVY HAGVGYIFPA
GGNLTLKNAV QTGSWKSINQ SGSEELISKP VFSLWLNHGT APSEDSYCYI VRPEHSLESF
KSHVHANGFI ILKNDKNIQA VKYGHSYFVV FYKPGAVDLG KNLQVSSDSK AVFMMEEKED
GYQLSLADPT HQQKEANLSF SLLKDAATPS QENGTTRLNF IFPQGDDQGS AVNRFYKK
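
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation is given below. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Because this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGGCAAGGA GACTTTTTAT AATTATCTTA CTTATACAAT TGGTGCTTAC AGGTGTTGCA"
print(translate(first_row))  # MARRLFIIILLIQLVLTGVA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` yields the protein up to the terminal stop codon.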