Gene Phep_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3402 
Symbol 
ID8254521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4048409 
End bp4050613 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content44% 
IMG OID644937054 
ProductAlpha-N-acetylglucosaminidase 
Protein accessionYP_003093658 
Protein GI255533286 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0013857 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATAG GGACAGGAAT CAGCCTCATT GCGCAAACCA CTCAAAATGT ATTAAATAAA 
GAAGCAGCCT ATGAACTGAT CAAAAGAATC CTTCCTGCCT ACGCCCATAA ATTTGAAGTG
GCCTACGTTC CTAAAGAAAA CGACAGCGAT GTGTTTGAGT TGGAAAGCAA AGCTGGTAAA
ATCGTCCTTC GTGGAAACAA TGGCGTTGCT GTTGCCAGTG CGCTCAACTA TTGGCTCAAG
AATTATGCCC ATTGCGAAAT CACCTGGAAT GGCACCAACC TCAATATTCC AAAACCTTTT
CCAATGGTCA GCAAAAAAAT AAGAAAGGTG ACGCCCTATG AATACCGTCA TTATTTTAAC
TATTGCACAT TTAACTATAC AGCGACCTGG TGGGACTGGG AACGTTGGCA ATGGGAAATT
GATTTCATGG CTTTAAATGG TGTGAATATG CCATTAGCGC TGACCGGTCA AAATTCAATT
TGGGATAAAG TATATCGTAG CATGGGCTTC AACGATAAAG ATATGGATGC TTTTTTTAGC
GGACCCGCGT ATACCAACTG GTTTTGGATG GGCAATCTGG ATGCCTGGGG AGGCCCGATG
TCTAAAAACT TTATGGCTAA GCAGGAAGCT CTCCAGAAGA AGATTCTGGC CAGGGAGCGT
GCGCTGGGAA TGACACCCAT ACTGCCTTCT TTTACCGGGC ATGTGCCTCC TTCTTTTAAA
GATAAATTTC CGGATATAAA AGTGAACACC CAGCAATGGG GCATTAACGT GTCGCCAGCT
TATGTACTCA ACCCAGAAAC ACCGATGTTT AAAGAAATCG GCAGAAAATT CCTCACAGCG
TTAATCAACA CTTTCGGAAC GGATCATTTG TATTCCGCAG ATACCTTTAA TGAAATGACC
CCGGTAAGTA ACGATTCTAC TTACCTCAAC GGAATGGCCA AAAAGATTTA TGAATCTATG
GCCGCTGTAG ACACCCAGGC GGTATGGATC ATGCAAGGCT GGATGTTTTT AGACCGGCCC
AACTTCTGGC AGCCTACTCA AATGAAAGCC TTGTTTAGTG CCGTACCACA GGATAAATTG
ATTGTACTGG ATTTGAATAG TGAATTAAAT CCGGTATGGA GCAGAACAGA TGCTTTTTAC
GGCGAAAAGT GGATCTGGTG CATGCTGCAC AATTTCGGCG GGCGCCTCAG CATGTTCGGC
GATATGTCCA GAATCGGGAA TGATCCGGCA GCTGCATTAA AAAACGACCA AAGAGGTAAA
ATGTCCGGGA TTGGGTTGAC CATGGAGGGC ATTGAACAAA ACCCAGCTAT CTATTCGCTA
ATGCTGGAGC ACATATGGAA CGATAAACCA ATTGATTTAG ACAATTGGTT AAAGGGTTAT
GCGCAACGCC GTTATGGGAA GAGGAACAGC AATGCTGAGA AAGCCTGGGA AGTATTAAAA
AATACGGTTT ACAGTCACCA GCCGTGGTGG GGTACCAATA CCATCATTAC GGGCAGACCA
ACCTTCGACG CAGCAACAGT CTGGACCTAT ACAGCTATTC CATATTCCAG TAAGGAACTG
ATGAAGGCCT GGTCGTATCT GCTGACTGCA TCAGACGAAT TAAAATCAAG CGATGGATTT
CAGTATGACC TGGTAGATGT TACACGACAG GTCCTTGCCA ATTATGCCAA CGTACTGCAA
CAGGATTTTG CCAGCTCGTA TAAACAAAAG GATATGGCCA CCTTCAACAA AAAGAGCGCT
CAGTTCTTGG AATTGATCGA CGATATCGAT CAGTTGCTGG GTACCAGATC AGACTTTCTG
TTAGGTAAAT GGATCAACAA CGCCAAAGCG TTGGGCGATA ATCCGGCAGA GAAAAAATTA
TTCGAACGCA ATGCCCGCGA CCTGATCACT TTGTGGTTAG ACAAAGATTG TAATATTCAT
GAATACGCCT GTAAAGAATG GGCCGGTATG ATGAAAGGCT TTTATAAACC ACGCTGGCAG
CAATTTTTTG ACGAAGTACG GCTGCAGGCC AGTGCTGGAA AAGAAATTGA TCAGATTAAG
TTTGAAAATA CCATAAAAGA CTGGGAATGG AAATGGGTAA ATGCAAATGA AGCTTATACC
GATAAACCTA CAGGAAACCC GGTTACAGTG GCTAAAGCGT TATACGCCAA GTATAATCAC
AAAATGAACA ATGCATTTCC AACTGTCTAC ACTAACACTA AATAA
 
Protein sequence
MLIGTGISLI AQTTQNVLNK EAAYELIKRI LPAYAHKFEV AYVPKENDSD VFELESKAGK 
IVLRGNNGVA VASALNYWLK NYAHCEITWN GTNLNIPKPF PMVSKKIRKV TPYEYRHYFN
YCTFNYTATW WDWERWQWEI DFMALNGVNM PLALTGQNSI WDKVYRSMGF NDKDMDAFFS
GPAYTNWFWM GNLDAWGGPM SKNFMAKQEA LQKKILARER ALGMTPILPS FTGHVPPSFK
DKFPDIKVNT QQWGINVSPA YVLNPETPMF KEIGRKFLTA LINTFGTDHL YSADTFNEMT
PVSNDSTYLN GMAKKIYESM AAVDTQAVWI MQGWMFLDRP NFWQPTQMKA LFSAVPQDKL
IVLDLNSELN PVWSRTDAFY GEKWIWCMLH NFGGRLSMFG DMSRIGNDPA AALKNDQRGK
MSGIGLTMEG IEQNPAIYSL MLEHIWNDKP IDLDNWLKGY AQRRYGKRNS NAEKAWEVLK
NTVYSHQPWW GTNTIITGRP TFDAATVWTY TAIPYSSKEL MKAWSYLLTA SDELKSSDGF
QYDLVDVTRQ VLANYANVLQ QDFASSYKQK DMATFNKKSA QFLELIDDID QLLGTRSDFL
LGKWINNAKA LGDNPAEKKL FERNARDLIT LWLDKDCNIH EYACKEWAGM MKGFYKPRWQ
QFFDEVRLQA SAGKEIDQIK FENTIKDWEW KWVNANEAYT DKPTGNPVTV AKALYAKYNH
KMNNAFPTVY TNTK