Gene Phep_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3009 
Symbol 
ID8254121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3601473 
End bp3603056 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content41% 
IMG OID644936658 
Productpeptidase S41 
Protein accessionYP_003093269 
Protein GI255532897 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.237965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTTTA TCATGATGGC CCTTGGGGCA TGTAAAAAAT CTAAAGTAAC ACCCGAACCA 
GATCCGCCAG CTTCGGGCTC AAATGTAAAG CAAACGCCAA CTACCAACAG AACAGAATTA
ACCAATGATT CTTTGTTTTT ATATGCCCAG CAAATCTATT ACTGGAATAC TGCGTTGCCA
TCTTATGATG ATTATGTGCC CCGTCAGTAT AACACCGCAA GTACCGACCT CATTAACTAT
GAGAACAACT TGTTTAATAT CGTAAAATCT TCCGGATCCG CAGATTACAT CGCAGGAAAC
TCAGATCCAA AATACTCCTA TATTGAAGAC ATTACGACCA GAAATCCTGC TGCCGTATCG
GCTGTACCCA ATTCAAGGCT GTCTGTAGAC CTTGATGGCA ATGGAAACGA TACCGGGGTA
ATGTGGATTC CATTTGGTAC CAATAACAGC TATACTATTT TTGTTACTGT TGTATATCCT
GGTTCGGATG CAGAAGCAAA AGGTGTAAAA AGAGGTTGGG CAATCACCAA AATTAACGGA
CGGTCATTCG GTACAAATTA TAACGGTGAA TACAATGCAA TTTATGATGA GTTTGCCAAA
AGTTCAGTTA CAATAGAAGG GTATAAATTT TCCAATGATG TTCAGGGAGA AGCGTTTAAC
CTTACCCTGA CTAAAAAATC GTATAGCAGC AGTCCGGTAT ATGTGTCTAA AGTAATTACT
TCGGGAAGTA AAAAGATCGG TTATCTGGCT TATGGCCGTT TTTCAAATGA AAACAATTCC
TTTACTGCTT TAGGTAATGT ATTCAGCAGC TTTGCTGCCC AGAATGTAAC CGATCTGGTT
GTAGATTTAA GGTATAATGG AGGCGGTTAT ATCAGTACTG CCGAACGTTT GATTAACCTG
ATCGCGCCTA CGACAGCTAC TGGGGTAATG TATAAGGAAT ATTATAATGC AACACTGAGA
GATCAGAAAG CAACCATAAT GAAAAATCAG CCATTACTGG ATGATAATGA TAAGGTTCGT
TACAAAAACG GCCGGATGAT GAATTATTTT GATGATATAG ATTATACGGT AGCCGGAAAT
ACATATTCAT TTAGCAAAAT TGGCAACCTG ACAGGGGTAA GCAATATTGT TTTTCTGGTT
TCCCGCAATA CAGCATCAGC CAGCGAGCTG GTGATCAATT CTTTAAAACC TAAAATGAAC
GTGAAGCTGG TGGGCCAAAC CACTTATGGA AAGCCAATCG GTTTTTTCCC TGTCAGACTT
CAAAACAGGT ATGACGTTTT TTATTCCTTG TTTGAGACTA AAAATTCGGC CGATCAGGGC
GGATATTTTT CAGGTATGGT GCCGGACGCT GCTTTGTCTG AAAACCCAAC TTACCAGTTG
GGTGATGAAA AAGAGGCTTA CCTGGCCAAG GCAATTGGTC TGTTAAATTC TGCGGCAATT
ACAAGTAGTA ATACAGTGGG TTCTAAAGCC GTCATGAGTG TAGGTAGCAA AACCATTGCT
CTTGACAATC AGAATTTCAA TGCACCGGTA GGGGACGGGG CAAGCTTTGT AGGGATGATA
GAAAACCGCC ATAAAGGAAG GTGA
 
Protein sequence
MLFIMMALGA CKKSKVTPEP DPPASGSNVK QTPTTNRTEL TNDSLFLYAQ QIYYWNTALP 
SYDDYVPRQY NTASTDLINY ENNLFNIVKS SGSADYIAGN SDPKYSYIED ITTRNPAAVS
AVPNSRLSVD LDGNGNDTGV MWIPFGTNNS YTIFVTVVYP GSDAEAKGVK RGWAITKING
RSFGTNYNGE YNAIYDEFAK SSVTIEGYKF SNDVQGEAFN LTLTKKSYSS SPVYVSKVIT
SGSKKIGYLA YGRFSNENNS FTALGNVFSS FAAQNVTDLV VDLRYNGGGY ISTAERLINL
IAPTTATGVM YKEYYNATLR DQKATIMKNQ PLLDDNDKVR YKNGRMMNYF DDIDYTVAGN
TYSFSKIGNL TGVSNIVFLV SRNTASASEL VINSLKPKMN VKLVGQTTYG KPIGFFPVRL
QNRYDVFYSL FETKNSADQG GYFSGMVPDA ALSENPTYQL GDEKEAYLAK AIGLLNSAAI
TSSNTVGSKA VMSVGSKTIA LDNQNFNAPV GDGASFVGMI ENRHKGR