Gene Phep_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1900 
Symbol 
ID8253004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2194384 
End bp2195625 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content40% 
IMG OID644935551 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_003092170 
Protein GI255531798 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.274558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA ACTACAAATT ACGCTTAGTT GTTGCAGTTT TAATGTGCGT GGCTACATCC 
TGCAAAAAGG GCAACAAAAC GCCAGACACC GGAAAAGATT TAATATTGAC CGAAAGAGAG
CAGCAAAAGG CAGAAGCCGA TAATGCCTTT ACCTTAAAGC TGTTTAAGGA AATTATAGCC
AAACCTTTGG CGGGTAAAAA CCTTATGTTG TCGCCTTTAA GTGTAAGTAT AGCATTGGGG
ATGACCAGTA ACGGAAGCAG CGGCACAACT TTGGAAGCCA TAAGAAATAC CATGGAATTT
AAGGATTTTA CTGAAGCTGA AATAAACAGC TATTACCATA AAATAGCAAC AGAATTACCC
CAGTTAGATC CTAAGGCTTC ATTAAAAATT GCCAATTCCA TCTGGTACAG AAATACATTT
ACCACATTGC CTGCTTTTCT TAATGTGAAC AGGGACAATT ACAATGCTGC AGTTGAGGGC
CTGGACTTTG CCAATCCTGC TGCAAAAGAC AAAATTAACA ATTGGGTAAA TAACAGCACA
AATGGAAAAA TCCCAACTAT AATTGATGCA ATTGGCAGCG ATATGGTCAT GTACCTGATC
AATGCCGTTT ATTTTAAAAG CGACTGGAAG TATAAATTCG ATAAGGATAA AACTGCAAAA
TCGGATTTTA ACCTGGACGC CAATAATAAG GTACAAACAG ATTTTATGGT TGCAAAGGCA
ACTGTTAATC ACTTACGCTC AGAGGAGGCC TACATTTATG AACTTCCGTA TGGGAACGAA
AAATACAGCA TGGTCATTGC ATTACCTGCT ACCAATACCA ATATTGCTGA ATTTGTCGCC
TCAGTTAGTC CGGCAAAATG GAAAGGGTGG ATGGCAGGCC TGCAGAAAAC CGGTGTTGAA
ATTAAAATGC CCAGGTTTAA ATTTAGCTAC AGCAGCATAT TAAACGATCA GCTAACCAAT
CTGGGTATGG GAATTGCCTT TGGTAAAACC GGGGCAGCAG ATTTCAGCAG AATGAGTGCT
GCAGGTTTAC AAATAAATGA GGTGAAGCAT AAAACCTTTG TTGAAGTAAA TGAAAGCGGT
ACAGAAGCTG CCGCTGTAAC CTCTGTTGGG ATGGAGCTTA CTTCTGTACA AGAGCCGGCC
CCGGTTCTTA TTAACCGCCC TTTTGTATTT GTGATCCGCG AGATGAAAAC CGGGCTGATT
TTATTTACCG GTATCGTAAA TAACCCCTTA TTGGATAATT AA
 
Protein sequence
MKKNYKLRLV VAVLMCVATS CKKGNKTPDT GKDLILTERE QQKAEADNAF TLKLFKEIIA 
KPLAGKNLML SPLSVSIALG MTSNGSSGTT LEAIRNTMEF KDFTEAEINS YYHKIATELP
QLDPKASLKI ANSIWYRNTF TTLPAFLNVN RDNYNAAVEG LDFANPAAKD KINNWVNNST
NGKIPTIIDA IGSDMVMYLI NAVYFKSDWK YKFDKDKTAK SDFNLDANNK VQTDFMVAKA
TVNHLRSEEA YIYELPYGNE KYSMVIALPA TNTNIAEFVA SVSPAKWKGW MAGLQKTGVE
IKMPRFKFSY SSILNDQLTN LGMGIAFGKT GAADFSRMSA AGLQINEVKH KTFVEVNESG
TEAAAVTSVG MELTSVQEPA PVLINRPFVF VIREMKTGLI LFTGIVNNPL LDN