Gene Phep_3469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3469 
Symbol 
ID8254589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4120351 
End bp4122687 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content43% 
IMG OID644937121 
Productputative aminopeptidase 
Protein accessionYP_003093724 
Protein GI255533352 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0337605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA TACTGCTGTT TTTTATTGTG CTTTTATGTA GCGCCAAAGC TTTCGGTCAG 
CATATACTTC ATAACCCCGG TTCTAATCAC GGGAACAAAT TTGAACAGCT TGGCACCATC
CTTTCCGGCC CTAACTCCTA CCGTTCGGCT TCCGGTGCAC CTGGCCCAGC ATACTGGCAA
CAGCGTGCTG ATTATGAAAT TAATGCAGAA CTGGATGAAA AGAACCTGCG TTTAAGCGGC
TCAGAAACCA TTACTTATTA CAACAACTCT CCCGACCCAC TAAGTTATTT ATGGGTACAG
CTGGATGAAA ATGAGCATAA GGCCAGCAGC GATAATAAGC TGACAGAACA GAGCAGGATG
ACAGATCAGA TGGGTTATAA GGCCTTATCA GATATCATTA ACCCTGAAAA TGACCTGGGC
GTTAAAATAC TAAAGGTTAC AGATGAAAAG GGGACTGCCC TGCCTTATGT CATCAACAAT
ACCATGATGC GTATAGACCT GCCCTATGTA TTACAGTCCA AACAAAAATA TAAACTTAAA
ATAACCTGGA ATTACAAAAT CATCAACCGC GTAATAGATG GAGGAAGAGG TGGTTATGAA
TATTTTGCTG AGGACGACAA CTATCTGTTT ACCATTGCCC AGTGGTTTCC ACGCATGGCT
GTTTATTCTG ATTTCCAGGG CTGGCAAAAT AAACAGTTTG AAGGTAGGGG CGAATTTGCA
CTTGTATTTG GCAATTATAA GGTAAACATG ACTGTACCGG CCGATCATGT GGTAGGTGCT
ACAGGCGAAT GTCAGAATTA TGCCCAGGTG CTGAGTGCTG CCAGTTTAAA ACGATGGAAT
GCCGCACAAT CTACAAAAGT ACCTATTGAA ATTGTAAACC TGGCCGAAGC AAAATCAGCC
ATCACCAAAA AATCGACTGC TAAAAAAACC TGGACCTACT TTGCTGAAAA TGTAAGGGAT
TTTGCACTGG TATCGTCAAG GCGATTGGTT TGGGATGCCA TGGCAACCAG TATAGAAGGT
AAAAAGATAA TGGCTATGTC TTACTATTGC CCGGAAGCTT ATTCACTTTA CAGCAGGTAT
TCGACCAAAG TGGTAGCACA TACTTTAAAG ATCTATTCCC AGCACACTAT CCCCTATCCT
TATCCCGTAG CCATTTCTGT AGAGGCTGCC AACGGAATGG AATACCCGAT GATCTGTTTT
AATTTCGGAC GTACAGAAAA AGACGGCACT TACACGGAGG CCATTAAATA TGGTATGATC
GGTGTAATTA TACATGAAGT TGGGCATAAC TTTTTCCCGA TGATAGTGAA TTCGGACGAG
CGACAATGGA CCTGGATGGA CGAAGGTTTA AATACTTTTT GCCAGTACAT GGCCCAGCAG
GCATGGGACA TTAACTACCC TTCGCAGCGC GGCCCTGCAC ATAAAATAGT GGATTACATG
AAATTGCCTA AAGACCAGCT GGAGCCCATT ATGACCAACT CTGAAAACAT CGTTAATTTT
GGACCAAATG CCTACGCAAA ACCAGCTACG GCACTGAATA TTTTAAGGGA AACCGTTATG
GGCCGTGAGC TTTTCGATTA TGCTTTTAAA GAATATGCAA AACGATGGGC CTTTAAACAT
CCTACCCCTG CAGATCTGTT CAGGACGATG GAAGACGCAT CGGCGGTGGA CCTTGATTGG
TTTTGGAGAG GCTGGTTTTT TAGTACTGAC CCGGTAGACA TTTCACTGGA TGATGTACGC
TACTACCGTA TGAACAGTAT GAATGCGGCT ATTGAAAATG TAGAACTTAA AAAGGCTTAT
GACAAGGACC TTTACAATAT CAGCAGGGAA CGCAACCGCA AAGAAGGCAT AAAATTCGCT
ATAGAACAAG ATACAAGCCT ACAGGATTTT TACAATAAAT TTAACCGCTT TGAGGTGAGT
AAGTCTGCCG ATCAAGAATT TCAGCAATAC TTCAGCAACC TTTCCGCAGC AGAAAAAAAA
CTTTACGAAA GCAAGAAGAA CTTTTATGAA CTGGATTTCT CGAATATAGG GGGACTGGTG
ATGCCCATCA TTATTGAATG GACCTTTAAG GATGGCAGCA AGGAGGTGGA CCGCATTCCT
GCCTATATCT GGAGAAAAAA TGAAAATAAA GTAACCAAAG TTTTTGCGAA AGATAAAGAA
GTAATTGCTG TACAACTGGA TCCATACCGC GAAACTGCCG ATATAGATGA AAGCAATAAC
TCATGGCCAA GAAAAAATCA GCCAACCAGA TTTGAACTGT TTAAACAACA GCAGGCACCC
CGTGGGTCAT CTACTGAACA GACCAACCCA ATGCAGCAAT CCAGACAAAG GCAATAG
 
Protein sequence
MNKILLFFIV LLCSAKAFGQ HILHNPGSNH GNKFEQLGTI LSGPNSYRSA SGAPGPAYWQ 
QRADYEINAE LDEKNLRLSG SETITYYNNS PDPLSYLWVQ LDENEHKASS DNKLTEQSRM
TDQMGYKALS DIINPENDLG VKILKVTDEK GTALPYVINN TMMRIDLPYV LQSKQKYKLK
ITWNYKIINR VIDGGRGGYE YFAEDDNYLF TIAQWFPRMA VYSDFQGWQN KQFEGRGEFA
LVFGNYKVNM TVPADHVVGA TGECQNYAQV LSAASLKRWN AAQSTKVPIE IVNLAEAKSA
ITKKSTAKKT WTYFAENVRD FALVSSRRLV WDAMATSIEG KKIMAMSYYC PEAYSLYSRY
STKVVAHTLK IYSQHTIPYP YPVAISVEAA NGMEYPMICF NFGRTEKDGT YTEAIKYGMI
GVIIHEVGHN FFPMIVNSDE RQWTWMDEGL NTFCQYMAQQ AWDINYPSQR GPAHKIVDYM
KLPKDQLEPI MTNSENIVNF GPNAYAKPAT ALNILRETVM GRELFDYAFK EYAKRWAFKH
PTPADLFRTM EDASAVDLDW FWRGWFFSTD PVDISLDDVR YYRMNSMNAA IENVELKKAY
DKDLYNISRE RNRKEGIKFA IEQDTSLQDF YNKFNRFEVS KSADQEFQQY FSNLSAAEKK
LYESKKNFYE LDFSNIGGLV MPIIIEWTFK DGSKEVDRIP AYIWRKNENK VTKVFAKDKE
VIAVQLDPYR ETADIDESNN SWPRKNQPTR FELFKQQQAP RGSSTEQTNP MQQSRQRQ