Gene Phep_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0789 
Symbol 
ID8251878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp935300 
End bp936820 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content44% 
IMG OID644934439 
Producthypothetical protein 
Protein accessionYP_003091073 
Protein GI255530701 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGC TGAATAAACT AGCCGGATAC TTATTGCCGA TCATGGTGCT GCTGAATGTG 
GCACCATGCT TAGGTCAGGT TGTTGCTTCA AATGAAACTT TATACCAGGT TGTAAAGGAG
GTAAAACCCG GTGGTCTGGT ACAGATTGCC GATGGGACTT ATAAAGATGT TCAGCTGATT
GTCAGCAATT CAGGAAAATC TGGTTTGCCC ATCACTATTA AAGCCCTGAA CCCGGGTAAG
GTTTTTTTTA CCGGAGATGC TAAAGTAGAG CTGAGGGGCG AGCACCTGAT ACTGGAAGGC
ATCTGGTTTA AAGACGGGAA CAGAGCTATT CAGGCATGGA AATCACATGG ACCCGGATTG
GTGGCTATAT ATGGTAGCTA TAACCGCATT ACCGCATGTG TATTTGATTG TTTTGATGAA
GCCAATTCTG CTTACATTAC TACTTCGCTT ACCGAAGACG GAAAGGTACC TCAACATTGC
CGCATAGACC ATTGCAGTTT TACCGATAAG ATCACTTTTG ACCAGGTAAT TAACCTGAAC
AATACAGCCA GAGCTATTAA AGACGGTTCG GTGGGAGGAC CGGCGATGTA CCATCGTGTT
GATCACTGTT TTTTTTCCAA TCCGCAAAAA CCGGGTAATG CCGGAGGGGG AATCAGGATT
GGCTATTACC GTAATGATAT AGGCCGTTGT CTGGTAGACT CTAACCTGTT TATGCGTCAG
GATTCGGAAG CAGAGATCAT CACCAGCAAA TCGCAGGAAA ATGTTTATTA TGGTAATACT
TACCTGAATT GCCAGGGCAC CATGAACTTT CGTCACGGTG ATCATCAGGT GGCCATTAAC
AATTTTTATA TAGGCAATGA CCAGCGATTT GGATACGGGG GAATGTTTGT TTGGGGAAGC
AGGCATGTCA TAGCCTGTAA TTATTTTGAG CTGTCCGAAA CCATAAAGTC GAGGGGGAAC
GCCGCATTGT ATTTAAACCC CGGTGCTATG GCTTCGGAGC ATGCTCTTGC TTTCGATATG
TTGATAGCCA ACAACGCTTT CATCAATGTA AATGGGTATG CCATCCATTT TAATCCATTG
GATGAGCGCA GAAAAGAATA TTGTGCAGCC AATAGGCTTA AGTTCGAAAC CCCGCACCAG
CTAATGTTAA AAGGCAATCT TTTCTTTAAG GATAAACCTT ATGTTTACCC ATTTTTTAAA
GATGATTATT TTATAGCAGG GAAAAATAGC TGGACTGGTA ATGTAGCCTT AGGTGTGGAA
AAGGGAATCC CTGTTAACAT TTCGGCCAAT AGGTCTGCCT ATAAGCCGGT AAAAATTAAA
GATATCCAGC CCATAGAAGG AATCGCTCTT GATCTCAATG CGCTGATCAG CAAAGGCATT
ACAGGAAAGC CCCTTAGCTG GGATGAAGTA AGGCCCTACT GGTTAAAAGA AATGCCCGGG
ACGTATGCTT TAACGGCCAG GCTTTCTGCA GATAGGGCTG CAAAGTTTAA AGCCGTAATT
AAAAGAAATA AAGAGCACTG A
 
Protein sequence
MKMLNKLAGY LLPIMVLLNV APCLGQVVAS NETLYQVVKE VKPGGLVQIA DGTYKDVQLI 
VSNSGKSGLP ITIKALNPGK VFFTGDAKVE LRGEHLILEG IWFKDGNRAI QAWKSHGPGL
VAIYGSYNRI TACVFDCFDE ANSAYITTSL TEDGKVPQHC RIDHCSFTDK ITFDQVINLN
NTARAIKDGS VGGPAMYHRV DHCFFSNPQK PGNAGGGIRI GYYRNDIGRC LVDSNLFMRQ
DSEAEIITSK SQENVYYGNT YLNCQGTMNF RHGDHQVAIN NFYIGNDQRF GYGGMFVWGS
RHVIACNYFE LSETIKSRGN AALYLNPGAM ASEHALAFDM LIANNAFINV NGYAIHFNPL
DERRKEYCAA NRLKFETPHQ LMLKGNLFFK DKPYVYPFFK DDYFIAGKNS WTGNVALGVE
KGIPVNISAN RSAYKPVKIK DIQPIEGIAL DLNALISKGI TGKPLSWDEV RPYWLKEMPG
TYALTARLSA DRAAKFKAVI KRNKEH