Gene Phep_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0784 
Symbol 
ID8251873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp923574 
End bp926666 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content44% 
IMG OID644934434 
ProductLyase catalytic 
Protein accessionYP_003091068 
Protein GI255530696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAA TTTTCAGAGA AAGACCAGAT ATGAACCGCC TGTTGTTTGT TTTTTTATTG 
ATTTGCAGCA TCACTGTAAA TGGTTATGCG CAAATGCAGA CCGATATTTG CGAAAATATA
CTGCCCAAAA ACTGGCAGGT TGTGAATGGG AAATTATCGC TTAGCGACCA GCATTTTAAG
CTGGGCAAAC AAGCTGTAAA ATGGTCCTGG ACAGCAGGAC AAAAAAGCCG GTTGCACATT
AGTGATAAAG CTTTTGAAGC GGTTGCCGGC AACCCGAGAA GTACTTTTGT AGTTTGGATA
TACAATGAAA CACCAATAAA TGACAAGTTG TTGTTTCAGT TTGGCAATGG AGTTAAAACC
GCCTCTGGTT TTGAATTTAA GCTGAACTTT AAAGGTTGGC GTACGGCATG GGTAATGTAT
CACAGGGATA TGCAGGGTAA GCCTGTTGAA GGAATGAATG CCATGCAGGT CATAGCGCCT
GCTTCCGTAA AAAACGGGAC CGTTTTTTTA GATCAGCTGA TGTATGATGT AACCATCAAC
CCCCGCTCCC CCATGCGTGA TGAACAATTG CCGTTTATTA ATCCCGATGC CGATAAAGCG
GCCAATGCGC ATTGGAACGC ACTGTATAAT TTTAGCCATA GTCCTCATTA TTTAACACTT
GCAGAAGGAG TAACCAGGCA GGAACTGCTC GACCTGGAGC TGATCAGCAG TCGGTACATA
GACATGATCA TGCCTGTAAA AGGTACTTTG GTAAAAAACA TGCTTGAAGA GATAGAAAAA
GGTTTTGCCT ACTGGAACAT TCGCCGCGAA GGCAGCAGAA TAAGTGGCAG GCCGGTCTAT
TCCATGAACG ATACGGAACT GGTATCTTTA TCACCTGCAG AAAATGTAAA GGAAGCCAAC
CGCTTATCTG GCATCAAGCG GTATACACAA TTGATGTTGC AGGTAGCACA GGCTTTTCAT
GCCGGTACTG ATGCGAAAGA AAAGCTGCGG CTGGAAAATA TTTTTATGGA TATGCTGGAC
CATCTTGAAG ATCAGGGATG GGCCTATGGC AGTGGGATGG GGGCTTTACA TCACCATGGT
TACAATCTGG AAGGCTATTA TCCGTCGTGT TTGCTAATGA AAGACGTAAT TAAAAGACAA
GGTAAACTGG AACGTACTTT TCAGAGCATG AGTTGGTTTA GTGGTTTGGG CCGTACTTTA
CAACGGCCAT TGCCATCGTC CAATATAGAT GTATTTAATA CACTGCTGGG CAGTATGCTT
TCCAGCATTT TAATTATGGA CAACTCGCCC GAAAAACTGC GTTATCTGCA TAGTTTTTCC
AATTGGTTGT CGGAAAATGT TAAGCCAGAT CATACCATAG AAGGGGCCTT TAAACCTGAT
GGGGCTGTGT TTCATCATGG AAACCTGTAC CCAGCCTACG GAATTGGTGG TTATACAGGC
ATTTCGCCAG TCGTTTTTGC ACTAAGCGGT ACAAGCTTCC GGATGGACCA AGAGGCCCAT
GAAAGTTTAA GGAATAGTTT GCTGATGATG CATTATTATA CCCATCCATT TAAATGGCCG
GTTAGTGTGG CAGGGCGGCA TCCTACGGGC AGCTGGCGCA TTGCCGATCT GCCTTATGCC
TATATGGCCA TGGCAGGTAC ACCCGACGGA AAAAACAAAA CCGATAGCCT GATGGCGGCA
ATTTATTTAA AGCTGAATGA AGGCAAAAAA AACAGGTGGA TTGCTGCTTT TAAAGCAGCT
AAGATTAAAC CGGCTGGCTA CAGCAGCGGG CACTGGAACC TGAATTATGG CCTTTTCGAC
ATTCACAGGA GGAAGGACTG GTTGCTGACC ATCAGGGGGC ATAACCGCTA TCTGATCAGC
AATGAATCCT ACCCTGGGGC AAATGTATTT GGAAGGTATG TAGCCTATGG ACAGCTGGAG
GTGCTCTTTC CGGAAACCAA AACGGATGAT GGCAGCAATT TTCGGGATGA AGGCTGGGAC
TGGAACAACA TCCCCGGTAC TACTACCCTG CATGTGCCCA TCGCAAAGCT GAGGGCAAAC
ATCATCAATG CGGATGATTT TAGTGGCATT GAAGAAATGC TGATCACTGA TGAGCGCTTT
GCCGGAGGAA CCACATTTAA AAAACAAGGT ATGTTTGCCA TGAAACTGCA TGGACATGAT
AAATATGATA TGGGAAGTTT CAGGGCTACA AAATCCTGGT TTATGTTTGA TAGCCTGGTG
GTATGCCTGG GCTCGGATAT TCGCAATACT ATCCCTGATT ATCCTACACA AACTACCCTT
TTTCAAAACT ATTTGAAAAA AAACAGTGAT ACGGTCGTGG TTAATGAGCG GGTAGTAACT
GCATTTCCCT ATAAAGAAGA AGGACAAAAA GGGAAAGCCT TAAGTGTGAT AGATAACCGG
GGTATTGGTT ATTATCTTCC AGATGCTACA ACGGTTTTAC TTACAAAAGC AAAACAATTG
TCGAGAGACC AGAAAGATAC ACGTGAAACT ACTGGTAATT TTGCTAAATT AATACTTGAA
CATGGAAATG CACCTGTAAA TGCAGGCTAC GAATACGCCA TGCTGGTTAA AACAGATAAG
CAGGAAATGG AAAAAATGGT CTCGTTGATG CAAAGCAAAC AACCTTTGTA TAAAGTGCTG
CGGAAAGACA GTATTGCCCA TACCGTATGG TATGCGCCGG AACAACTTAC CGCAATGGCT
GTTTTTAACA GCAACAAACA GTTGAATGAT TCCCTGCTGA TCGGCAACAA CAGGCCCTGT
CTGCTCATGT ACCATAAGGA GGGCAGTAGT TTGTCTTTGT CGGTCACCGA TCCTGACCTT
GCTTTTTATG AAGGGCCAGA TGATAGTCCG ATTAGTCCAT CCGGCAAAAG AGAAGAAGTA
AGTATTTATT CGCGCAGCTG GTACAGATCA CCCTCAAAGC CTTCGGTAGT AAAGCTGCTG
ATCAAGGGCA GATGGACTGC AGATCCAGCT AACAAAGCAT TAAAAGCAAT ACCTCAGGCT
GGCGGAAACA CCCTTGTCAG TATAAATTGT AAAGATGGCC TGGTGTCATC TGTCCAACTA
ATTAAAAGTA ATAATAAGGA GAATGTAAAA TGA
 
Protein sequence
MNTIFRERPD MNRLLFVFLL ICSITVNGYA QMQTDICENI LPKNWQVVNG KLSLSDQHFK 
LGKQAVKWSW TAGQKSRLHI SDKAFEAVAG NPRSTFVVWI YNETPINDKL LFQFGNGVKT
ASGFEFKLNF KGWRTAWVMY HRDMQGKPVE GMNAMQVIAP ASVKNGTVFL DQLMYDVTIN
PRSPMRDEQL PFINPDADKA ANAHWNALYN FSHSPHYLTL AEGVTRQELL DLELISSRYI
DMIMPVKGTL VKNMLEEIEK GFAYWNIRRE GSRISGRPVY SMNDTELVSL SPAENVKEAN
RLSGIKRYTQ LMLQVAQAFH AGTDAKEKLR LENIFMDMLD HLEDQGWAYG SGMGALHHHG
YNLEGYYPSC LLMKDVIKRQ GKLERTFQSM SWFSGLGRTL QRPLPSSNID VFNTLLGSML
SSILIMDNSP EKLRYLHSFS NWLSENVKPD HTIEGAFKPD GAVFHHGNLY PAYGIGGYTG
ISPVVFALSG TSFRMDQEAH ESLRNSLLMM HYYTHPFKWP VSVAGRHPTG SWRIADLPYA
YMAMAGTPDG KNKTDSLMAA IYLKLNEGKK NRWIAAFKAA KIKPAGYSSG HWNLNYGLFD
IHRRKDWLLT IRGHNRYLIS NESYPGANVF GRYVAYGQLE VLFPETKTDD GSNFRDEGWD
WNNIPGTTTL HVPIAKLRAN IINADDFSGI EEMLITDERF AGGTTFKKQG MFAMKLHGHD
KYDMGSFRAT KSWFMFDSLV VCLGSDIRNT IPDYPTQTTL FQNYLKKNSD TVVVNERVVT
AFPYKEEGQK GKALSVIDNR GIGYYLPDAT TVLLTKAKQL SRDQKDTRET TGNFAKLILE
HGNAPVNAGY EYAMLVKTDK QEMEKMVSLM QSKQPLYKVL RKDSIAHTVW YAPEQLTAMA
VFNSNKQLND SLLIGNNRPC LLMYHKEGSS LSLSVTDPDL AFYEGPDDSP ISPSGKREEV
SIYSRSWYRS PSKPSVVKLL IKGRWTADPA NKALKAIPQA GGNTLVSINC KDGLVSSVQL
IKSNNKENVK