Gene Phep_3863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3863 
Symbol 
ID8254997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4634604 
End bp4636574 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content46% 
IMG OID644937527 
ProductChondroitin AC lyase 
Protein accessionYP_003094116 
Protein GI255533744 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.569924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA AGACCTGCTT TTTTTTCATC AGCTTACTAT TCATTTGTCA TCAGGTGAAG 
GCACAGGCCG ATACCATTTT GAGCCGCTAT AAGCAATACT TGCTTGGCAC TGTGAAACCG
GAAGGTGATA TAAACCTTAT GCTGGGTTCA TTGAATGCTG AAGGGCAATG GCCGGATATC
AATTATACAG ATACAGAAAA AGCGAACTGG AAAAATCTGA TCCATTTAAA AAGAGTACGC
GACCTGGCCC TTGTTTGGGA AAAGCCGGGT TCAGCCCTGT ATCATAACAT ACAATTAGAG
AAGGCCATTA ACTTAGGGTT AAACCATTGG CTGGATAAGC GCTATAGGAA CTCCAACTGG
TGGCACAACG AAATTGGTGT TCCTCAATAC ATGCGCGATA TCATTATTCT TATGAAAAAA
GAATTGAGGC CTGAACAGCT CAAAGCAGCA TTGGAAGTGA TGGCCCAGCA CCGGGTGCAG
GAAAACTGGG TAGGCGCAAA CCTTACCTGG AGTGCCGATC TGGGCTTTCA TTACGGCGCC
TTAACCGGTA ATGTCCGGAT GATGGAGCTT TGCCGGAACC TGATTGTCAA AGAGATCAGG
ATTTCTACAG AGGAGGGTGT ACAGCCTGAT TTCAGTTTCC ATCAGCATGG CGCCCGTTTA
CAGATGTACC AGTATGGAGC CGCTTTTTTA AAGGAGAACA TCAGGCTGGC CTGGGAGCTT
AGGGGCACGG CAATGGCTTT CCCTAAAGAG AAAATCGGTA TACTTACTGA TTTTGCTTTG
AAAGGCTGGC AGTGGATGGC CAGGGGAATA CATACTGTGC CAGGTACTAT GGACAGGTCT
GCAAGCAGGG TAAATGCGCT GGACAATGCC GATCTTCGTG AGTTTATTCC CTACTTCATT
GCGCTTAGTC CCGAGAATAA GAACGCTTTC TGTCAGCTGG ATGAAATACA GCAGGGGAAA
GGCGCGCTTA CAGGCTACCG TTATTACCCT TATTCCGATT TTGCTGCATT TCATCAAAAA
GACTTCAGCT TTTTCCTCAA AACGATTTCC TCACGTACCC TGGCAACAGA ATCCATCAAC
AGCGAAAACT TAAAAGGAAA CCTGCTTAAC AGCGGTGACG CCTATCTGAT CAGGGATGGG
AAGGAGTATT TTAACCTGAT GCCGGTATGG AACTGGGCCT GTCTGCCTGG TGTTACCACT
TTTGTTGGGG CAGATAAGGT GAACAGGCAA GCCTTTACAG GGAGTGTAAG CAATGGGCAA
GCCGGACTCA CAGTAATGGA TTATCAATTA GAGAATAAAG ATAAAAGTAA ATTACTTCGT
GCAAAGAAGT TTTGGGCTGT GGCTGGTTCG AAGGTGCTCT GTTTAATTGC GGGTCTTGAA
GGTTCGGGAA TAACAGCAGC ATACACTACA TTGGACCAAT GCAGGTGGCG TGGCGGCGAA
AGTGTCAATG ATTCCTGGAT CTACCATGCG GGATTTGCTT ACATCCCCCT GGGAGCTGCA
AAGATAAGCC TGCATGTAAC AGATGCCACC GGCTCGTGGA AGGAAATCAA TGCCGCAGAA
AGTAATACCC CGCTCACAGA AAAAATATTT ATGCCGGTGT TGGAACACAA AACGCTTGAA
AATGGCAATT CCGGATATGT CATTTCGGCC TGTAAAAACG CTAAAGAAGC AGCAAGGCTA
GTGGCGGGTC CGCAATGGAA AGTGCTCTGC AATAATAAGG AGATACAGGC TGTTTCATTT
AATGACGGTA CAATAATGGC TGCTTTTTAT CAGCCTGGAG TACTTAAAAC CGGACAAGGT
CACCAGCAGC TTAGCGTAGA TCAGCCCTGT CTGGTTTTAA TCAGTAACAA AAAGATCTGG
TTAAGCAATC CTGCATGTAA GCCGCTTAAG GTAAAGCTGG GTATAAATGA TCAATATAAA
GTGGTTGAAC TACCTGCAGA TGGCAGTTCC ACACCTGTCC GCTTCCATTA A
 
Protein sequence
MKKKTCFFFI SLLFICHQVK AQADTILSRY KQYLLGTVKP EGDINLMLGS LNAEGQWPDI 
NYTDTEKANW KNLIHLKRVR DLALVWEKPG SALYHNIQLE KAINLGLNHW LDKRYRNSNW
WHNEIGVPQY MRDIIILMKK ELRPEQLKAA LEVMAQHRVQ ENWVGANLTW SADLGFHYGA
LTGNVRMMEL CRNLIVKEIR ISTEEGVQPD FSFHQHGARL QMYQYGAAFL KENIRLAWEL
RGTAMAFPKE KIGILTDFAL KGWQWMARGI HTVPGTMDRS ASRVNALDNA DLREFIPYFI
ALSPENKNAF CQLDEIQQGK GALTGYRYYP YSDFAAFHQK DFSFFLKTIS SRTLATESIN
SENLKGNLLN SGDAYLIRDG KEYFNLMPVW NWACLPGVTT FVGADKVNRQ AFTGSVSNGQ
AGLTVMDYQL ENKDKSKLLR AKKFWAVAGS KVLCLIAGLE GSGITAAYTT LDQCRWRGGE
SVNDSWIYHA GFAYIPLGAA KISLHVTDAT GSWKEINAAE SNTPLTEKIF MPVLEHKTLE
NGNSGYVISA CKNAKEAARL VAGPQWKVLC NNKEIQAVSF NDGTIMAAFY QPGVLKTGQG
HQQLSVDQPC LVLISNKKIW LSNPACKPLK VKLGINDQYK VVELPADGSS TPVRFH