Gene Phep_3740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3740 
Symbol 
ID8254872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4463674 
End bp4466838 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content48% 
IMG OID644937402 
Producthypothetical protein 
Protein accessionYP_003093993 
Protein GI255533621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTCAA ATTTTTGGAC AAGGCAGATT TTGTTGGTTT TGTTACTGCT GGCAGGAAGT 
CCTGCCCTTG CCCGCAGCTG GATGCCCGGC TATAATTTCA GAAAGAAAAT AACCATTGAT
CAATCGAAAG TTTCCGGTAC GGCCAGTCTT TTAAACTTTC CTGTGCTGAT TGTCCTTGAG
GATGCTGAAC TCAGGTACAT CGGCAATTGC GAAGGAAGGT TGCAGAACAG CAGGGGGCTG
GATATCTCCT TTGCGGCAAC CAATGCCCCT CAGCTGCCGC TTGCTTTTCA GCTTGACCAT
TATGATGCGG TTAATGGGAA ACTGGTGTGC TGGGTAAACA TTCAGGAGCT ATTTACCGGC
AGCAATCCTG GCCATAACGA AATCTATCTT TATTATGGCT CAACCTATAT CCACGACCCG
TTTACGCTTT CTGCAAGGGC CACCTGGCCG GCCAGCTATC AGCAGGTCTG GCACCTGAAC
CTGGATGCTG CACCTTCCAT AAGCCGCAGT GCAAATCATG GTCCGGAGAT GAGTATGACA
GGGAGTCCGG GAACAGGGCC GGCTAACTTC CCTGCCGCCC ATATTGGCAA TGGACTGTTG
CTGAATGGCA GTACGGATGG GATGGCTGCT GCAGCAGATA CCAATACGAC GGTTTGCATT
ACCGCCTGGA TTAAAATGGA TCACAGGGGC ACTGAGCAGG TGATTTTAGC CAGCGACACC
ACTGCTGGCG GATATGTCAT TAAGGTGAAT GCCCAGGGTA ACCTTGTTTT TGAAACTAAA
AATGCTGAAG GTTTCAGAAG TGCCACAACC GCCGAGGTAT TGGCAGTAAA TACCTGGTAC
AATCTTGCCT GTATTTTTAT TAAAGGGATA AGAAGAATAT ACATCAATGG GGCATATAAA
GCTGGCGGCG GATCAAATGG GGTAAAGCTT GGGCGGTCGG GGGCATTGAG TATTGGAAAA
AGTAAACAGA ATGGCAGCTA TTTTAAGGGC ACAATTGATG AACTGAGGAT TCAAAATGTA
GAGCGTAATG TCGACTGGAT CAGCACTGAA TACAGGAACC AGGCTAATCC GGCCAGTTTT
ATCAGCGTTG CAGCTGAGGA AATTAATCCG GCAAATGTTC CGGTGGTCAA CGAGTTTACC
GGAGCTGCCG GAACGGAAGA CTGGTCGGAC GAGGGGAACT GGAGTTTGGG GGGCTTACCT
GCACAAAATG CTCAGGTGAT CATTAAAGGA GGTAAAAAAA CGAAGCTTTT AGCTCCTGCG
GCAACCATCA TTAATCAGCT CACTTTAGAA CCCGGGGCAA GACTTCATCT GCAGAGCGGC
CTTGAAGTGA ATTGTGCGGC CAGTATTGCA GCAAATGCTG CTGTAGTACT GGAGGATGGA
GCAAGGCTCA CTTTTAAATA CGATGTGCTG AACCATGGCC GCATTGGTTT AAATAAGGGT
CATGGCAGTC TTGTTTTCAG GGGAGGGCAT GCATTGCAAA CGCTTTCAGG TACAGGCCTG
GTTACTGTTT CGCGGCTCGA AGTAGAGCTG GCCTCGGCTA CCCATACGCT GTTGCTGCAA
TCCCCGCTAA ATGTGGGCAG ACAGGTGCAG CTGATCAGGG GAACCCTCAA TTCCAACGGA
AACCTCAGCC TGCTTGCCGA TACCTTAAAT TATGGTGCCG CATTGATGCC GGTAACAGAT
CCCGGTAATA CACAGCTCAC AGGAGATGTA CACGTACAGC ATTTTGTGAA AGGTAACTTT
GCTGCCCCTT CAACGGCAAG GGGCTGGTGC TTGCTGGCTG CTCCCGTATA CCGGTCTGAA
TTAAACGGGC AGCCGCAAAA TAATTTTGCG GCCATTCAGG CCAGTGTCTT TGTAACCGGT
ACTGGCGGCA CTTTAAATGG CTTTGATGCC TCGCCCAATA ATGGAGGAAC GATTTATACC
CACGATCAGT CCTTACCCGG CAGCCTTTCA CAAAAATACA AGGCCATTCC CAGTATGGAT
GTCAGCATCC CATCCGGAAA GGGTTTTTAT CTGTTTTCAA GAGGCAGCAG GAATATTCCC
GATGCCTACC TGCACCAAAT CCAGACACCT CCGTTTTCTA ATCCAGGGCC TTACACCATT
ACCTATACCG GTAAATTGTA TACAGGCGAT TTAACGGTAA ACCTTTTTAA CAGGAACAGC
GGGGGAGAGG GCGATGGTTT TAACCTGCTC GGAAACCCGT ATGCTTCGGC AATCCGCTGG
GCCGCATTGC AAAAACAAAA TGTTAGTCCC TTCATCTGGG TATTCAATAC ACAAAACAAT
GCCTATCAGG TTACAGACGA CCCGGATTAT ATCATCCCTT CGGGAACTGG CTTTTTTGTG
CGGGTAAACA GCGGAAATGC CAGTGGTTCG CTCAGTTTTC AGGAAAGCGC AAAATATACC
GGTACTACTG TTCCGGCAGC GCAGATGGCC CTTAGGGAGT CCCGAAGGGC AAAGGAAACA
ACAAGCAGGT TAAAAATACA GCTGTATGCA GCCGGACTGA CCGACAGTTA TACATTGATT
TTTAGCAGCA AAGGTAATGA TGGGATAAAT GATGCAGATG CCGGAAAAAT TGGAGAAGGT
TACCTGAGCA TTGCAGGGAT AGCCGGGAAC GGAACAAAAT TGTCTATAGA TGAGCGCGCA
ATAGACACCC TCCGAAAAGA GGTCTGTTTG TACCTAAAGG GCTGGGCAAG CGGAAATTAT
ACTTTAAATT TAAAAGCATC GCTTAAACCC AATGAGGAAA TTGTACTGGC CGACCGTTAT
CTTGGTATCA ATAAACGTTT AACAGAACCG GAAAGCAATT GCCATTTTTT TATAGACACC
GCCATTCCGG CATCTTACGG TCAGCAACGC TTTGCTATTC TGTACAGGGA GCTCCCGGAA
GTAAAGCAAC AAGATACCGA AACCGATAAA AACATTGTGG TATATCCCAA TCCCTTTAAG
GAGTGGCTTT ACCTGAAGTC GGCCAGGCTG ACCTATAAAA ATCTGAAAGT TTTAATCAGG
GATATCACGG GCAGGGTAGT CTGGAGTAGC GTGCTTCCCA TTCTGGATGC GGGTATTCCG
GTTCAGCAGT ACTGCGGGCA GCTGGTAAAA GGGGTCTATT TTTTACAACT GCTTAATCCG
AAAAACAATA AAGTGGAGGC TGTTTTTAAA GTATTGCGAA ACTAA
 
Protein sequence
MYSNFWTRQI LLVLLLLAGS PALARSWMPG YNFRKKITID QSKVSGTASL LNFPVLIVLE 
DAELRYIGNC EGRLQNSRGL DISFAATNAP QLPLAFQLDH YDAVNGKLVC WVNIQELFTG
SNPGHNEIYL YYGSTYIHDP FTLSARATWP ASYQQVWHLN LDAAPSISRS ANHGPEMSMT
GSPGTGPANF PAAHIGNGLL LNGSTDGMAA AADTNTTVCI TAWIKMDHRG TEQVILASDT
TAGGYVIKVN AQGNLVFETK NAEGFRSATT AEVLAVNTWY NLACIFIKGI RRIYINGAYK
AGGGSNGVKL GRSGALSIGK SKQNGSYFKG TIDELRIQNV ERNVDWISTE YRNQANPASF
ISVAAEEINP ANVPVVNEFT GAAGTEDWSD EGNWSLGGLP AQNAQVIIKG GKKTKLLAPA
ATIINQLTLE PGARLHLQSG LEVNCAASIA ANAAVVLEDG ARLTFKYDVL NHGRIGLNKG
HGSLVFRGGH ALQTLSGTGL VTVSRLEVEL ASATHTLLLQ SPLNVGRQVQ LIRGTLNSNG
NLSLLADTLN YGAALMPVTD PGNTQLTGDV HVQHFVKGNF AAPSTARGWC LLAAPVYRSE
LNGQPQNNFA AIQASVFVTG TGGTLNGFDA SPNNGGTIYT HDQSLPGSLS QKYKAIPSMD
VSIPSGKGFY LFSRGSRNIP DAYLHQIQTP PFSNPGPYTI TYTGKLYTGD LTVNLFNRNS
GGEGDGFNLL GNPYASAIRW AALQKQNVSP FIWVFNTQNN AYQVTDDPDY IIPSGTGFFV
RVNSGNASGS LSFQESAKYT GTTVPAAQMA LRESRRAKET TSRLKIQLYA AGLTDSYTLI
FSSKGNDGIN DADAGKIGEG YLSIAGIAGN GTKLSIDERA IDTLRKEVCL YLKGWASGNY
TLNLKASLKP NEEIVLADRY LGINKRLTEP ESNCHFFIDT AIPASYGQQR FAILYRELPE
VKQQDTETDK NIVVYPNPFK EWLYLKSARL TYKNLKVLIR DITGRVVWSS VLPILDAGIP
VQQYCGQLVK GVYFLQLLNP KNNKVEAVFK VLRN