Gene Phep_3959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3959 
Symbol 
ID8255093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4761010 
End bp4764210 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content42% 
IMG OID644937623 
Producthypothetical protein 
Protein accessionYP_003094212 
Protein GI255533840 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAA CATTTACCAA GCAACTAAAA ATGCCTTTGC TGGCATTTAT GTTTTTATGG 
ATGGGCGTTT TACATGTTCA AGCGCAATTG ACACATCCGG GGATACTGTT CAACGCTGCT
GGCCTGGCCA GGTTAAAGAC CTATGCAAAT ACGGAACGGC AGCCATGGGC AGCGACATAT
GCTAAATTAC TTGCCTATAA CGATATAAAC TATGTGCAGG AACCTGCTTA TGCCATTGTT
AATAGAGAGT TCGCAGGGGC AACCAGTACT GAGTCAAGGG CAATGTCTGC CCAAAGTCAG
AGAGCTTACA GGTGTGCAAT TTTGTGGGTC ATTACTGGAA ATCAAATATA TGCAGATAAA
GCTAAAACAA TATTGAACTC GTGGTCAGGT ACCCTGGATA GTATTACCGG AGGCGCTGCC
AAGTTATGTG CGGCCTGGTA TGGTTTTGGT TTTGTTAATG CTGCGGAAAT TCTGCGTTAT
ACAAACTCAG GCTGGAGTAC GACAGATATA CAGCGCGCGG AATCTATGTT CAGGTATAAA
TTTTATCCGG TAATTGAACC TTTCCAGGGG GGATGGGCGG GGAATTGGGA TACCGCCATT
TGTAAAACTA TGATGGGAAT AGGAGTTTTT ATAAATGACG TTGCCATTTA CACCAGAGGG
CGCAATTATT TGTGGTCAAC CACCGAAACC GCTTCAGGAA CACTGAACAA TTACATTTAC
CCTACTACCG GCCAGTGTTT TGAAAGCGGT CGCGATCAGG AGCATACCCA AATGGGCATA
GGGGGCTTAG CGGAAGCATG CGAAATAGGG TATAACCAAG GTACGGATCT TTATGGTTTG
TTTTCAAACA GGTTACTGTT AGGAACCGAA TATACCGCTA AATATAATTT AGGGTATAGC
GTACCCTATA CAACAAATCA TTATGGATCT GTGATTTCTC CTGATTTAAG GGGAGAGTTT
CTTCCGTTTT ATGAATTGGT TTACAATCAT TATGTAAACA GAAAAGGGAT GTCCGGAGAG
CCGGTTAAAT TTACCAAAAT GGTTGTGGAA AAAATAAGGC AAGACAATGG AGGAGAAAAT
GGTACTGCGA TATTATCAGG ATACGGATCA TTGTTGTTTA ATGAATATGT ATTTAAATAT
GTTCCTGCTG CGGGTGACTA TCGGACAACA GGATTGAGTA GCAGTATGGG TACCCCATCA
CAGTTTGAAG TTTTTACGAA CGGAGATTGG GTAACAGCAA CAACTGCCCT TGGATTAACA
ACTAATTTGT TGGTAAGAAA TGGGCAAAGT GCATTGGCAG CAGGTACTAG AAATTTAAAG
AACCTGATAG TTGGAGAGGG GGACGGAGCT GTTTTAAGGG CACAGGTCAA TGCAGGTACA
GTATCGGCAT TAAACATTGT AGAGCCAGGT AATTTAAGCA GTATTCCTGC GCTATCTTTC
GTTAACGGTG GACAAGCTAC AGGCTCTACC AGTGCCGTTG CAACGATTAC AAGCGTAAAA
GTAACAGGTG CAGATATAAA AAACCGGGGT TCTGGCTATA CTAATGCCGG TGTTACTTTT
AGCGGTGGTG GCGGAAGCGG AGCCACGGCT ACTGCAATAG TGAGTAATGG TAAAATTATG
GATATTGTGA TCACCAATTC AGGATCTGGC TATACATCCA TACCAACTGT GTCTGTTACA
GGAAATGGTA CAGGAGCAGC TGTTTCGGCA AAAGTTGGTA TTACAGAGAT AAGCATTAGT
AGCGGAGGGA CCGGTTATAC CAAAGCGCCA GCAGTGATAG CAGGAACCTT TCTGAGAGTG
AATGACGGAA TTGCTCTAGG TGTATCAAAT GATGTATCTT TTCAAAGAGG ATCATCTGTG
TATAGCGGGG GAGCCTCTTC GGCCGTTACA GGTACATTGA ATATCAGCGG CAACCTGATA
GCAGAAGATA CTGTTAATTT TATCTCTGTA GCTCACGATA ATACCATTTC ATCACTTACA
GTAAATTTTA AAAAATCTAC CGTAGATAAT ATCGGTTTCA TAGGCGGAAA TGTTACTTTT
AGCGCACTGG GTGTTGAAGC TGCGAATACC CTTAAATTGA AGTTAGGTGC CGTAATGAAT
GTTGCAGATG CTATTGGTCT AAACAGTACT GGTATAATTG ATGCAACTAA TGGCACTATA
GGTTTTGTAA ATATCTCTCC TTCTATATCG ATAGCAAGAA CCATCGCAGC CAATACTTTT
AAAGATGCAA CTGTCAATAA AATGGTAGTC AATTCAGCCG CCGGTGTTAC GCTGAACCAG
GGGCTGACGA TAACAAAACT GGATATGCAG AAAGGATTGC TGAATATTCC GGAAACTTCA
GAGATTACTG TTTCAAGTGT TTCAGGCGGA AGCAACACTT CCTATATCAA CACCATGTCG
TCAGCAAGTT CCGGTGCAAT TGCGAAAGTT AAAGTAACCG GATTAACAAC TGCTCAGGGA
GATATTCCTT TAGGAAATGG CGGAAATTAT CTGCCTGTAC GGATTACCCC ACCTGCAGAA
ACTGCATTTA ACTTTACAAT GAGTGTACTT ACAGGCCTTA CAGCTAATGG TTTGCCAGAT
GGCGGTGTAG TTGCAGACAA GAGCCAGTTT GTAAATGCGT CATACCATGT TATCCGTACA
AGCGGAACAG GAGATTATAC GTTTCGTGTT GGCTTTCCTG CAAGTCTGAA AGGCAGTGCT
TTTACCCCAT CATCTCCTTT CGGTATTTCA AAATATAATG GAAACAGTTG GTTACCTGTT
ATAGGCAGTG GTAATTATGC ACTAAATACC GCTACCGCTA CTTTTAATAC CAATGGCTTA
CGCCACCGGA TACAGATTGG TGGGGCTCAG CCATTGAATT TAACAGGTAC CAATGCCAGC
GGCTATACAG AACGCAAGGC GATTAACTTA ATCGAGCCGG AAGATTTACT GAAGATCAAG
GCAACCAATA TCCTATCTCC AAACGGTGAT GGGGTGAATG ACAAATGGGT GGTGGATAAT
ATTGATTTTT ATCCAAATAA CGAGGTGAAG ATCTTTGAGC GTACGGGCAG ATTAATGTAT
AGTAAAAAAG CTTATGACAA TAGTTGGGAA GGTACCTTAA ATGGTGTGCC GCTGGCTGAA
GGAACTTATT ACTATATCAT AAATTTTGGA ACAAGCAGGC CAAGTTTAAG CGGTTTCATT
ACCATTACCA GACCAGAGTA A
 
Protein sequence
MNLTFTKQLK MPLLAFMFLW MGVLHVQAQL THPGILFNAA GLARLKTYAN TERQPWAATY 
AKLLAYNDIN YVQEPAYAIV NREFAGATST ESRAMSAQSQ RAYRCAILWV ITGNQIYADK
AKTILNSWSG TLDSITGGAA KLCAAWYGFG FVNAAEILRY TNSGWSTTDI QRAESMFRYK
FYPVIEPFQG GWAGNWDTAI CKTMMGIGVF INDVAIYTRG RNYLWSTTET ASGTLNNYIY
PTTGQCFESG RDQEHTQMGI GGLAEACEIG YNQGTDLYGL FSNRLLLGTE YTAKYNLGYS
VPYTTNHYGS VISPDLRGEF LPFYELVYNH YVNRKGMSGE PVKFTKMVVE KIRQDNGGEN
GTAILSGYGS LLFNEYVFKY VPAAGDYRTT GLSSSMGTPS QFEVFTNGDW VTATTALGLT
TNLLVRNGQS ALAAGTRNLK NLIVGEGDGA VLRAQVNAGT VSALNIVEPG NLSSIPALSF
VNGGQATGST SAVATITSVK VTGADIKNRG SGYTNAGVTF SGGGGSGATA TAIVSNGKIM
DIVITNSGSG YTSIPTVSVT GNGTGAAVSA KVGITEISIS SGGTGYTKAP AVIAGTFLRV
NDGIALGVSN DVSFQRGSSV YSGGASSAVT GTLNISGNLI AEDTVNFISV AHDNTISSLT
VNFKKSTVDN IGFIGGNVTF SALGVEAANT LKLKLGAVMN VADAIGLNST GIIDATNGTI
GFVNISPSIS IARTIAANTF KDATVNKMVV NSAAGVTLNQ GLTITKLDMQ KGLLNIPETS
EITVSSVSGG SNTSYINTMS SASSGAIAKV KVTGLTTAQG DIPLGNGGNY LPVRITPPAE
TAFNFTMSVL TGLTANGLPD GGVVADKSQF VNASYHVIRT SGTGDYTFRV GFPASLKGSA
FTPSSPFGIS KYNGNSWLPV IGSGNYALNT ATATFNTNGL RHRIQIGGAQ PLNLTGTNAS
GYTERKAINL IEPEDLLKIK ATNILSPNGD GVNDKWVVDN IDFYPNNEVK IFERTGRLMY
SKKAYDNSWE GTLNGVPLAE GTYYYIINFG TSRPSLSGFI TITRPE