Gene Phep_3632 details


Gene Information

Locus tag: Phep_3632
Symbol:
ID: 8254763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4339869
End bp: 4342769
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 44%
IMG OID: 644937293
Product: Two component regulator three Y domain protein
Protein accession: YP_003093885
Protein GI: 255533513
COG category:
COG ID:
TIGRFAM ID:
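
The start and end coordinates, gene length, and protein length listed above are consistent with one another. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4339869, 4342769)
print(length)                  # 2901
print(protein_length(length))  # 966
```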


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCACT CTGGCCTTAA ATACCTTTTA TTTTATCTCG CTTTACTCTT ACCTTTTTTT 
GCAACGGCAG ATAACATACA ACGTATCGGG GTTCCTTATG TGCAAAACTA TCCCAAATCG
GTTTACCTTT CGGGTAATCA GAACTGGTCT ATAGCCAAAG ACAAATACGG CATCATGTAT
TTTGGCAATG CCCAGGGTCT GCTCAGCTTT GACGGCAAAT ACTGGCAGCA GTACAAACTG
CCTAACCGGC AAATTGTACG TTCGGTAGCT ACCGCTGCAG ACGGCACCAT TTACACTGGC
GGATTTGGTG AATTCGGCTA CTGGTCGGAC AAAAACAAAC ACCTTAGCTA CACTTCTTTA
ACCAACCTTA TCCCCCATCC GCACCGGCTT AAAGACGAGA TCTGGAAAAT ATATACCTTT
GGCAAAAAAG TCATCTTTCA ATCCTTCTCG GCCATTTATA TTTACGAGAA CAAAAAGATC
AGTGTAGTTA CGGCACAGCA ATCGCTCCTT TTTCTGCACC AGGTTGGCCA GCGTTTTTAT
GTGGAAGTAA ATGGGAAGGG GCTCTTTGAG CTTACAGGCA ACAAATTAAT CCCCTTAAAA
AACAACGGCT TAGTCCTGCC AAAAGGTGTA TTGTCTATAC TGCCCTATCA AAACGGCAGC
CTTTTGATCG GTACAAGTAA AGATGGGCTT TTTGTATATA ACGGTGAAAA CTTTAACCCC
TTAAACACTC CGGCAAATAC CTTCCTTAAA ACCTACCAGT TAAACAATGG TACACGTATA
CAGGACCGCT ATTATGCTTA CGGCACCATT CTGAACGGTC TGATCATCAT AGACGAGACG
GGAAACATCG TACAGCGTAT CAATAAATCG AGTGGCCTGC AAAACAATAC GGTACTGAGC
CTATATGCCG ATCAGGACCA GAACCTGTGG GCCGGTCTCG ATAACGGGAT AGACCGGATA
GAACTCAATT CACCCCTGTA CTTTTACTTT GATAAAACAG GACAGTTCGG AACGGTATAC
TCCAGCCTGA TCTATAAAAA CAACATTTAC CTGGGCACCA ACCAGGGCCT ATTTTACAGT
ACCTGGGCAT CCGGAAGGGG AAACCTTTTC AATACCTTCG ATTTCAAACT GATCCCCAAT
TCACAGGGTC AGGTATGGGA CCTTACCTTA ATAGACGACC AGTTATTTTG TGGCCACAAT
GATGGTACTT TTAAGGTAAC GGGTAATAAA CTGGAAAACA TTTCGACGGT AAAAGGGGGC
TGGACCATCA AAAAACTACA CTCAAATCCC AATTTTTTAA TTCAGGGCAC TTATAACGGG
CTGGTGCTGT TTAAAAAAGA TGGAGCCGGA CAATGGGTAT TCTGGCATAA GATAGAGAAT
TTTGGAGAAC CTTCCCGTTA TGTAGAGCAG GATACCAGGG GCGACATTTG GGTAAGCCAT
GCCTACAAGG GCCTGTATAA GCTAAGCCTC AGTCCGGACT TTAAAAAGGT CACCACCATA
AAAACCTATG ATGAAAGGAA TGGCCTGCCG GGGGATTATA ACATCAATAT CTTCACGCTG
GAAAACCGTT TGGTTTTTTC TTCAGATGAA GGATTTTTTA TTTACGACGA GATCAGCAAC
CGTTTTACCA AATACACAAC ACTGAACAAG GAACTGGGCA GTTTTGCCGG TGCCAATAAG
ATCATTGATG CGGGCTCAAA AAAATACTGG TTCATCAACC ATGGAAAAAT GGGACTTGTG
CACTTACTGG AACCAGGAAA AGTCCAGGTA GATTCCAGCA CTTTCAGCAT CCTGGACGGC
AGGATGGTAC AGTACTATGA AAACATCAGC AAAATAAGCG ATAAGATCTA TCTGATGAGC
GTGGACGATG GTTTTGTCAT TTACAACGCC ACTGAAAACC GAAATGGGAA AAACCATACA
ATACTCCCCC AGGTACTCAT CCGAAAAATT GAGGACATCA CCGATACCTA CCATACGATC
AGCGAGTTTG GCGACAAGGA TACTGAAATT GAGATTCCTT TTAGCCGCAA CAGCATCCGT
ATTTCTTTTG CCCTGCCCTG GTACAGGCAA TCCAAAATTA AATTTCAGTA CTATCTGGAG
GGTTATTCCA AACAATGGTC TGACTGGAGT GCAGCCTCAC AAAAAGATTT CACCAACCTG
GGCCAGGGTA GTTATGTCTT TAAAGTAAGG GCACGCATCA ACGAGAGCAC GGTTAGTAAG
GTTACCGAGT TCAGGTTTAC CATACACCCT CCATTTTATG CCAGTAACTG GGCCATTGCT
TTATATCTTA TCCTGTTTAT ATTGCTGCTC TTTACCTTTA AACGGCTTTA TGAGCGTAAG
CTAAGAAAAG ATCAGCGGGC CATATCTGAT AAATTACAGG CAGAGAAAGA GGCATTCCTT
AAAAAAGAAG CTGAAGCAAC AGAAAAGCAG ATCATCAAAT TGCAGACAGA GAAACTCCAG
GCCGAACTGG CGGGCAAGAA CAGGGAACTG GCCAACTCGG CCATGAGCCT GGTTTACAAA
AATGAACTGC TGCAGAAGCT GAGCCAGGAA ATCCTTAAAC TGAAAGATGA AAGCGGAAAA
CCGCTTGCCG AAGATCAGCT CAGAAAAATC CAGAAGGTAA TAGATGAAGG TATGAATGAT
GAACGCGACT GGAACCTTTT TGAAAGCAGC TTCAACGAAG CCCACGAGAG CTTCTTTAAA
AAACTGAAAG TAAACCATCC CGATCTGGTA CCCAACGATC TTAAACTTTG TGCTTACCTG
CGCATGAACA TGAGCAGTAA AGAAATGGCA TCTTTATTGA ACATTTCTTT AAGAGGTGTA
GAAATACGGC GTTACAGACT GCGTAAAAAG CTGGATGTGC CCCATGACAA GAACCTTGTA
GAGTTCCTGA TGGAGCTGTA A
 
Protein sequence
MKHSGLKYLL FYLALLLPFF ATADNIQRIG VPYVQNYPKS VYLSGNQNWS IAKDKYGIMY 
FGNAQGLLSF DGKYWQQYKL PNRQIVRSVA TAADGTIYTG GFGEFGYWSD KNKHLSYTSL
TNLIPHPHRL KDEIWKIYTF GKKVIFQSFS AIYIYENKKI SVVTAQQSLL FLHQVGQRFY
VEVNGKGLFE LTGNKLIPLK NNGLVLPKGV LSILPYQNGS LLIGTSKDGL FVYNGENFNP
LNTPANTFLK TYQLNNGTRI QDRYYAYGTI LNGLIIIDET GNIVQRINKS SGLQNNTVLS
LYADQDQNLW AGLDNGIDRI ELNSPLYFYF DKTGQFGTVY SSLIYKNNIY LGTNQGLFYS
TWASGRGNLF NTFDFKLIPN SQGQVWDLTL IDDQLFCGHN DGTFKVTGNK LENISTVKGG
WTIKKLHSNP NFLIQGTYNG LVLFKKDGAG QWVFWHKIEN FGEPSRYVEQ DTRGDIWVSH
AYKGLYKLSL SPDFKKVTTI KTYDERNGLP GDYNINIFTL ENRLVFSSDE GFFIYDEISN
RFTKYTTLNK ELGSFAGANK IIDAGSKKYW FINHGKMGLV HLLEPGKVQV DSSTFSILDG
RMVQYYENIS KISDKIYLMS VDDGFVIYNA TENRNGKNHT ILPQVLIRKI EDITDTYHTI
SEFGDKDTEI EIPFSRNSIR ISFALPWYRQ SKIKFQYYLE GYSKQWSDWS AASQKDFTNL
GQGSYVFKVR ARINESTVSK VTEFRFTIHP PFYASNWAIA LYLILFILLL FTFKRLYERK
LRKDQRAISD KLQAEKEAFL KKEAEATEKQ IIKLQTEKLQ AELAGKNREL ANSAMSLVYK
NELLQKLSQE ILKLKDESGK PLAEDQLRKI QKVIDEGMND ERDWNLFESS FNEAHESFFK
KLKVNHPDLV PNDLKLCAYL RMNMSSKEMA SLLNISLRGV EIRRYRLRKK LDVPHDKNLV
EFLMEL
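
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be stripped before either can be used. The sketch below shows one way to do that and to translate the gene sequence so it can be compared with the protein sequence. It is an illustration only, not the method used to produce this record: `clean` and `translate` are hypothetical names, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which start codons are permitted; since this gene begins with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order, as used by the standard
# code and by translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon.

    Assumes an ATG start; alternative start codons allowed by table 11
    (for example GTG or TTG) are not converted to M here.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and final 21 bases of the gene sequence in this record.
print(translate("ATGAAGCACT CTGGCCTTAA ATACCTTTTA"))  # MKHSGLKYLL
print(translate("GAGTTCCTGA TGGAGCTGTA A"))           # EFLMEL
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the whole record in the same way.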