Gene Information |
Locus tag | Phep_3632 |
Symbol | |
ID | 8254763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4339869 |
End bp | 4342769 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937293 |
Product | Two component regulator three Y domain protein |
Protein accession | YP_003093885 |
Protein GI | 255533513 |
COG category | |
COG ID | |
TIGRFAM ID | |
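The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon that is not counted in Protein Length; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Phep_3632 record
length = gene_length_bp(4339869, 4342769)
print(length)                     # 2901
print(protein_length_aa(length))  # 966
```

Both results match the Gene Length (2901 bp) and Protein Length (966 aa) fields of the record.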
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCACT CTGGCCTTAA ATACCTTTTA TTTTATCTCG CTTTACTCTT ACCTTTTTTT GCAACGGCAG ATAACATACA ACGTATCGGG GTTCCTTATG TGCAAAACTA TCCCAAATCG GTTTACCTTT CGGGTAATCA GAACTGGTCT ATAGCCAAAG ACAAATACGG CATCATGTAT TTTGGCAATG CCCAGGGTCT GCTCAGCTTT GACGGCAAAT ACTGGCAGCA GTACAAACTG CCTAACCGGC AAATTGTACG TTCGGTAGCT ACCGCTGCAG ACGGCACCAT TTACACTGGC GGATTTGGTG AATTCGGCTA CTGGTCGGAC AAAAACAAAC ACCTTAGCTA CACTTCTTTA ACCAACCTTA TCCCCCATCC GCACCGGCTT AAAGACGAGA TCTGGAAAAT ATATACCTTT GGCAAAAAAG TCATCTTTCA ATCCTTCTCG GCCATTTATA TTTACGAGAA CAAAAAGATC AGTGTAGTTA CGGCACAGCA ATCGCTCCTT TTTCTGCACC AGGTTGGCCA GCGTTTTTAT GTGGAAGTAA ATGGGAAGGG GCTCTTTGAG CTTACAGGCA ACAAATTAAT CCCCTTAAAA AACAACGGCT TAGTCCTGCC AAAAGGTGTA TTGTCTATAC TGCCCTATCA AAACGGCAGC CTTTTGATCG GTACAAGTAA AGATGGGCTT TTTGTATATA ACGGTGAAAA CTTTAACCCC TTAAACACTC CGGCAAATAC CTTCCTTAAA ACCTACCAGT TAAACAATGG TACACGTATA CAGGACCGCT ATTATGCTTA CGGCACCATT CTGAACGGTC TGATCATCAT AGACGAGACG GGAAACATCG TACAGCGTAT CAATAAATCG AGTGGCCTGC AAAACAATAC GGTACTGAGC CTATATGCCG ATCAGGACCA GAACCTGTGG GCCGGTCTCG ATAACGGGAT AGACCGGATA GAACTCAATT CACCCCTGTA CTTTTACTTT GATAAAACAG GACAGTTCGG AACGGTATAC TCCAGCCTGA TCTATAAAAA CAACATTTAC CTGGGCACCA ACCAGGGCCT ATTTTACAGT ACCTGGGCAT CCGGAAGGGG AAACCTTTTC AATACCTTCG ATTTCAAACT GATCCCCAAT TCACAGGGTC AGGTATGGGA CCTTACCTTA ATAGACGACC AGTTATTTTG TGGCCACAAT GATGGTACTT TTAAGGTAAC GGGTAATAAA CTGGAAAACA TTTCGACGGT AAAAGGGGGC TGGACCATCA AAAAACTACA CTCAAATCCC AATTTTTTAA TTCAGGGCAC TTATAACGGG CTGGTGCTGT TTAAAAAAGA TGGAGCCGGA CAATGGGTAT TCTGGCATAA GATAGAGAAT TTTGGAGAAC CTTCCCGTTA TGTAGAGCAG GATACCAGGG GCGACATTTG GGTAAGCCAT GCCTACAAGG GCCTGTATAA GCTAAGCCTC AGTCCGGACT TTAAAAAGGT CACCACCATA AAAACCTATG ATGAAAGGAA TGGCCTGCCG GGGGATTATA ACATCAATAT CTTCACGCTG GAAAACCGTT TGGTTTTTTC TTCAGATGAA GGATTTTTTA TTTACGACGA GATCAGCAAC CGTTTTACCA AATACACAAC ACTGAACAAG GAACTGGGCA GTTTTGCCGG TGCCAATAAG ATCATTGATG CGGGCTCAAA AAAATACTGG TTCATCAACC ATGGAAAAAT GGGACTTGTG CACTTACTGG AACCAGGAAA AGTCCAGGTA GATTCCAGCA CTTTCAGCAT CCTGGACGGC 
AGGATGGTAC AGTACTATGA AAACATCAGC AAAATAAGCG ATAAGATCTA TCTGATGAGC GTGGACGATG GTTTTGTCAT TTACAACGCC ACTGAAAACC GAAATGGGAA AAACCATACA ATACTCCCCC AGGTACTCAT CCGAAAAATT GAGGACATCA CCGATACCTA CCATACGATC AGCGAGTTTG GCGACAAGGA TACTGAAATT GAGATTCCTT TTAGCCGCAA CAGCATCCGT ATTTCTTTTG CCCTGCCCTG GTACAGGCAA TCCAAAATTA AATTTCAGTA CTATCTGGAG GGTTATTCCA AACAATGGTC TGACTGGAGT GCAGCCTCAC AAAAAGATTT CACCAACCTG GGCCAGGGTA GTTATGTCTT TAAAGTAAGG GCACGCATCA ACGAGAGCAC GGTTAGTAAG GTTACCGAGT TCAGGTTTAC CATACACCCT CCATTTTATG CCAGTAACTG GGCCATTGCT TTATATCTTA TCCTGTTTAT ATTGCTGCTC TTTACCTTTA AACGGCTTTA TGAGCGTAAG CTAAGAAAAG ATCAGCGGGC CATATCTGAT AAATTACAGG CAGAGAAAGA GGCATTCCTT AAAAAAGAAG CTGAAGCAAC AGAAAAGCAG ATCATCAAAT TGCAGACAGA GAAACTCCAG GCCGAACTGG CGGGCAAGAA CAGGGAACTG GCCAACTCGG CCATGAGCCT GGTTTACAAA AATGAACTGC TGCAGAAGCT GAGCCAGGAA ATCCTTAAAC TGAAAGATGA AAGCGGAAAA CCGCTTGCCG AAGATCAGCT CAGAAAAATC CAGAAGGTAA TAGATGAAGG TATGAATGAT GAACGCGACT GGAACCTTTT TGAAAGCAGC TTCAACGAAG CCCACGAGAG CTTCTTTAAA AAACTGAAAG TAAACCATCC CGATCTGGTA CCCAACGATC TTAAACTTTG TGCTTACCTG CGCATGAACA TGAGCAGTAA AGAAATGGCA TCTTTATTGA ACATTTCTTT AAGAGGTGTA GAAATACGGC GTTACAGACT GCGTAAAAAG CTGGATGTGC CCCATGACAA GAACCTTGTA GAGTTCCTGA TGGAGCTGTA A
|
Protein sequence | MKHSGLKYLL FYLALLLPFF ATADNIQRIG VPYVQNYPKS VYLSGNQNWS IAKDKYGIMY FGNAQGLLSF DGKYWQQYKL PNRQIVRSVA TAADGTIYTG GFGEFGYWSD KNKHLSYTSL TNLIPHPHRL KDEIWKIYTF GKKVIFQSFS AIYIYENKKI SVVTAQQSLL FLHQVGQRFY VEVNGKGLFE LTGNKLIPLK NNGLVLPKGV LSILPYQNGS LLIGTSKDGL FVYNGENFNP LNTPANTFLK TYQLNNGTRI QDRYYAYGTI LNGLIIIDET GNIVQRINKS SGLQNNTVLS LYADQDQNLW AGLDNGIDRI ELNSPLYFYF DKTGQFGTVY SSLIYKNNIY LGTNQGLFYS TWASGRGNLF NTFDFKLIPN SQGQVWDLTL IDDQLFCGHN DGTFKVTGNK LENISTVKGG WTIKKLHSNP NFLIQGTYNG LVLFKKDGAG QWVFWHKIEN FGEPSRYVEQ DTRGDIWVSH AYKGLYKLSL SPDFKKVTTI KTYDERNGLP GDYNINIFTL ENRLVFSSDE GFFIYDEISN RFTKYTTLNK ELGSFAGANK IIDAGSKKYW FINHGKMGLV HLLEPGKVQV DSSTFSILDG RMVQYYENIS KISDKIYLMS VDDGFVIYNA TENRNGKNHT ILPQVLIRKI EDITDTYHTI SEFGDKDTEI EIPFSRNSIR ISFALPWYRQ SKIKFQYYLE GYSKQWSDWS AASQKDFTNL GQGSYVFKVR ARINESTVSK VTEFRFTIHP PFYASNWAIA LYLILFILLL FTFKRLYERK LRKDQRAISD KLQAEKEAFL KKEAEATEKQ IIKLQTEKLQ AELAGKNREL ANSAMSLVYK NELLQKLSQE ILKLKDESGK PLAEDQLRKI QKVIDEGMND ERDWNLFESS FNEAHESFFK KLKVNHPDLV PNDLKLCAYL RMNMSSKEMA SLLNISLRGV EIRRYRLRKK LDVPHDKNLV EFLMEL
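The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the Gene sequence could be cleaned, checked for GC content, and translated. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). The helper names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field
prefix = clean_sequence("ATGAAGCACT CTGGCCTTAA ATACCTTTTA")
print(translate(prefix))  # MKHSGLKYLL
```

The printed residues match the first block of the Protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the GC content field of the record.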
|
| |