Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_1041 |
Symbol | |
ID | 6462890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 1074299 |
End bp | 1076011 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642727290 |
Product | transposase IS4 family protein |
Protein accession | YP_002017940 |
Protein GI | 194336146 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00324527 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTAAAC CAGTAACTGG AAAAACTCAC GTTGGCGAAC GGCGTGAAAG GCGCCCGAAT GGCGATATTT ACATCTATGA GCGGGTTACA GGCTATAATG AACGAACCAG AAAGACCTAT ACCGTCAGTC AGAAGCTCCA AGGGAAAATC AAGTCGGGAA CACAGGAGAT AATCCCTACA CGCCCGAAAA AAAGCAAAAA TGAAGGGGGC TTGGTTGGTG CGCTACGCAG GCACAGTGGC CTCACGGATC TTCTGGAGTG GGTCGGTAAA GCCTCCGGCA TTGACGATGA CGTGCGTTCT TCGTTCAGCG AGGGCGACGC TGCCAAGATT CTCTCTATTG CGCGTTACTG GATCGGTTCC GGCGGAAACA CCTTGCCACG CCTTGAGGGT TGGCAGGTAA TGCACTCCCT TCCCTACAGC GAAGTTATCA CTGAAGACGT TTACAGTGAC TTGTTCAAAT GTGTCGGGTG CGACGAAGAC CGCGTGCAGC GCTACTTCTC CTTCCGGGCG GATCGCCTGG GCAAGGCACC TGTGCTGGCG TTTGATTCGA CCACGATATC GACCTACTCC GAGAACCAGT CGGAAGCACG GCATGGGTTC AATAAGGATG GTGACGGACT GAAAACCATC AAGCTTCTAA CCTTGTATTC CGTAAAGGAA CGCGAGCCGT TAGCATTCGC CAAGCAGCCT GGCAATGTTC CGGACGTCAT ATCCATTGAG AACACCCTGA AGCAACTCAA GTGCTTTAAC CTTGAGAAAC CCTTGATCGT TACCGATAAC GGCTACTACA GTGAGGGGAA CATGATGAAA TTCGCCTTGC GCAACATGAA ATTCCTTACT CTGGTTGACC CCAATATCAC TTGGGTTCGC GAGACGGTGG ATGCGCTTCG GGAGACGCTG GCGGGCATGT CGAGCACTTG CCAGTTTGAT CCGTCGGTTT GCGGCGCCAC GGCGATGAGA ATGCACGAGT TCGGTCGAGT GCGCCAACTG TCGCGAAACG GCAAGCTGAG CGGAGAAGAA GAGAGGTTCG TGCGCCGCCT TTATGTCCAT GTCTTCTATT CCCCTAACAG CGACACCAAG AAGCAGCTCG ACTTCCGCAA GGAGCTGTTT GAACTCAAAG CTCAAGTAGA AGATGGGGTG ATGGAGTTCA CGAAATCTGC GCAACGAAAA ATCGAGCGAT ATCTGATCTG CTCAAAAAAG GGGCGTGGTG GACAATTGAG GGTCGGTTTC AACGACAAGG CCATTGTCGA AGCGACGAAA TACTTCGGCT ATTTTGCGCT CGTCAGCAAT CAGGCCATGA AGACCTTCAC AGCGTTGGCA GACTATCGAT TGCGTGAAAA AATAGAGGAG ATCTTCGCCG TGGTGAAGGG AGGCCTTGAT GGAGCAAGGC CGCGCACATG GCATCCAGAC AACCTGCGGG GAAGGCAATT CGTACAGTTT GTGTCACTGG GGTATCATTG CTTCCTGACC AAGAAAATCA AGGAAATGCG GGCAGAGCTT GGAAAGAACG GAAGCGAAAA AACCAAAGCG CTTGTCAATC TCGAAAAAAA ATTGGACCAG TGGCTTGAGC AACGATCTCT CGCTCAGATT CTGGATTGGT TCGATTGCAT TGAGACGACT CAAGTACGAA CTGTCATGGG CAACTACCGC TGGTCTACTG AATCCGTAGC CAGAGATCGG CTGTTCCTGG AATATCTGGG AGTGACCTCA TGA
|
Protein sequence | MSKPVTGKTH VGERRERRPN GDIYIYERVT GYNERTRKTY TVSQKLQGKI KSGTQEIIPT RPKKSKNEGG LVGALRRHSG LTDLLEWVGK ASGIDDDVRS SFSEGDAAKI LSIARYWIGS GGNTLPRLEG WQVMHSLPYS EVITEDVYSD LFKCVGCDED RVQRYFSFRA DRLGKAPVLA FDSTTISTYS ENQSEARHGF NKDGDGLKTI KLLTLYSVKE REPLAFAKQP GNVPDVISIE NTLKQLKCFN LEKPLIVTDN GYYSEGNMMK FALRNMKFLT LVDPNITWVR ETVDALRETL AGMSSTCQFD PSVCGATAMR MHEFGRVRQL SRNGKLSGEE ERFVRRLYVH VFYSPNSDTK KQLDFRKELF ELKAQVEDGV MEFTKSAQRK IERYLICSKK GRGGQLRVGF NDKAIVEATK YFGYFALVSN QAMKTFTALA DYRLREKIEE IFAVVKGGLD GARPRTWHPD NLRGRQFVQF VSLGYHCFLT KKIKEMRAEL GKNGSEKTKA LVNLEKKLDQ WLEQRSLAQI LDWFDCIETT QVRTVMGNYR WSTESVARDR LFLEYLGVTS
|
| |