Gene Ppha_1041 details


Gene Information

Locus tag: Ppha_1041
Symbol:
ID: 6462890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 1074299
End bp: 1076011
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 52%
IMG OID: 642727290
Product: transposase IS4 family protein
Protein accession: YP_002017940
Protein GI: 194336146
COG category:
COG ID:
TIGRFAM ID:
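
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1074299
end_bp = 1076011

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1713 0 570
```

Under these assumptions the result agrees with the reported gene length of 1713 bp and protein length of 570 aa.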


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00324527
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTAAAC CAGTAACTGG AAAAACTCAC GTTGGCGAAC GGCGTGAAAG GCGCCCGAAT 
GGCGATATTT ACATCTATGA GCGGGTTACA GGCTATAATG AACGAACCAG AAAGACCTAT
ACCGTCAGTC AGAAGCTCCA AGGGAAAATC AAGTCGGGAA CACAGGAGAT AATCCCTACA
CGCCCGAAAA AAAGCAAAAA TGAAGGGGGC TTGGTTGGTG CGCTACGCAG GCACAGTGGC
CTCACGGATC TTCTGGAGTG GGTCGGTAAA GCCTCCGGCA TTGACGATGA CGTGCGTTCT
TCGTTCAGCG AGGGCGACGC TGCCAAGATT CTCTCTATTG CGCGTTACTG GATCGGTTCC
GGCGGAAACA CCTTGCCACG CCTTGAGGGT TGGCAGGTAA TGCACTCCCT TCCCTACAGC
GAAGTTATCA CTGAAGACGT TTACAGTGAC TTGTTCAAAT GTGTCGGGTG CGACGAAGAC
CGCGTGCAGC GCTACTTCTC CTTCCGGGCG GATCGCCTGG GCAAGGCACC TGTGCTGGCG
TTTGATTCGA CCACGATATC GACCTACTCC GAGAACCAGT CGGAAGCACG GCATGGGTTC
AATAAGGATG GTGACGGACT GAAAACCATC AAGCTTCTAA CCTTGTATTC CGTAAAGGAA
CGCGAGCCGT TAGCATTCGC CAAGCAGCCT GGCAATGTTC CGGACGTCAT ATCCATTGAG
AACACCCTGA AGCAACTCAA GTGCTTTAAC CTTGAGAAAC CCTTGATCGT TACCGATAAC
GGCTACTACA GTGAGGGGAA CATGATGAAA TTCGCCTTGC GCAACATGAA ATTCCTTACT
CTGGTTGACC CCAATATCAC TTGGGTTCGC GAGACGGTGG ATGCGCTTCG GGAGACGCTG
GCGGGCATGT CGAGCACTTG CCAGTTTGAT CCGTCGGTTT GCGGCGCCAC GGCGATGAGA
ATGCACGAGT TCGGTCGAGT GCGCCAACTG TCGCGAAACG GCAAGCTGAG CGGAGAAGAA
GAGAGGTTCG TGCGCCGCCT TTATGTCCAT GTCTTCTATT CCCCTAACAG CGACACCAAG
AAGCAGCTCG ACTTCCGCAA GGAGCTGTTT GAACTCAAAG CTCAAGTAGA AGATGGGGTG
ATGGAGTTCA CGAAATCTGC GCAACGAAAA ATCGAGCGAT ATCTGATCTG CTCAAAAAAG
GGGCGTGGTG GACAATTGAG GGTCGGTTTC AACGACAAGG CCATTGTCGA AGCGACGAAA
TACTTCGGCT ATTTTGCGCT CGTCAGCAAT CAGGCCATGA AGACCTTCAC AGCGTTGGCA
GACTATCGAT TGCGTGAAAA AATAGAGGAG ATCTTCGCCG TGGTGAAGGG AGGCCTTGAT
GGAGCAAGGC CGCGCACATG GCATCCAGAC AACCTGCGGG GAAGGCAATT CGTACAGTTT
GTGTCACTGG GGTATCATTG CTTCCTGACC AAGAAAATCA AGGAAATGCG GGCAGAGCTT
GGAAAGAACG GAAGCGAAAA AACCAAAGCG CTTGTCAATC TCGAAAAAAA ATTGGACCAG
TGGCTTGAGC AACGATCTCT CGCTCAGATT CTGGATTGGT TCGATTGCAT TGAGACGACT
CAAGTACGAA CTGTCATGGG CAACTACCGC TGGTCTACTG AATCCGTAGC CAGAGATCGG
CTGTTCCTGG AATATCTGGG AGTGACCTCA TGA
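
The GC content reported for this gene (52%) can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content means the share of G and C bases among all bases, with the spaces between the 10-base blocks ignored. The function name `gc_percent` is hypothetical, and only the first line of the sequence is used here, so the printed value will not necessarily match the figure for the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "GTGTCTAAAC CAGTAACTGG AAAAACTCAC GTTGGCGAAC GGCGTGAAAG GCGCCCGAAT"
print(round(gc_percent(first_line), 1))
```

To compare against the reported value, pass the complete 1713 bp sequence instead of the first line.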
 
Protein sequence
MSKPVTGKTH VGERRERRPN GDIYIYERVT GYNERTRKTY TVSQKLQGKI KSGTQEIIPT 
RPKKSKNEGG LVGALRRHSG LTDLLEWVGK ASGIDDDVRS SFSEGDAAKI LSIARYWIGS
GGNTLPRLEG WQVMHSLPYS EVITEDVYSD LFKCVGCDED RVQRYFSFRA DRLGKAPVLA
FDSTTISTYS ENQSEARHGF NKDGDGLKTI KLLTLYSVKE REPLAFAKQP GNVPDVISIE
NTLKQLKCFN LEKPLIVTDN GYYSEGNMMK FALRNMKFLT LVDPNITWVR ETVDALRETL
AGMSSTCQFD PSVCGATAMR MHEFGRVRQL SRNGKLSGEE ERFVRRLYVH VFYSPNSDTK
KQLDFRKELF ELKAQVEDGV MEFTKSAQRK IERYLICSKK GRGGQLRVGF NDKAIVEATK
YFGYFALVSN QAMKTFTALA DYRLREKIEE IFAVVKGGLD GARPRTWHPD NLRGRQFVQF
VSLGYHCFLT KKIKEMRAEL GKNGSEKTKA LVNLEKKLDQ WLEQRSLAQI LDWFDCIETT
QVRTVMGNYR WSTESVARDR LFLEYLGVTS
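
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the record. The following is a minimal sketch of that translation, assuming the NCBI definition of table 11: the standard amino acid assignments, with several alternative start codons (including GTG, which begins this gene) read as methionine at the first position. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGTCTAAAC CAGTAACTGG AAAAACTCAC"))  # MSKPVTGKTH
```

The output matches the first ten residues of the protein sequence listed above. The sketch does not handle ambiguous bases such as N.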