Gene Ppha_2029 details

Gene Information

Locus tag: Ppha_2029
Symbol:
ID: 6462895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2118667
End bp: 2120313
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 49%
IMG OID: 642728225
Product: transposase IS4 family protein
Protein accession: YP_002018855
Protein GI: 194337061
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
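
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2118667, 2120313)
    print(length, "bp")                      # 1647 bp
    print(protein_length_aa(length), "aa")   # 548 aa
```

Under these assumptions the computed values agree with the 1647 bp and 548 aa reported in the record.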


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.417393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGTCG ATTCCAGCAA GACCACCATT AATGGCAAAA CCTACCAGCG CCATCTTTTT 
CGTGAATCCT ATCGTGAAGA TGGGAAGGTG AAAAATCGCA CGTTGGGCAA GATCTCAAAA
TGTTCGGAAG GAGAGATTGC CGCCATTAAA CTTGCCCTGA AGTACAAAGA CAATTTGGCA
GCCCTGTTGC ATATTGAGGA TGTTGAACTA CATGAGGGGC TTCGTGTTGG TGTGGTATAT
GCACTCAAAA CTCTTGCCGA GAGGCTTGGT ATCAGCAAGA CGCTTGGTAA TACCCGCCAG
GGAAAGCTTG CATTGTGGCA GATTATGGCT CGATTGATCG GGCAGGGTTC ACGACTCGGT
GCGGTCAGGA TGGCAGCAAG TTATGGCGCC TGTGATGTCC TTGAATTAGA ACGCTTTACA
GAAGATGACC TCTATGCCAA CCTGGCTTGG TTGACGGAGC ATCAGGAGCG CATTGAGCAG
CAGTTATTCA AGCACAATTC AGCGGGTGCT GGCCCGGAAT TGTTGCTCTA TGATGTTACC
TCATCATATC TTGAAGGGAT GGAGAATGTG CTTGCTGCCT TTGGCTACAA CCGGGATGGC
AAAAAGGGGA AAAAGCAAAT CGTTATTGGT TTGTTGTGTA CCGCAGATGG CGATCCGGTT
GCTGTCCGGG TCTTTGCAGG CAATACCGGC GACAAGTCAA CCGTTGCAGA GCAGATTCGC
ACCGTTGCCA ACACTTTTGG CATTGAAGAG GTAACGATGG TTGGCGACAA GGGTATGATC
AAAACGCCGC AGGCCAAAGA GTTGACCGAT GCGGGATTTT ATTACATCAC CTCACTCTCG
AAACCCGAAA TCAGAACCCT GCTGAAGGCA GAGGTACTGC AAATGGATTT TTTCGATTCA
GAGCTGTATG AGGTTGAAAA TAAAACCGAT GGCGTTCGAT ATGTGCTGAG AAGAAATCCG
GTTCGAGAGG CCGAGATGGC AAAGAATCGC CAGGAACGGG TGAAAAAAAT CCAGCGTCTT
GTTGAGGAGA AAAACAGTTA CCTCGCCGGT TCCCTCAAGC GTGACAAGGA TGTTGCACAA
CGCTCTCTTC AGAAAAAGAT CAGCCAGTAT AAGCTCAATG ATGTGCTGGA ACTGACCCAT
CAGGAGAGGG TGTTCACGGT AACGGTCAAT GAAGAAATCC TGAAAGGAGT CGCTCTGCTG
GACGGCTGTT ATGTGATCAA AACCGATGTC AAGAAAGAGC TGCTTTCGAC CGAACAAGTC
CATGATCGGT ACAAAGATCT GGCCAAAGTG GAGCATGCAT TCCGGACGTT CAAGCAAAGT
CATCTTGAAA TCAGACCGGT TCATGTGCGA ACCGAAGCGA GCACTCGTGG CAATGTCTTT
GCCGTTATGC TTGCCTATAA AATCGAGAGG CAGTTATCAG AACTCTGGAA AAAATGTGAA
TGCACGGTAC CGGAAGGAAT TGATGAACTT GGCGCAATAC GCAGCACAAT CGTCACCCTC
AAAGGGTCAA GCTGTCAGAA AATTCCCCAG TCGAAAGGAT TGGCTGCTGA GTTGCTTGCC
GCTGCCGGGA TTACCCTTCC TTCGGTCATT GATGCCAAAA ATGTTGATGT AGTCACAAGG
AAAAAACTGG CCCCGAAGCG TAAATAA
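
The GC content field reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown (blocks of 10 bases separated by spaces and line breaks). The helper name `gc_content` is hypothetical, and the database's exact rounding rule is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full sequence
    # to compare against the GC content value in the record.
    first_line = "ATGTACGTCG ATTCCAGCAA GACCACCATT AATGGCAAAA CCTACCAGCG CCATCTTTTT"
    print(f"{gc_content(first_line):.1f}% GC")
```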
 
Protein sequence
MYVDSSKTTI NGKTYQRHLF RESYREDGKV KNRTLGKISK CSEGEIAAIK LALKYKDNLA 
ALLHIEDVEL HEGLRVGVVY ALKTLAERLG ISKTLGNTRQ GKLALWQIMA RLIGQGSRLG
AVRMAASYGA CDVLELERFT EDDLYANLAW LTEHQERIEQ QLFKHNSAGA GPELLLYDVT
SSYLEGMENV LAAFGYNRDG KKGKKQIVIG LLCTADGDPV AVRVFAGNTG DKSTVAEQIR
TVANTFGIEE VTMVGDKGMI KTPQAKELTD AGFYYITSLS KPEIRTLLKA EVLQMDFFDS
ELYEVENKTD GVRYVLRRNP VREAEMAKNR QERVKKIQRL VEEKNSYLAG SLKRDKDVAQ
RSLQKKISQY KLNDVLELTH QERVFTVTVN EEILKGVALL DGCYVIKTDV KKELLSTEQV
HDRYKDLAKV EHAFRTFKQS HLEIRPVHVR TEASTRGNVF AVMLAYKIER QLSELWKKCE
CTVPEGIDEL GAIRSTIVTL KGSSCQKIPQ SKGLAAELLA AAGITLPSVI DAKNVDVVTR
KKLAPKRK
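
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code; the alternative start codons of table 11 are not modeled here. The function name `translate` and the table-building approach are illustrative choices, not the database's own method.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G); the amino acid
# assignments are the same for the standard code and table 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGTACGTCG ATTCCAGCAA GACCACCATT"))  # MYVDSSKTTI
```

The first ten residues produced this way match the start of the protein sequence in the record.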