Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4342 |
Symbol | |
ID | 6412026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4668607 |
End bp | 4669776 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714224 |
Product | Integrase catalytic region |
Protein accession | YP_001993313 |
Protein GI | 192292708 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.636154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTGGC GTGAGGTGTC GGCAGTGGAT CAGAGACGAG AGTTCGTCCG GCTTGCGATG CAGGAGGGAG CGAACCGGCG GGAGTTGTGC CGGCGGTTCG GCATTCATTG GACGACCGGC TACAAATGGT TGGAGCGATG CGCGGCCGGA GGTGATGTCG TCGACCTATC GCGGCGGCCG CACGAAAGCC CGCGGCAAAC CTCGGCGGCG TGCGAGGCAC AGGTGCTGGC GGTTCGCGAT GCGCATCCGG CGTGGGGGGC GCGCAAGATC GCTAGCTCGA TGAAGCGGTC CGGACACAGC GCTCCGGCGG TGTCGACGAT CCACGAGATC CTGCGCCGGC ACGGCCGGAT CAAGCCTGCC GCCGGCGGTC CACCGGCGAC ACAGCGGTTC GAGATGCCGG CGCCGAACAT GCTTTGGCAG ATGGACTTCA AAGGCTGGGT TCGGCTCGGC AACGACGTCC GCTGCCATCC GCTGACCGTG GTGGACGATC ACTCACGCTA CGATCTGTGC CTCCAGGCTT GCGCCGATCA GCGCGGCGAA ACCGTGCAGG ACAGGCTGCA AACGACGTTC CGGCACTACG GCTTGCCCGA CACGATCTTC GTCGACAACG GCTCGCCGTG GTCGGATAGT TCCGGCGAGC GTTGGACATG GTTCTCGGTG TGGCTGCTCA AGCTCGGCAT CAGGGTGATC CGTAGTCGTC CCTATCACCC GCAGAGCCGC GGCAAGAACG AGCGTTTTCA TCGCACGCTG GACGACGAGG TGTTTGCGCT GCGGCCCTTG CGCGACCTCG CCGAGGCGCA ACGTGCCTTC GACTCCTGGC GCGAGGTGTA CAACTTCGAG CGCCCTCACG AGTCGCTAGG CCAGCTGGTG CCTGCTGATC GCTACCAGCC GAGCCGGCGT TCTCTGCCGG ACCGCCTGCC GGCTCCAGAA TACGATGAGC GCGACATCGT CCGCTCCGTC CCGAAAACCA AGGCCTATGT CAGCTTCAAA GGTCGTTTGT GGAAGGTGCC GCAAGCCTTC GCCGGAGAGC GCTTGGCCAT CAGGCCGCTC TCCACCGACG GCAAATACGG CGTGTTCTTC GCCGCCCACC AAGTCGCCAC CATCGACTTG ACCGGCGGGG AAAGTGTCGG TCATGTCTCC GAACACGTGT CGACTATGTC TCCGGGCTGA
|
Protein sequence | MPWREVSAVD QRREFVRLAM QEGANRRELC RRFGIHWTTG YKWLERCAAG GDVVDLSRRP HESPRQTSAA CEAQVLAVRD AHPAWGARKI ASSMKRSGHS APAVSTIHEI LRRHGRIKPA AGGPPATQRF EMPAPNMLWQ MDFKGWVRLG NDVRCHPLTV VDDHSRYDLC LQACADQRGE TVQDRLQTTF RHYGLPDTIF VDNGSPWSDS SGERWTWFSV WLLKLGIRVI RSRPYHPQSR GKNERFHRTL DDEVFALRPL RDLAEAQRAF DSWREVYNFE RPHESLGQLV PADRYQPSRR SLPDRLPAPE YDERDIVRSV PKTKAYVSFK GRLWKVPQAF AGERLAIRPL STDGKYGVFF AAHQVATIDL TGGESVGHVS EHVSTMSPG
|
| |