Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4336 |
Symbol | |
ID | 6412020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4662464 |
End bp | 4663633 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714218 |
Product | Integrase catalytic region |
Protein accession | YP_001993307 |
Protein GI | 192292702 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2801] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0872722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTGGC GTGAGGTGTC GGCAGTGGAT CAGAGACGAG AGTTCGTCCG GCTTGCGATG CAGGAGGGAG CGAACCGGCG GGAGTTGTGC CGGCGGTTCG GCATTCATTG GACGACCGGC TACAAATGGT TGGAGCGATG CGCGGCCGGA GGTGATGTCG TCGACCTATC GCGGCGGCCG CACGAAAGCC CGCGGCAAAC CTCGGCGGCG TGCGAGGCAC AGGTGCTGGC GGTTCGCGAT GCGCATCCGG CGTGGGGGGC GCGCAAGATC GCTAGCTCGA TGAAGCGGTC CGGACACAGC GCTCCGGCGG TGTCGACGAT CCACGAGATC CTGCGCCGGC ACGGCCGGAT CAAGCCTGCC GCCGGCGGTC CACCGGCGAC GCTGCGGTTC GAGATGCCGG CGCCGAACAT GCTTTGGCAG ATGGACTTCA AAGGCTGGGT TCGGCTCGGC AACGACGTCC GCTGCCATCC GCTGACCGTG GTGGACGATC ACTCACGCTA CGATCTGTGC CTCCAGGCTT GCGCCGATCA GCGCGGCGAA ACCGTGCAGG ACAGGCTGCA AACGACGTTC CGGCACTACG GCTTGCCCGA CACGATCTTC GTCGACAACG GCTCGCCGTG GTCGGACAGT TCCGGCGAGC GCTGGACATG GTTCTCGGTG TGGCTGCTCA AGCTCGGCAT CAGGGTGATC CGTAGTCGTC CCTATCACCC GCAGAGCCGC GGCAAGAACG AGCGTTTTCA TCGCACGCTG GACGACGAGG TGTTTGCGCT GCGGCCCTTG CGCGACCTCG CCGAGGCGCA ACGTGCCTTC GATTCCTGGC GCGAGGTGTA CAACTTCGAG CGCCCTCACG AGTCGCTAGG CCAGCTGGTG CCTGCTGATC GCTACCAGCC GAGCCGGCGT TCTCTGCCGG ACCGCCTGCC GGCTCCAGAA TACAAAGAGC GCGACATCGT CCGCTCCGTC CCGAAAACCA AGGCCTATGT CAGCTTCAAA GGTCGTTTGT GGAAGGTGCC GCAAGCCTTC GCCGGAGAGC GCTTGGCCAT CAGGCCGCTC TCCACCGACG GCAAATACGG CGTGTTCTTC GCCGCCCACC AAGTCGCCAC CATCGACTTG ACGGGCGGAA AAGGTGTCGG TCATGTCTCC GAACACGTGT CGACTATGTC TCCGGGCTGA
|
Protein sequence | MPWREVSAVD QRREFVRLAM QEGANRRELC RRFGIHWTTG YKWLERCAAG GDVVDLSRRP HESPRQTSAA CEAQVLAVRD AHPAWGARKI ASSMKRSGHS APAVSTIHEI LRRHGRIKPA AGGPPATLRF EMPAPNMLWQ MDFKGWVRLG NDVRCHPLTV VDDHSRYDLC LQACADQRGE TVQDRLQTTF RHYGLPDTIF VDNGSPWSDS SGERWTWFSV WLLKLGIRVI RSRPYHPQSR GKNERFHRTL DDEVFALRPL RDLAEAQRAF DSWREVYNFE RPHESLGQLV PADRYQPSRR SLPDRLPAPE YKERDIVRSV PKTKAYVSFK GRLWKVPQAF AGERLAIRPL STDGKYGVFF AAHQVATIDL TGGKGVGHVS EHVSTMSPG
|
| |