Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3997 |
Symbol | |
ID | 6411679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4283460 |
End bp | 4285205 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642713879 |
Product | integrase family protein |
Protein accession | YP_001992968 |
Protein GI | 192292363 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTG CAACCAATAT TAGCCGCCGG CCCGGTAGCC GGAATTATTA TGTTCGGATG GCCGTGCCGC GCGATCTCCA GGTGCGCATG GGCACGCCCG GAAAGCCCCG TAGAGAGCTT CGCAAGTCGC TGAATACGCC GGACGCGCGG GAGGCCAAAC GCCTCTCACG GCCAATTCTG GACGAATGGG AGCGCACATT TGCCGAGCTG CGGCGCCCCA AGCAGTTGAC GGAAGCCGAG CTGCAGAACG CGATCTGGCG CCGATACCTT GAGCTGATCA ACGCCGACGA GAGGTTCCGG CAAGAGCTGC CGACCGGCGA CGAACTGAAT GCGATCTGGG AATATCTGGA AGCCGAGTTC GGCGAGCTGA ACATCACGGC CTACAGGATC TTCGAAGAGC TGCGGGACCG GTTCGAAAGC AACCAGCGGG AGCGAGTCGA GCGGCTGGCG CAGATGAAGG TAGAAGCCGC CCGCGGCGAA ACGAAGCTGA TCGCGGACGT GGTCGAGCAA GTCATCGAAG CCCGGCGGCT TGGGGTCGAT CCGGGAACGC CCGAATACCG CAAGCTGGCC CAAGGGCTCC AACGTGCCGA GCTGGAAGGG CTTAAGCGGA CGGTTGAGCG GGACGCTGGC GACTTCTCCG GCGAGTCCAA AGACAAGCTG GTGCAGCAGC CGACCGTATT CGATCCGCCG AAGGGCGAGG GCATCCTAGA GCTTTACGAT CGCTATGCGC GGGAGAAGTC GGGCAGGGTG TCGGCCGACA CTTGGGCGCA GAACCGGAAG GTGGTGGCGC TCTTTGACAA CTTCGTTGGA GGCAACGCGC ACATTTCAGC GCTGACTCGG AAGAACGTCC GGGAGTGGAA AGAGAAGTTG TTCGAATGGC CGGTGAAGGC GATCGAAGCA AGCGAGTTCC GCGGGCTGTC GTTCCTCGAC ACGATCGAAC GCAACAAGGT CGTCGGCAAG CCGGTGATCC AGCACAAGAC GATCAACCGA TATCTGGCTG CATTGGGCGG TTTCAGCGAC TGGTTGCTGG CGAACGACTT TATCGGCGAG CAGATCATGC AGGGCATGTA TCTGGAAGTC GATCGCCGGA AAAAGACGGT GCTGCCCTAC AGCGCCGATC AGATGCGCCG CATCTTCGAA TCGCCTCTCT TCCACCGCTG CGGTGGTGAT AAGCTGGAGC ACCAGAAGGG CAACGTTGAA GTCCGGGATT GGCGCTACTG GATACCTCTG ATCGCCGTCC ACTCCGGTGC CCGGCTCGGC GAGATTTGCC AGTTGATGAC GGCCGACGTT CGGCAGCTTC ACGACGTCTG GATTTTCCAC ATTACCGAGG AAGGCGGGGC GGGCACGAAG TCGACCAAGA CCGAAGGTTC GATGCGGGTG GTGCCGATGC ATTCGAAGCT GATCGAACTT GGTTTCTTGA AATACCATGC CCGCATGTCG GCGATGGGCG ACCGGTTGTT CCCTGAGATC AAGGCGGATG CGCGCGGCTA CATAAGCGGC AAGGCGTCAA CGTTCTTCAA TGATTACTTC CGTGCGATCG GCGTGAAGTC TGACCGCTCT TTGAATTTCC ACAGCTTCCG GCATGGCTTT GCAGACGCGC TGCGGCGGGC TGGCTACTAT GACGAACAGT TTGGGCCGCT ACTTGGGCAT ACGAAATCGA CCACGACGGG GCGCTATGGC ATCGAATCTG AAATGGTGAT CGCAGATCGT GTCAAGATGG TCGAAGCGGT CATCCAAGCA AAGTGA
|
Protein sequence | MAIATNISRR PGSRNYYVRM AVPRDLQVRM GTPGKPRREL RKSLNTPDAR EAKRLSRPIL DEWERTFAEL RRPKQLTEAE LQNAIWRRYL ELINADERFR QELPTGDELN AIWEYLEAEF GELNITAYRI FEELRDRFES NQRERVERLA QMKVEAARGE TKLIADVVEQ VIEARRLGVD PGTPEYRKLA QGLQRAELEG LKRTVERDAG DFSGESKDKL VQQPTVFDPP KGEGILELYD RYAREKSGRV SADTWAQNRK VVALFDNFVG GNAHISALTR KNVREWKEKL FEWPVKAIEA SEFRGLSFLD TIERNKVVGK PVIQHKTINR YLAALGGFSD WLLANDFIGE QIMQGMYLEV DRRKKTVLPY SADQMRRIFE SPLFHRCGGD KLEHQKGNVE VRDWRYWIPL IAVHSGARLG EICQLMTADV RQLHDVWIFH ITEEGGAGTK STKTEGSMRV VPMHSKLIEL GFLKYHARMS AMGDRLFPEI KADARGYISG KASTFFNDYF RAIGVKSDRS LNFHSFRHGF ADALRRAGYY DEQFGPLLGH TKSTTTGRYG IESEMVIADR VKMVEAVIQA K
|
| |