Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3845 |
Symbol | |
ID | 3911648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4390822 |
End bp | 4396767 |
Gene Length | 5946 bp |
Protein Length | 1981 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885745 |
Product | hypothetical protein |
Protein accession | YP_487449 |
Protein GI | 86750953 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03174] CRISPR-associated protein Csc3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCTGT CCGCGATTCA GGAAGCCCTG AACATCAGCG ATGCGCCTGC GGCGACCGAC GATCCGATCT CGGCGCGGAG CGACAATTCG CCGGCGCTCT CGCCGCCGCC CGCTTTCAAC GACGGCGCGC CGGAGCCGCG GATCGCAACC GACCGTCCCG GATTCGAACA ACACGACGAC GACCGTCAGT TCGCGCAGCG CCCCGCCAAT GACGACCGCG AGACCATCGG CCAGTTGCTG CAGGCGCTGC AGAAGGGCCG TCCGGCCCGC AACGTCTACA CCCTCGCCTC GATCTTCGCC GGGGTCTGGA TCTTCGGCGG CATCGTCCTC ACCATCAGCT TCCTGCCCTC GCTGCAGGCG ATGGTCGGCC AGGGCGCCGG CGGCACGATG GCGCTGGTCG GCCTGATCGC GCTGTTCCTC GCCCCGGTGA TGCTGTTCTA CTTCCTCGCC AGCCTGACCT GGCGCGGCCA GGAACTGCGG ATGATCGCGC AGTCGATGGC GCAGGTCGCG ATCCGATTCT CGGAGCCCGA GATCGCGGCG AGCGATTCCG TCGTGACCGT CGGTCAGGCG ATCCGCCGCG AGGTCGCGGC GATGGGCGAC GGCGTCGAGC GCGCGATCGC GCGCGCCGGC GAGCTCGAGA CCCTGGTCGC CAACGAGGTC TCGGCGCTGG AACGCGCTTA CTCCGACAAC GAAGTCCGCA TCCGCGCCCT GCTGCAGGAC ATCGCGCATC AGCGCGACAA TCTGGTCGGC CAGGCCGAGC AGGTGCGTAG CGCCATCTCC GGCGTGCAGA TCGATCTGCG CCACGACATC GCGCTGATCT CCGACGCGAT CGCCAGCCGC GTCGACGAAG TTGCGAAGAG CATCACCAGC GCGCTGGAAG AGCGCGGCGC CCACATCACC GGTGCGCTGA GCCACGCCGG CGACAACATG ATCCTCGCCC TCGGCGAGCG CGGCGGCGAT CTGCTCGACC GCCTGGAAGA AGCCAGCGCC GAGACCACGC GCGCCGTGCT CGACGCCAGC GAGCGGCTGA CCGCCAGCCT GAATTTCAAG ACCGGGCACG TTCACGACGA ATTCGTCGAA CTCGCCGACC GCGTCCACGA CATGCTCAAC GAGCGGCTCG ATCGCATCAC CGGCGAGTTC GAACAGCGTT CGTCCAGCAT CGTCGACGGC ATTTCCGAGC GCACCGAGCA GGTCCACGAC TCGCTGAAGA ACTCCTCGGA CTCGCTGCTG CTCGAACTCG AGCTGCGCAG CGGTGATCTC GTCAGCAAGA TCGACGACGC CGGCAACCGC CTCGCAGACC AGATCCTGAC CAGCGGCGAC AAGGCCAGCG AAGCGCTCGA CGTCACCGTG AATACGCTGG TGGCCAAGGT CGTCAGCCAG ACCGAGAACG CCCACGACGC CCTCAGCATG CAGATGAGCG CGTTCGACGA TCTGGTGAAG CAGCAGGGCT CCGAGCTCGC CGAGAAGTTC GCGCGCGATT CGGGCACGCT CGGCGCGCTG ATCACCCGCC ACATCTCCGA ATTCGACCGC ACCGTGAAGA CCTACGGCGG CGACATCGTC GAGCGCATGG GCCAGCGTAC CCAGGACATC TCGACCACGC TGAAGGACTA CGTCGACAAT TTCGACCAGC GCGTCACCGC GCGCGGCGGC GAGCTCAGCG CCACGCTGGA TCAGCGCCTG TCGCATTTCG AGAACGTGTT CGAGACCCGC GTCGGCAGCC TCGACGGCTC GCTCGACGTC AAGATCAAGT TGCTGGACGA AACCGTCAGC GAACACCTGC GCTCGCTCGA AGACGGCTTC GCATCGCGCA CCCAGATCGT CACCGAATCG ATCGACAGCC GGATCGCGCG GCTCTCGGAA ACGCTGACCA CCGGCTCGGA TCAGGCGATC GTCGCCATCG ACAGCCGGAT CGCCGAACTG TCGACCACGA TGTCGAGCGG CACGAGCCAG GCCGTCGATC TGATCGACAC CCGCATCGAC AAGCTCTCCG GCACGCTGAC GGCGGGCGCG GTGCAGGCGA TCGAGGCAAT CGATCAGCGC CTGTCGCATC TCACCAACTC GCTCACCGAC GGCACCGCGC AGGCGATCCA CGCGATCGAT CAGCGCATCG GCAACGTCAC CGAGACGATC GACAGCCGCA GCGCCGGACT GGCGGCGACC GTCAGCGCCC GCTTCCAGGA GATCCACGAA GGCCTCGAAG GCCGGGTCGG CACCGTGCTG ACCGACATCG ACCTGCGCGT GTCGCAATTC GAGGACCTGC TCGGCTCGCG CATCGACGCG GTGGCCGGTC GATTCGAGAA CACCGGGCGC GAAGCCAGCG ATCTGCTGAT GGCGCGCGCC GAAGAACTCT CGCTCGGCAT CAAGTCGCAT GTCGCCGACG CCGAGCGTTC GCTCACCGGC CTCGTCGTCA GCACCAGCGA CACGATCCAG AGCGGCACCC GCTCCGCCAA GGAAGCGATG CTGTCGGTGT CGACCGAGGT CGGCGCGCAG CTGCAACTCA CCTCGTCCGA GATCGAACGC GCCCTCACCG CGGTCGGCAC CAGCGCCGCG AACACGGTGC TGCTCAGCGC GACCGAGGCC CAGTCCAAGC TGGTCGCAGC GTCGAGCGAT ACGCTCGCCC AGATCAAGTC GCTGTCGGCC GACGTCGAGC GCACCCTCGC GGTCGCCGGC ACCTCGACCG CGGCATCGGT GCTGGCCGGC GCGCGCGAGG CGCAGACCAC GCTGGTCGCC ACCTCGACCG ACGTCACCAA CGAGATCAGG TCGCTGTCGG CCGAAATCCA GCGCACGCTG ACCGACGTCG GCGGCACCGC CTCGACCTCG GTGCTGAACA GCGCCCGCGA GGCGCAGGCG ACCCTGGTGT CGTCGTCGTC CGACGTCGCC AGCCAGATCC GCGCGCTGTC CGCCGATATC GAGCGCACCC TGACCGGAGC CGGCGACGCC ACCTCGGCAT CGGTGCTGGC CGGCGCCCGC GAGGCGCAGT CGACCCTGCT GTCGGCCTCC ACCGAGGTCA CCGGCCAGAT CCGGTCGCTG TCGTCGGATA TCGAACGCAC GCTGTACGCC GCCGGCAATG CCACTGCGGA GTCCGTGCTC GGCGGCGCCC GCGCCGCCCA GACCACGCTG GTGGCAGCCT CCGAAGAGGC CGCGGAGCGC GTCAAGTCGC TGAGCTCCGA CGTTCAGCGC ACGCTGTCCG AAGCCGGATC CGCGACCGCC AGCGCGATCG TCGCCGGCGC GCGCGAAGCC CAGAACACGC TGGTCGCCGC CTCCTCCGAG GCGTCGAGCC AGGTGATCTC GCTCGCCAGC GATGTCGAAC GCAAGCTGAC GGCGGCCGGT AGCGCCACCG TGGAGACCCT GCTGTCCGGC GCCCGCGAAG CCCAGCACAC CCTGGTGGCG ACCTCGACCG AGACGTCGAA CCAGCTCAAG GCGCTGACCG GCGACGTCGA GCGCACCCTG ACCACCGCCG GCAGCGCCAC CGCCGGCGCC ATTCTCGCCA GCGCCCGGGA CGTCCAGAGC ACGCTCGCCA ACGCTTCGAC CGCGGCCTCC GATCAGGCCC GCTCGCTCGC CGCAGAGGTC GAACGCAGCC TGATCACCGC CGGCAAGACC TCCGCCGAAT CGATCGTCAC CGGCGCGCGC GAAGCGCAGA CCACGCTGAT GACAGCCTCG GAGGAAGCCG CCAACCACGT CAAGTCGCTG GCGATCGACG TCGAGCGCAC GCTGACCACG GTCAGCGCCG CGGCGGCGTC GACCATTCTG GGCAGCGCCC GCGACGTGCA GTCGACCCTC ACCAACGGCT CGGCGGAAGC GGCGAGCCAG ATCAAGGCGA TCTCCGCCGA CATCGAGCGC ACGCTCGCCG GCGTCACCAT CAACACCACC GACAACATCC AGGTCAGCGC GGCGGCGGCC CAGAGCACGC TGGTCGCGGT GTCGAACGAG GTCAGCTCGA AGATCAAGTC GACTTCCGCG GAGATCGAAC GCTCGGTGTT CGCCGCCAGC GGCAGCTTCG GCTCGACCAT CACCGCCAAG AGCGACGAGA TCGTCAGCTA CGTTCAGCAG CAGGCCGAGC GCCTGTCGCA GATCATCGAC GGCAAGCGCG GCCTGCTGGT GGAAGCCATC TCCGGCAAGA CCAACCAGCT CACCATCGAG ATCGACCGCG TCACCACCGA CGCGCTCGGC GCCATCGAAA GCCGCGGCAA GGCGTTCTCG AATTCGATCC TGACCAACGG CAACGACGTC GCGCGCTCGA TCACCGGCGC CGGCGATCTC GCCACCGGCG CGATCAACAA GTCGCTCAAG GATCTCGAGC TCGCCTCGCG CTCCGCGATC GAACAGTCGC GCCAGGTCTC GGTCACCGCC GTCACCGAGA TGCAGGAGAC CAGCAAGATC CTGCGCACCG ACACGGTCGC TTTGTTCGAG CGGCTGCGCG AAGGCAACAT CCTGCTGCAG GAAGTGCTCA CCGGCGCCCA CGAGAACCTC AACTCGCTGG AGCGCGCGCT GGTCACCCGC GTGGCGGATT TCGTCACCAC CATGAACGAC GTCAACGCCC GCAGCGGCGC CGCCACGCAG GGCCTGGAAG ACCAGCTCAC CGGCTTCCAC GGCAAGACCG CCAAGGCGCT GGCCGATCTC AGCGCGTTGT CCGAGCAGTT CGAGATGCAC GGTCAGGCCC TGGCCGAAGC CGCGGAGCTG GTCCAGCAGA GCAACCGTAG CGCATCGACC TCGGTGCTGG AGCGCAAGGA ATCGCTGGAG TCGCTGGTCA CCACCATCGA CCTGCGCACC GCCGATCTCG ACCAGCGGCT CACCCGCTTC ACCGGCCTGC TCGACGAATC GCTCGCCGCG GCTGAGGAGC GCGCCCGCGA CATCGCGCGG ATCGTCGCCG AGACCGCAGG CGCCGGCTCC GCCAGCATCA CGCAGCAGTT CGAAGCGGTG CGCAGCGCCG CCGAGCAGGA GCGCCGCCTC ACCACCGAGG CGATGCACCA GGTCTATCAG CAGGGCACCC AGGAAGCCGA TGCGCTGTTC AAGCAATCGG CCGACCGCTT CGCGGCGATC GTCCAGGGGA TCAAGCAGAT GGCTTCGGAG ATGCACCACG AACTCGAAGC CACCCGCGAG GAGCTGCGAC GCGGCGTGCT CGAAATCCCG CAGGAGGCCG CCGAGAGCAC CGCGCAGATG CGCAAGGTGA TCGTCGACCA GATCGAGGCG CTGGCCGAAC TCAACCGCAT CGTCGCGCGC CACGGCCGCG GCATGGATGT GGTCGGTACC ACCCGCGCCA GCGCGGTCCG CGACGAAGAG CCGATGCTGG CGACCGCCGG CGGCCGCGGC GAGGCCATCG CGCGCGCGCC GGAGCCGATG TCACGGCCGG AGCCGGTGTC GCGTGAACCC GCGTCGCGGC CGGCACAGCG CCGCGAGGCC AGCGCCGCAA CGCTGCCGCC GCCGGACCTC GGCCTGCCGG GCCCTGCCCC GCGCCGCACC GAGGCCCCGC CGGTCGCGCC GACCGGCGCC GATCCGGGCC GCGACGGCTG GCTGTCGGAT CTCCTCAACC GCACCGACGC CGGTGCCGAG CGCGAACTTC CGCGCGCCCG GACGCCGCAG CAGCCCACGC CGCAGCAGGC GATGAGCGGC AATCCGCTGG AATCGCTGTC GCTCGACATC GGCCGGCTGA TGGACCGCAA CCTGGTGCTG GAGATGTGGG ATCGCTATCA GCGCGGCGAG GCCAAGGCGT TCAGCAAGCG GCTGTACACG CCGGCCGGCC AGAAGGCGTT CGACGAGGTC GCCCGCAAGT ATCGCAGCGA CCGCGCCTTC AAGCAGACGG TGGACCGCTA CATCACCGAG TTCGAACGCC TGCTCGACGA AGTCGCCCGC GACGAACGCG GCCCGCAGGT GCTGCGCACG CATCTGAGCT CGGAAACGGG CCTCGTGTAC ACGCTGCTGG CGCATGCGGC GGGGCGGCTG GGGTAA
|
Protein sequence | MALSAIQEAL NISDAPAATD DPISARSDNS PALSPPPAFN DGAPEPRIAT DRPGFEQHDD DRQFAQRPAN DDRETIGQLL QALQKGRPAR NVYTLASIFA GVWIFGGIVL TISFLPSLQA MVGQGAGGTM ALVGLIALFL APVMLFYFLA SLTWRGQELR MIAQSMAQVA IRFSEPEIAA SDSVVTVGQA IRREVAAMGD GVERAIARAG ELETLVANEV SALERAYSDN EVRIRALLQD IAHQRDNLVG QAEQVRSAIS GVQIDLRHDI ALISDAIASR VDEVAKSITS ALEERGAHIT GALSHAGDNM ILALGERGGD LLDRLEEASA ETTRAVLDAS ERLTASLNFK TGHVHDEFVE LADRVHDMLN ERLDRITGEF EQRSSSIVDG ISERTEQVHD SLKNSSDSLL LELELRSGDL VSKIDDAGNR LADQILTSGD KASEALDVTV NTLVAKVVSQ TENAHDALSM QMSAFDDLVK QQGSELAEKF ARDSGTLGAL ITRHISEFDR TVKTYGGDIV ERMGQRTQDI STTLKDYVDN FDQRVTARGG ELSATLDQRL SHFENVFETR VGSLDGSLDV KIKLLDETVS EHLRSLEDGF ASRTQIVTES IDSRIARLSE TLTTGSDQAI VAIDSRIAEL STTMSSGTSQ AVDLIDTRID KLSGTLTAGA VQAIEAIDQR LSHLTNSLTD GTAQAIHAID QRIGNVTETI DSRSAGLAAT VSARFQEIHE GLEGRVGTVL TDIDLRVSQF EDLLGSRIDA VAGRFENTGR EASDLLMARA EELSLGIKSH VADAERSLTG LVVSTSDTIQ SGTRSAKEAM LSVSTEVGAQ LQLTSSEIER ALTAVGTSAA NTVLLSATEA QSKLVAASSD TLAQIKSLSA DVERTLAVAG TSTAASVLAG AREAQTTLVA TSTDVTNEIR SLSAEIQRTL TDVGGTASTS VLNSAREAQA TLVSSSSDVA SQIRALSADI ERTLTGAGDA TSASVLAGAR EAQSTLLSAS TEVTGQIRSL SSDIERTLYA AGNATAESVL GGARAAQTTL VAASEEAAER VKSLSSDVQR TLSEAGSATA SAIVAGAREA QNTLVAASSE ASSQVISLAS DVERKLTAAG SATVETLLSG AREAQHTLVA TSTETSNQLK ALTGDVERTL TTAGSATAGA ILASARDVQS TLANASTAAS DQARSLAAEV ERSLITAGKT SAESIVTGAR EAQTTLMTAS EEAANHVKSL AIDVERTLTT VSAAAASTIL GSARDVQSTL TNGSAEAASQ IKAISADIER TLAGVTINTT DNIQVSAAAA QSTLVAVSNE VSSKIKSTSA EIERSVFAAS GSFGSTITAK SDEIVSYVQQ QAERLSQIID GKRGLLVEAI SGKTNQLTIE IDRVTTDALG AIESRGKAFS NSILTNGNDV ARSITGAGDL ATGAINKSLK DLELASRSAI EQSRQVSVTA VTEMQETSKI LRTDTVALFE RLREGNILLQ EVLTGAHENL NSLERALVTR VADFVTTMND VNARSGAATQ GLEDQLTGFH GKTAKALADL SALSEQFEMH GQALAEAAEL VQQSNRSAST SVLERKESLE SLVTTIDLRT ADLDQRLTRF TGLLDESLAA AEERARDIAR IVAETAGAGS ASITQQFEAV RSAAEQERRL TTEAMHQVYQ QGTQEADALF KQSADRFAAI VQGIKQMASE MHHELEATRE ELRRGVLEIP QEAAESTAQM RKVIVDQIEA LAELNRIVAR HGRGMDVVGT TRASAVRDEE PMLATAGGRG EAIARAPEPM SRPEPVSREP ASRPAQRREA SAATLPPPDL GLPGPAPRRT EAPPVAPTGA DPGRDGWLSD LLNRTDAGAE RELPRARTPQ QPTPQQAMSG NPLESLSLDI GRLMDRNLVL EMWDRYQRGE AKAFSKRLYT PAGQKAFDEV ARKYRSDRAF KQTVDRYITE FERLLDEVAR DERGPQVLRT HLSSETGLVY TLLAHAAGRL G
|
| |