Gene Information |
Locus tag | RPD_1993 |
Symbol | |
ID | 4022475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2226929 |
End bp | 2230780 |
Gene Length | 3852 bp |
Protein Length | 1283 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637962186 |
Product | gene transfer agent (GTA) orfg15 |
Protein accession | YP_569129 |
Protein GI | 91976470 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.263252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.679187 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGCCC TCGTCCTCTC CATCGCCGGC GGCGCGGTTG GCGCGCTATT CGGGCCGGTC GGCGCGATTG CCGGGCGGAT CGCCGGTGCG CTCGCCGGCA GTATGCTCGA CCACGCGCTG CTCGGCGGCG GCGACCGCAA CGTCGAAGGG CCTCGGCTCG CCGATCTCGA CATCATGGCG TCGACCGAAG GCGCGCCGAT CCCGCGCGTC TATGGCCGGG CGCGTCTGGC AGGCCAGGTG ATCTGGGCGA CGCAGCTCGA GGAAGTGATT TCGAACGAAC AGAGTTCATC GGGCGGCAAA GGCTTCGGCG GCCCGACCAC CACGACGACG ACCTACGCGT ATTTCGCCAA TTTCGCGGTC GGGCTGTGCG AGGGGCCGAT CGGCCGCGTC GCGCGGATCT GGGCCGATGG CAAGCCGCTC GACCTGTCGG GCCTCAATGT CCGCGTTCAT CGCGGCAGCG AGGACCAGAG CCCGGATGAT CTGATCGTCG CGAAACAGGG CGCCGGCGAT GCGCCGGCCT ATCGCGGGCT CGCTTATGTG GTGTTCGAGC GGCTGCCGCT GGCTGCCTAT GGCAACCGGA TTCCGCAACT CTCATTCGAG ATCGTCCGGC CGATCGGCGG GCTGGAAGCG ATGGTGCGCG CGGTGACGCT GATCCCCAGC ACCACGGAAT TCGGCTACGA GCCGTCGACG CGGGTGCAGA TCTCCGGCAA AGGCAGCGCG ACGCCGGAGA ACCGCCACGT CGCGCACGCC GCCTCCGACG TGGTCGACTC GCTCGACGAG CTGCAGGGCG TATGCCCGCG GCTCGAACGC GTGGCCGTGG TGGTGGCGTG GTTCGGCTCC GATCTGCGCG CCGGGCAATG CTCGGTGCGG CCCGGCGTCG AGAGCCGCGA TAAATGGGTC GATCGCGCGA TCTGGTCAGT CGCCGGCGCG AGCCGCGGCG ACGCCTGGCT GGTGTCGCAG GTCGACGGCC GGCCGGCGTT CGGCGGCACG CCGTCGGACG ACAGCGTCGT GCATCTGATC GGCGAGCTGA AAGCGCGCGG GCTGAAGGTC ACCTTCTATC CGTTCGTGAT GATGGACGTG CCGGCCGGCA ACGGCCTGCC GAATCCGTGG ACCGGCGCTG CGCCGCAGCC GCCTTATCCC TGGCGCGGCC GCATCAGCTG CGATCCGGCG CCCGGGCAGG CGGGATCGCC GCAAGGCACC GCGGCCGCGG CGGCGCAGGT CGAAAGCTTC TTCGCCGGTG GGACGTGGAA TTATCGCCGG ATGATCCTGC ACTACGCGCA ACTCTGCGCG GCGGCAGGCG GCGTCGATGC GTTCCTGATC GGCTCGGAGC TGCGTGCGCT GACGCGGGTG CGCTCGGGCC CGGGCGTCTA TCCGGCGGTG CAGCATCTCG TCGCGCTGGC CAGCGACGTC AAGGCGATCG TCGGCGGCGG CACGCTCGTC ACCTATGGCG CGGACTGGAC CGAATACGGC GCCGACGTCG TCACCGCCGA CGCCTCCGAG GTGCGGTTTC CGCTCGATCC GCTGTGGGCG TCGCCTGCGA TCGACGCGAT CGGGATCGAT TATTACGCGC CGCTCGCCGA CTGGCGCGAC GACGCCGGCC ATCTCGACGC AACGGCGGCC GCCTCGAACT ATGACCGCGC CTATCTCGAT CGCAACGTGT TCGGTGGCGA GGCGTTCGAC TGGTACTACC CGGACGATGC CGCGCGCGCC GCCCAAAGCC GTGCGCCGAT CACCGACGGT CTCGGCAAGC CCTGGACCTT CCGGGTCAAG GACATCCGGA ATTTCTGGTC GCAACCGCAT 
TATGAACGCG TCGGCGGCGC AGAGCTGACA TCGCCGACGG CGTGGGCGCC GATGAGCAAG CCGGTGTGGC TGACCGAGGT CGGCTGCCCC GCGGTCGACA AGGGCGCCAA CCAGCCGAGC GTGTTTCCCG ATCCGAAATC CAGCGAGAAC TTCGCGCCTT ATTTCTCCAG CGGCGACCGC GACGATCTGA TCCAGCGCCG TTATCTCGAA ACCATCATCG CGGCGTTCGA CCCGGTATTC GGCGCGAGCG AGGCGCGCAA TCCGGTGTCG CCGGTCTATG GCGGCCGCAT GGTCGATCCG TCCGCGATCC ATCTGTGGAC CTGGGATGCG CGGCCCTATC CGGTGTTTCC GGCCGCGCAG GAGGTGTGGA GCGACGCGCC GAACTGGCAG ACCGGGCATT GGCTGACCGG ACGGCTCGGC GGCGCGCCGC TCGATGCGCT GGTGGCGCGG CTATTGGCCG ACAGCGGCGT CGGCGGCGTC GATTCCAGCG CGCTGCAGGA GTGTTGCGAC GGCTATGTGG TCGACCGGCC GATGACGCCG CGGGCGATGA TCGAGCCGCT GGCGATGGCC TATGCGTTCG ACGCCACCGC CGCGGACGGT ACGCTTCGCT TCGTGCAGCG CGGCGGCGCG CCGGTTGTCG AACTCACCGA GGACGATCTG GTGCTGCCGG ACAGAGCGGC GCTGTCGCGG CTGACGCGCG CGCAGGAGAC CGAACTGCCG CGCGAGGTCG CGTTCGGCTT CTCGGATGCG ATCGCGGATT ATCGCCGCTC GGCGGCGTCG TCGCGCCGGC TGGTCGGCGG CGCGGCCCGC ATCGTTCACG CCGATCTCGC GGTGGTGACC AACGACGCCG CGGCCGCGCG CCGCGCCGAA ATCTTTCTGC AGGACCTGTG GGCCGGCCGC GAGACCGCGA GCTTCGCGCT CGGGCTGTCG CATCTGTCGC TCGCGCCCGG CGACGTCGTG GCGCTGACTC TGAACGGCCG GCGGCGGCTG TTCGAGATCG GCGAACTGGT CGACACCATG TCGCGCGCGG TCAAGGCGCG CAGCATCGAT CCGGAGGTGT TCGCACTGCC GCAGCGGGCG CCGCGCCTCG GCGCGCCGCA AATCCCCGCG GCGCTCGGGC CAGCGCATGT CGTGGCGCTC GATCTGCCGG TGATCGACGC CGCCGCGCCG GAGGTGCTGA CGCGGCTCGC GGTGTTCGCC AATCCCTGGC CCGGCGCGGA GCTGATCTAC GCCTCCGCCG ACGGCGCGAG CTACCAGCCG CTGGCGAGCG CGACCGCGCG CGCGATCCTC GGCGAGACGC TCGATCCGCT GCCGCGCGGG CCGCTCGGGT TGTGGGACCG CCGCAATCGC GTGCGGGTGC GGATCTATGG CGGCGCGTTG TCGTCGCTGT CGGACGCCAG CGTGCTCAAT GGCGGCAATG CGGCGGCGGT GCAGAATCCC GACGGCGAAT GGGAGATCCT GCAATTCGCC AATGCCGAAC TGGTCGACGG CAACACTTAT GCGTTGTCGC GGCTGCTGCG TGGCCAGGCC GGCAGCGAGC AGGCGATGCG CGATTCCTTG CCCGCGGCTG CGCCGTTCGT GCTGCTCGAC AGCCATCTGG TGCAGCTCGC GCGCGGCGTC GACGCGCTCG GGCGGCCGCT GCAACTCCGC GTCGTCGCCG CGGGCCGCAG CCACGACGAT CCGAGCGCAA CGGCGCTGGC GGTGACCCCG GGCCCGACCG CGCTGCGGCC GTTCGCGCCG GTGCATCTGA AGGCCGTGCG CACGACCGCC GGCGTCGCGC TGAGCTGGAT CCGCCGCACC CGGATCGGCG 
GCGATGGCTG GAGCGGGGAG GCGCCGCTCG GCGAGGAGTC CGAGGCCTAT GCGCTCGACA TCCGCGCGCC AAGCGGGGCG GTCGTCCGGT CGATCACGAC CGCTGCGCCG CAAGCGCTCT ACACGACCGC CGACGAGATC GCCGATTTCG GCGCGCCGCA GACCGCGCTG CGCATCCGCG TCGCGCAGCT TTCCGCCACC GTCGGTCCGG GCTTCGCCGC CGACGCCACG CTCGCGCTGT AG
|
Protein sequence | MVALVLSIAG GAVGALFGPV GAIAGRIAGA LAGSMLDHAL LGGGDRNVEG PRLADLDIMA STEGAPIPRV YGRARLAGQV IWATQLEEVI SNEQSSSGGK GFGGPTTTTT TYAYFANFAV GLCEGPIGRV ARIWADGKPL DLSGLNVRVH RGSEDQSPDD LIVAKQGAGD APAYRGLAYV VFERLPLAAY GNRIPQLSFE IVRPIGGLEA MVRAVTLIPS TTEFGYEPST RVQISGKGSA TPENRHVAHA ASDVVDSLDE LQGVCPRLER VAVVVAWFGS DLRAGQCSVR PGVESRDKWV DRAIWSVAGA SRGDAWLVSQ VDGRPAFGGT PSDDSVVHLI GELKARGLKV TFYPFVMMDV PAGNGLPNPW TGAAPQPPYP WRGRISCDPA PGQAGSPQGT AAAAAQVESF FAGGTWNYRR MILHYAQLCA AAGGVDAFLI GSELRALTRV RSGPGVYPAV QHLVALASDV KAIVGGGTLV TYGADWTEYG ADVVTADASE VRFPLDPLWA SPAIDAIGID YYAPLADWRD DAGHLDATAA ASNYDRAYLD RNVFGGEAFD WYYPDDAARA AQSRAPITDG LGKPWTFRVK DIRNFWSQPH YERVGGAELT SPTAWAPMSK PVWLTEVGCP AVDKGANQPS VFPDPKSSEN FAPYFSSGDR DDLIQRRYLE TIIAAFDPVF GASEARNPVS PVYGGRMVDP SAIHLWTWDA RPYPVFPAAQ EVWSDAPNWQ TGHWLTGRLG GAPLDALVAR LLADSGVGGV DSSALQECCD GYVVDRPMTP RAMIEPLAMA YAFDATAADG TLRFVQRGGA PVVELTEDDL VLPDRAALSR LTRAQETELP REVAFGFSDA IADYRRSAAS SRRLVGGAAR IVHADLAVVT NDAAAARRAE IFLQDLWAGR ETASFALGLS HLSLAPGDVV ALTLNGRRRL FEIGELVDTM SRAVKARSID PEVFALPQRA PRLGAPQIPA ALGPAHVVAL DLPVIDAAAP EVLTRLAVFA NPWPGAELIY ASADGASYQP LASATARAIL GETLDPLPRG PLGLWDRRNR VRVRIYGGAL SSLSDASVLN GGNAAAVQNP DGEWEILQFA NAELVDGNTY ALSRLLRGQA GSEQAMRDSL PAAAPFVLLD SHLVQLARGV DALGRPLQLR VVAAGRSHDD PSATALAVTP GPTALRPFAP VHLKAVRTTA GVALSWIRRT RIGGDGWSGE APLGEESEAY ALDIRAPSGA VVRSITTAAP QALYTTADEI ADFGAPQTAL RIRVAQLSAT VGPGFAADAT LAL
|
| |
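The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon. Both assumptions agree with the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks that group the sequence into 10-mers."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the Gene Information fields of this record.
START_BP, END_BP = 2226929, 2230780

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3852, matching "Gene Length"
print(protein_length(length_bp))  # 1283, matching "Protein Length"
```

Passing the full Gene sequence field to gc_fraction should give a value that can be compared with the listed GC content of 71%.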