Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2728 |
Symbol | |
ID | 4023226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3048913 |
End bp | 3050217 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637962927 |
Product | curlin associated |
Protein accession | YP_569858 |
Protein GI | 91977199 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.718258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCGTT CGTTATTCCT GATTTCGGCC AGCGCGCTGG CGCTGAGCGC CGCGGCGGCT TTCGCCAACT CCAATACGAT TTATCTCGAC CAGACCGGCG ACGGCCAGAC GGCGTCTGTC GACCAGTCGC ACTCGGGCAA TCAGATCGGC ACCTTCGCCG ATCGGTTTGC GCAACTCAAC GGCGGCGGCA ACGGCGGCAA TTCGCTGACG GTGACCCAGT CCGGCGACAA CAATCTGCTG GGCGTCAACA AAGCCGGCTT TCAGTCCGGC AGCGGAAACT CCGCCAACAT CTCGCAGGCC GGCTCCGGCA GCAGCGTAGA TCTGCAGCAA ACCGGCACCG GCAATGGCGT AAAGAATATC GGCTGGACCA ATGGCCCGAT CTTTAACGGC ATTCGCCAGG ACCAAACCGC GCAGGCCAGC GCGGTCGATC TCAGCCAGAA CGGCGCCAAC AATGTGTTCG ACATCGCCCA GGGCGGCGCC TCGAACCGTA CCACCATCAA ACAGAGCGGC GCCAACGGTT TCGTTTATGT CCGTCAGGGC ACCTCACTCG CAGACACGAA TTTCCCACCT CAGAACGGCT ACGCCTCGTC CTACGGCAGC AACAGCACGG TCGACGTCAA TCAGACGACC ATCGCGGGCT GGACCTACGC CGCCGTGGCG CAGGGCGGCG GCAGCGGCAA CACCGTCAAT ATCGGTCAGA ACGGTTCCTA TCTCGGCGCG GGCGTCAGCC AGTCCGGCTT CGACAACCGC TTTAATGCGG TGCAAAGCGG CGACAGCAAC AATATCGGCC TGCAGGGCAA CGCCGGCCCC GACACGCCGA TCCGGCAATT CGGCGACCGC AACTCCTATC TCGCCTACCA GACCGGCTAC GGCAACCGCG CCAACGGCTC CCAGACCGGC AACAGCAACG ACGTCTGGAC CAGCCAGACC GGAACGCGCA ACGATCTGTC GGGCAGCCAG ACCGGCTTCG GCAACGGCGT GACCTCGTTT CAGAGCGGAA ACGGCGAAGT GCTAAGCTAC TCGCAGACCT CCTATCTGAT CGGCAATACC ATCCGGAGCA CGCAGTCGGG CGGCTCCGAC CATGCCGATC TCACCCAGAT CGGCAACAAC AACACCATTG TCGGCGCGCA GGCCGGCGGC CTCGGCAACC TCGCCACCGT CATACAGAAC GGCAATGGCA ATATCGGTTT GTATACCCAG ATTGGTAGCG GCAACGTTCT GACGCTGACC CAGATCGGCA ACGGTAATCT CGCCAATACC AGCCAGACGG GATCGAACAA CAGCATCAAG ATCGTTCAGC ATTAA
|
Protein sequence | MKRSLFLISA SALALSAAAA FANSNTIYLD QTGDGQTASV DQSHSGNQIG TFADRFAQLN GGGNGGNSLT VTQSGDNNLL GVNKAGFQSG SGNSANISQA GSGSSVDLQQ TGTGNGVKNI GWTNGPIFNG IRQDQTAQAS AVDLSQNGAN NVFDIAQGGA SNRTTIKQSG ANGFVYVRQG TSLADTNFPP QNGYASSYGS NSTVDVNQTT IAGWTYAAVA QGGGSGNTVN IGQNGSYLGA GVSQSGFDNR FNAVQSGDSN NIGLQGNAGP DTPIRQFGDR NSYLAYQTGY GNRANGSQTG NSNDVWTSQT GTRNDLSGSQ TGFGNGVTSF QSGNGEVLSY SQTSYLIGNT IRSTQSGGSD HADLTQIGNN NTIVGAQAGG LGNLATVIQN GNGNIGLYTQ IGSGNVLTLT QIGNGNLANT SQTGSNNSIK IVQH
|
| |