Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3773 |
Symbol | |
ID | 3837230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4321247 |
End bp | 4322203 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637827898 |
Product | CBS domain-containing protein |
Protein accession | YP_428854 |
Protein GI | 83595102 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.163853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGTC ATTCCGCCTC CGGCGACAGC CCCTCCCGTA GTCGCGATGG CGACTCGGGC GATAAATCGG ACTCCGCCAG TCGCGGCGGT CTGGCGAGTT TCCTGCGCGG GGTGTTCCGC CCGCGCAACG GCGATGGCGG ATTGCGCGAT TCTCTTGATG AACTGATCGA GGACCGCGAG GACTCCGACC TGTCGATGAC CGATCAGGAA CGCGCGTTGC TGACCAATAT CCTGCGGCTG CGCGATATGA CCGTCGAAGA CATCATGGTG CCGCGCGCCG ATGTGGTCGC CGTCGAGGTT TCCACCCCGG TCGATGTGGC GATCGAGCGC ATCGCCGCTT GCGGTCATTC CCGGCTGCCG GTCTATCGCG ACACCCTCGA CAACACCCTG GGCATGGTTC ACGTCAAGGA TTTCCTCCGC CGCAAGTCGG GTGACGCCGG CTCGCTCGAG CGGGTGCTGC GCGAGATCCT GTTCGCCGCG CCCTCGGCCC GGGTCCTCGA CCTGCTGCTC GAAATGCGGC TCAAGCGCAT TCACATGGCC TTGATCGTCG ATGAATACGG CGGCATCGAC GGCTTGGTGA CGATCGAGGA TCTGGTCGAA CAGATCGTCG GCGAGATCGA AGACGAATAC GACAGCGTGA CCGAGCCCGA GCTGACCGTG CACGAGGATG GCACGGTGGT GGTCGACGCC CGGCTCGACC TCGAGGATTT CGAGGAGCGC TGCGGCCGGT TGTTCACCGA CGAGGAACGC GAGGAGATCG ATACCCTGGG CGGGCTGGTC TTCCGGCTGG CCGGTCGGGT GCCGGCGCGC GGCGAATTGG TGCTGCGCGA GGGCGGACCC GAGTTCGAGG TGATCGATGC CGATCCCCGG CGGATCCGCC GGCTGCGCAT TCGCGGCCTC AATCCGCTGC CCTCCCCCGA CGCTCCGGCC GGCGGCCCCT CCGGGACCCC GGTTTAA
|
Protein sequence | MNGHSASGDS PSRSRDGDSG DKSDSASRGG LASFLRGVFR PRNGDGGLRD SLDELIEDRE DSDLSMTDQE RALLTNILRL RDMTVEDIMV PRADVVAVEV STPVDVAIER IAACGHSRLP VYRDTLDNTL GMVHVKDFLR RKSGDAGSLE RVLREILFAA PSARVLDLLL EMRLKRIHMA LIVDEYGGID GLVTIEDLVE QIVGEIEDEY DSVTEPELTV HEDGTVVVDA RLDLEDFEER CGRLFTDEER EEIDTLGGLV FRLAGRVPAR GELVLREGGP EFEVIDADPR RIRRLRIRGL NPLPSPDAPA GGPSGTPV
|
| |