Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3457 |
Symbol | |
ID | 3911259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3963280 |
End bp | 3965115 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885360 |
Product | hypothetical protein |
Protein accession | YP_487064 |
Protein GI | 86750568 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.801555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.385071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCA CCGCCAATCT CGGGCTGCCC TTCATCGAGG CGAGCCAGGC GCAGAAGCAC GTCACCCACA ACGAGGCGCT GCGGATTCTC GACGCCGCGA TCCAGATCGC GGTCGCCGAT CGCGATCGCA CGGCGCCGCC GCCGAGCCCG GCCGAGGGCG CGCGCCACAT CGTCGCCGCC GGCGCGAGCG GCGCCTGGAG CGGGCAGGCG GATGCGGTCG CGACCTGGCA GGACGGCGCC TGGGCGTTCC TCGCGCCGAA ACAGGGCTGG TGCGTGTGGT CGGTCGCCGA CGATGGGCTG CTGGTGTTCG ACGGCGCACA GTGGCGCGAT CTGCAAATGT CGCTCGCCAA CGCGCCGTCG CTCGGCGTCA ATACCACGGC CGCCGCGCCG AACCTGCTCA GCGTCAAATC CGACGCCGCG TTGTTCGCCG CGATCGACCC GGCCGCCGGC GGCAGCGGCG ACGTCCGTCT GCAATTGTCC AAGGACAGCG CCGCGAACAC CGCCTCGGTG GTGTTCTCCA ACGCCTATTC GGGCCGCGCC GAATTCGGCC TGGTCGGCTC CGATGCGTTC CGGCTGAAGA TTTCCGCCGA CGGCGCGAGC TTCGTCGACG CCCTGCTGAT CGATCCCGTC AGCGGCAACG CGACGCTGCC GCGGGGCATC GCGCTCACCG GCGTGGTGGC GCCGCCGCCG CTGGGCAGCG ATCAGGACGA CTATGATGCC CCCGGGCTCG CGGCGGCGGC GGTGCTGCAG CTCTGGTCCG ATGCCGCGCG ACAGCTCTCC GGGCTGGCCG GCGGCGTCGA AGGCCGCGTC GTCACCGTGA TCAATGTCGG CAGCCAGCCG ATCACGCTGC TGGACGAGGC GGCGGCGTCG GCAGCGGCCA ACCGCTTCGC ACTCGGCGGG CCGCTGATCG TCGCCGGCAA GCAGGCCGCG ATCCTGCGCT ACGACGGCAC CGCGGCGCGC TGGCGGCCGA TCGCGGGCGG CAATTCCGGC GGCGTGATCC ATTCCGGCCC GCAATCGCTG ACCGCGGCGC AGCAGGACCA GGCGCTGGCC AATCTCGGCG GCGGCGACCT GTCGCTGCTG CGCGGCCACA TCAGCGGCGC GGTGGTGTCG AACGATGCGA CGAGCCCCGA CACGGTGATC GCGGTTTCGG CCGGCGCGGC GGTGTCGGCC GACGCGACGA CGCTGATGAA GATCGCCGCC GTCACCAAGA ACGCCAACGC CGTCTGGACG GCCGGCACCG GCAACGGCGC GGCGGACGGC GCGGCGGGCT ACACCGCGCT CGCCGCCTCG ACCTGGTACT ACGTCTTCCT GATCAAACGC CCGGACACCG GCGCGGTCGA TGTGTTGACG TCGAAATCGC CGACATCGCC GACGCTGCCG GCCGGTTTCA GCAAGGCCCG GCTGATCGGC GCGTTCCGGA CCAACGGCGC GTCGCAGATC ATGGCGTTCC ATGCCTTCGA CGACGGCGAC ACCGTGATGT GGGACGCCGT GCCGGCGATC GACTACACCA CCGCCAGCCT CGGCACCTCC TCGATCACGA TCACGCTCGT CAATGTGCCG GCGATCGAGG TGCAGGTGAT CGCCAACGTC TCGGCATTCA ACAGCGCCAA CACATCGTCG GTGTATTTGC GGCATCCGTC GGCGGCCGAC ATGACACCGA CCGCCAATTC GGCGAGCCCG CTCGGCACCA TCGTCTCACC CAACGGGCTG TTCGTCGGCC TCACCACCTT CGTGCGCGCC GACGCCTCGC GGCAGATCAA GGCGCGCGCC GTCGCCGTCG GCACCATTCT CAACATCGCG GTGCTGGGTT GGCGCTGGAA CCGGCGCGGG CTTTGA
|
Protein sequence | MTATANLGLP FIEASQAQKH VTHNEALRIL DAAIQIAVAD RDRTAPPPSP AEGARHIVAA GASGAWSGQA DAVATWQDGA WAFLAPKQGW CVWSVADDGL LVFDGAQWRD LQMSLANAPS LGVNTTAAAP NLLSVKSDAA LFAAIDPAAG GSGDVRLQLS KDSAANTASV VFSNAYSGRA EFGLVGSDAF RLKISADGAS FVDALLIDPV SGNATLPRGI ALTGVVAPPP LGSDQDDYDA PGLAAAAVLQ LWSDAARQLS GLAGGVEGRV VTVINVGSQP ITLLDEAAAS AAANRFALGG PLIVAGKQAA ILRYDGTAAR WRPIAGGNSG GVIHSGPQSL TAAQQDQALA NLGGGDLSLL RGHISGAVVS NDATSPDTVI AVSAGAAVSA DATTLMKIAA VTKNANAVWT AGTGNGAADG AAGYTALAAS TWYYVFLIKR PDTGAVDVLT SKSPTSPTLP AGFSKARLIG AFRTNGASQI MAFHAFDDGD TVMWDAVPAI DYTTASLGTS SITITLVNVP AIEVQVIANV SAFNSANTSS VYLRHPSAAD MTPTANSASP LGTIVSPNGL FVGLTTFVRA DASRQIKARA VAVGTILNIA VLGWRWNRRG L
|
| |