Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0004 |
Symbol | |
ID | 3969429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 5106 |
End bp | 6350 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637923118 |
Product | hypothetical protein |
Protein accession | YP_529902 |
Protein GI | 90421532 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACATCG CCTCCTCGAA CAGCAGCGCC GGCAATAGCT CCATCCAAAG CCAGGCGGCG GACCAACCGG CGCGGCAGAA GGCCGGCAGC GGCGCGTCGG ACGGCCCGGC CAACGACGCA GCTTCGTCGA AAGCCGCAGC CAACCCGGCG ACCAGGATCG AAATATCGAG CTACGCCCGC CGCTTATTGG CACGTGCCAG CGCCGAGCAA GCCGTCGTGG CGCAGCTGTT GGCGCAGATC AACGCGCTCC GGCACGGCAC GTCGTCGGTC GGCGCCCCGA GCGGCCCCGG CGACTACGCC ACGATCACCA CCGGGTCCGG CGACGACGTC ATCGAAGCCG CGCGCTACGC CACCATCAAT GCCGGCGACG GCAACAACAC GATCTCAACC TACGACTATG CCACGATCAG TACCGGGGCA GGCAACGACG CGATCGATAG CTACGGCTAC GCCACCATCG ACGCCGGCGG CGGCGACAAC CGGGTCAGCA CCTACGATCA TTCCACCGTC TCGACCGGCG CCGGCAATGA CGTCATCAAC ACCTATGGCT GGTCAACCGT GAACGCCGGC GACGGCAACA ACACGGTCAG CACCTACAAC CATTCCACCG TCGCCACCGG TGCCGGCGAC GACGTTATCA GCACCTATGG CTGGTCCACC GTGAACGCCG GCGGCGGCAA CAACACGGTC AGCACCTACA GCCATTCCAC CGTTGCGACC GGCGCCGGCG ACGACGTTAT CAGCACCTAT GGCTGGTCCA CCGTGAATGC CGGCGACGGC AACAACAGGG TCAGCACCTA CAACCATTCC ACCGTCGCCA CCGGATCCGG CAACGACGTC ATCAGCACCT ACGACCACTC CACCATCGAC GCTGGCGCCG GCAACGACGT CATCAGCACC TCCAACCACT CCACCATCGA CGCCGGCGCC GGCAACGACG TCATCACCGC CGGCGGCTTT TCCACCATCA CCGGCGGCCG TGGCGACGAT TCGATCCAGA TCGACGGTTG GGCCGCCACC GTGGCGTTCG GCCAGGGCGA CGGCCACGAC ACGATTCGCT CCGGCCGCGA CCTGACGCTG GCCATCAGCG GCTATTCGCA ATCCGACGTC ACCGTGACGC GCAAGGACGG CACCGCGGTG ATCAGTTTCA AGGGTTCCGA GGATTCGATC GTGCTCGATC TCGCCGGCAA TGGGTCGGCG AAGCTGAGCT TTGCCGACCA TTCGACGCTG AACGTCAGCG CGTAA
|
Protein sequence | MYIASSNSSA GNSSIQSQAA DQPARQKAGS GASDGPANDA ASSKAAANPA TRIEISSYAR RLLARASAEQ AVVAQLLAQI NALRHGTSSV GAPSGPGDYA TITTGSGDDV IEAARYATIN AGDGNNTIST YDYATISTGA GNDAIDSYGY ATIDAGGGDN RVSTYDHSTV STGAGNDVIN TYGWSTVNAG DGNNTVSTYN HSTVATGAGD DVISTYGWST VNAGGGNNTV STYSHSTVAT GAGDDVISTY GWSTVNAGDG NNRVSTYNHS TVATGSGNDV ISTYDHSTID AGAGNDVIST SNHSTIDAGA GNDVITAGGF STITGGRGDD SIQIDGWAAT VAFGQGDGHD TIRSGRDLTL AISGYSQSDV TVTRKDGTAV ISFKGSEDSI VLDLAGNGSA KLSFADHSTL NVSA
|
| |