Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3941 |
Symbol | |
ID | 3911748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4497283 |
End bp | 4498752 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885845 |
Product | hypothetical protein |
Protein accession | YP_487545 |
Protein GI | 86751049 |
COG category | [R] General function prediction only |
COG ID | [COG3106] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00618827 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTCA GTTTTTCAGA TCTGGTCGAG GAAGCGTGGC TGTCGGCGCG GGCGCTGAAG GACTACAGCG AGAATATCTT CAATCCGACG GTCCGGCTCG GCGTCACCGG GCTGTCACGT GCCGGCAAGA CGGTGTTCAT CACCGCGCTG GTGCACGGCC TGTCGCGCGG CGGGCGGTTT CCGATCTTCG AATCGATGTC GACCGGACGA ATCGCCAAGG CCCGGCTGGC GCCGCAGCCC GACGACGCGG TGCCGCGGTT CGGCTATGAG GGCTTTCTCG CCACCCTGAT GGAGCAGCGC GACTGGCCGA GTTCGACGGT GGATATCAGC GAATTGCGGC TGGTGATCGA CTATCAGCGT AAGAACGGCG CCGAACGCAC GCTGACGCTG GACATCGTCG ACTACCCCGG CGAATGGCTG CTCGACCTGC CGCTGCTGAA CAAGAGCTAC GAGAAATGGG CGGCGGAGAG CCTGGCGCTG TCGCGACAGG ACCCGCGCCG CCGCGTCGCC GTCGACTGGC ATGCGCATCT CGCCACGCTC GATCCGAACG GCCGCGAGAA CGAGCAGGAA ACGCTGACCG CGGCGCGGCT GTTCACCACC TATCTGCGCG ACTGCCGCAA CGAGCAGTTC GCGATGAGCC TGCTGCCGCC CGGCCGTTTC CTGATGCCCG GCAACCTCGC CGGCTCGCCG GCGTTGACCT TCGCGCCTCT GGATGTGCCG GAGGGCGGCA CCGCGCCGGA GCGCTCGCTG TGGGCGATGA TGGTGCGACG CTACGAGGCC TACAAGGACG TGGTGGTGCG GCCGTTCTTC CGCGATCACT TCGCGAGGCT CGACCGCCAG ATCGTGCTGG CGGATGCGCT GTCGGCTTTC AACGCCGGCC CCGAGGCACT GCAGGACCTC GAAGCCGCGC TCGCCGGCAT TCTCGACTGT TTCCGGGTCG GGCGCGCGTC GATGCTGTCG ACGATGTTCC GGCCGCGGAT CGACCGCATC CTGTTCGCCG CCACCAAGGC CGACCATCTG CACCATTCCA GCCACGACCG GCTGGAGGCG ATCCTGCGCA AGCTGGTCGA GCGGGCGATG CAGCGCGCCG AATTCGCCGG GGCGACCGTC GACGTGGTGG CGCTGGCCGC GGTGCGCGCC ACCCGCGAGG CCCAGGTGCA GCGCGGCCGC GACCGGCTGC CGTCGATCGT CGGCACCCCG ATCAAGGGCG AAATGGCCGA CGGCGAGATC TTCGACGGCG AGACCGAGGT CGCTACCTTC CCCGGCGACC TGCCGACCAA TCTGCAGGGG CTGTTCAAGG GCGAGGACAC GTTTCGCGGC CTCGCCGCGG GCCGCCACGA GGACGCCGAT TTTCGCTTCC TGCGCTTCCG GCCGCCACGG CTCGACAATC GCGATCCGGA CGGCCCGGCA CTGCCTCACA TCCGCCTCGA CCGCGCGCTC CAGTTCCTGA TCGGAGATCA ACTGCAATGA
|
Protein sequence | MAFSFSDLVE EAWLSARALK DYSENIFNPT VRLGVTGLSR AGKTVFITAL VHGLSRGGRF PIFESMSTGR IAKARLAPQP DDAVPRFGYE GFLATLMEQR DWPSSTVDIS ELRLVIDYQR KNGAERTLTL DIVDYPGEWL LDLPLLNKSY EKWAAESLAL SRQDPRRRVA VDWHAHLATL DPNGRENEQE TLTAARLFTT YLRDCRNEQF AMSLLPPGRF LMPGNLAGSP ALTFAPLDVP EGGTAPERSL WAMMVRRYEA YKDVVVRPFF RDHFARLDRQ IVLADALSAF NAGPEALQDL EAALAGILDC FRVGRASMLS TMFRPRIDRI LFAATKADHL HHSSHDRLEA ILRKLVERAM QRAEFAGATV DVVALAAVRA TREAQVQRGR DRLPSIVGTP IKGEMADGEI FDGETEVATF PGDLPTNLQG LFKGEDTFRG LAAGRHEDAD FRFLRFRPPR LDNRDPDGPA LPHIRLDRAL QFLIGDQLQ
|
| |