Gene RPB_3055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3055 
Symbol 
ID3910856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3484682 
End bp3485917 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content53% 
IMG OID637884962 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_486667 
Protein GI86750171 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.363566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGG CAGGGTTTCT GGAGGAGCTG CTGGATGGGG CTGATGTGGA GTGGGAACCA 
TTGGGGGAGG TCACTCAACC AACAGCGAAC ATCAAATGGT CACAAGCCGA CGGCGTTTAC
CAATATATTG ATCTCACCTC CGTCGACATC AAAACCAAAC GCGTTACCGA GGCAAGCGAG
ATTACAGCCG AGACCGCGCC AAGCAGAGCG CAGAAGCTCG TTAAAGAAAA TGACGTCATT
TTCGCTACGA CGCGCCCCGC TCAACAACGA TACTGCCTAA TCGACTCCGA ACTGGCCGGA
AACGTCGCCA GCACGGGTTA CTGCGTGCTC AGAGCAAAGA AGGATCAGGT ACTACCTAAG
TGGATTTTGC ACTGGCTTGG CACAACAGAA TTCAAGAATT ACGTCGAGGA GAATCAGAGT
GGGGCTGCAT ACCCAGCGAT ATCAGACGGC AAGGTGAAAG CGTTCAAAAT TCCCATTCCA
TGCCCGGATG ATCCGGAGAA GTCGCTGGCG ATACAGGGGG AGATCGTCCG AATACTGGAC
ACATTCACCG AGCTTACCGC TGAGCTTACC GCGGGGCTTG CCGCCGAGCT TGCCCAGCGC
AAAAAACAAT ACAGCCACTA CCGCGACCAG CTCTTGACCT TCAATGAAGA TGAGGTGGAG
TGGAAGACGC TGGGGGATAT CGCGACTCTA CGCCGAGGGC GAGTTATGTC GAAGGGCTAC
CTGCGAGATA ACGCCGGTGT GTACCCGGTC TACAGCTCCC AAACTGCAAA CAACGGCATG
ATTGGCCAGA TCGACACGTT TGACTTTGAC GGTGAGTACG TCAGTTGGAC CACAGACGGA
GCAAACGCCG GAACTGTATT CTATAGAAAC GAAAAATTCT CGATTACTAA CGTTTGCGGC
GTAATAAAAG AAAATGGAAC GTGCCCGCTG GACCTAAAAT TTTTATCTTT TTGGCTTTCG
ACGGAGGCCA AGAAGCATGT TTACAGTGGA ATGGGCAACC CGAAGCTGAT GAGTCATCAA
GTCGAGAAAA TACCAATCCC GATTCCCTTT CCAGATGACC CTAAAATATC GCTAGAAGCC
CAAAAGCGCG TCGCCGCCAT CCTCGACAAG TTGGATGCGC TGACGACTTC CCTCACTGAG
ATCCTGCCGC GTGAAATCGA GCTGCGTGAA AAGCAGTATG CCTATTACCG CGATCAGCTG
CTGAGCTTCC CCAAGCCGGA CGCGGAGGCT TTCTAA
 
Protein sequence
MSAAGFLEEL LDGADVEWEP LGEVTQPTAN IKWSQADGVY QYIDLTSVDI KTKRVTEASE 
ITAETAPSRA QKLVKENDVI FATTRPAQQR YCLIDSELAG NVASTGYCVL RAKKDQVLPK
WILHWLGTTE FKNYVEENQS GAAYPAISDG KVKAFKIPIP CPDDPEKSLA IQGEIVRILD
TFTELTAELT AGLAAELAQR KKQYSHYRDQ LLTFNEDEVE WKTLGDIATL RRGRVMSKGY
LRDNAGVYPV YSSQTANNGM IGQIDTFDFD GEYVSWTTDG ANAGTVFYRN EKFSITNVCG
VIKENGTCPL DLKFLSFWLS TEAKKHVYSG MGNPKLMSHQ VEKIPIPIPF PDDPKISLEA
QKRVAAILDK LDALTTSLTE ILPREIELRE KQYAYYRDQL LSFPKPDAEA F