Gene RPC_3601 details

Gene Information

Locus tag: RPC_3601
Symbol:
ID: 3971628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4001860
End bp: 4003626
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 60%
IMG OID: 637926710
Product: myosin-cross-reactive antigen
Protein accession: YP_533457
Protein GI: 90425087
COG category: [S] Function unknown
COG ID: [COG4716] Myosin-crossreactive antigen
TIGRFAM ID:
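
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4001860
end_bp = 4003626
gene_length_bp = 1767
protein_length_aa = 588


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(start_bp, end_bp))            # 1767
print(expected_protein_length(gene_length_bp))  # 588
```

Under those assumptions both values agree with the table: 1767 bp spans 589 codons, of which 588 encode residues.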


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.949249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTATA GCAGCGGAAA CTATGAAGCA TTCGTCCGCC CCCGAAAATC GGAGAGCGCG 
GACAAGAAGA CTGCGTGGCT CGTCGGCTCG GGTCTTGCGG GACTGGCTGG CGCGGCCTTC
CTCATCCGCG ACGGTGGAGT GGCCGGTGAA CGCATCACAA TCCTCGAAGA GCTGGACGTT
CCCGGCGGCG CGCTCGATGG CCTCGATGTG CCCGAAAAGG GTTTCGTTAT CCGCGGCGGT
CGCGAGATGG AGGAGCATTT CGAATGCCTC TGGGATCTCT ATCGATCGAT CCCCTCGCTC
GAGGTCGAGG ATGCGAGTGT GCTGGACGAA TTCTACCGGC TCAACAAGGA CGATCCGAAT
TTCTCACTGC AACGCGCGAC CCAGAACCAG GGTCAGGACA CACCGGATAA GGCGCTGTTG
ACGCTGAACG ACAGGGCGCA AAAGGACCTT CTCTCCGTCT TTCTTGCCAC TCGTGAGGAG
ATGGAGAACA AGCGGATCAA CGAGGTCTTC AGCGAGGACT TTCTCAAGAG CAATTTCTGG
CTCTACTGGC GAACCATGTT CGCCTTCGAG GAGTGGCACT CCGCGCTAGA GATGAAGCTC
TATCTCCACC GCTTCATCCA TCATATCGGA AATCTCGCCG ACTTCTCCTC CCTCAAGTTC
AATCGCTACA ACCAGTATGA ATCAATGGTC CTGCCGCTGG TGAAGTGGTT GCAGGATCAC
GGCGTCCGGT TCCGCTACGG CATCGAGGTT ACCGATGTCG ATTTCGACAT CCACGCGGAG
GGTAAACAAG CGACGCGGAT TCACTGGACC GAAAAGGGCG TCGAGGGCGG CGTCGATCTT
GGCCCCGACG ATCTCGTCCT CATCACCATC GGGTCGCTGA CCGAGAATTC TGACAATGGC
GATCACCATA CCCCGGCAAA ACTCAGGGAG GGCCCCGCGC CGGCCTGGGA TCTGTGGCGC
CGCATCGCCG CCAAGGATCC CGCCTTCGGC CGCCCCGATG TGTTCGGCGC CCATGTCACC
GAAACCAAAT GGGCGTCTGC GACCATCACC GCACTCGACC AGCGCATTCC GCAGTACGTC
GAGAAAATTG CGAAACGCAA TCCGTTTACG GGCAAGATCG TCACCGGCGG TATCGTCACG
GTGAAGGACT CGAGCTGGCT CATGAGTTGG GCGGTCCATC GCCAGCCGCA TTTTAAGAAG
CAGCCGAAGG ACCAGATGAT TGCCTGGCTC TATGCGCTAT TCGTCGACCG GCCCGGTGAC
TATGTTAAGA AGCCGATGCT GGACTGCACC GGAGAGGAGA TCACCCAGGA GTGGCTCTAT
CACCTCGGCG TGCCGGTCGA GGATATCCCT GAACTCGCCG CGGCGGGCGC GAACACCGTG
CCGGTGATGA TGCCTTATAT CACCGCCTTC TTCATGCCGC GCCAGGCGGG CGACCGGCCG
GACGTGGTTC CTGAGGGCGC CGTCAACTTC GCCTTCATCG GCCAGTTCGC CGAGTCGAAG
CAACGCGACT GCATCTTCAC GACGGAATAC TCGGTGCGCA CGCCGATGGA GGCGGTCTAC
ACGCTCATGA ATGTCGAACG CGGCGTGCCG GAAGTGTTCA ACTCGACGTA CGACATCCGC
ACGCTGCTGG CGGCGATCGG ACCGCTCAGG GACGGCAAGG GCATCGATCT TCCCGGGCCG
TCCTTCCTGC ACAAACTGCT GATGAAGAAG CTCGAAGGGA CAGAGATCGC GGAACTGCTC
AAGGAGTTTC ACCTAATCTC GGAATGA
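
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any composition statistic such as GC content can be computed. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the helper names are hypothetical. It is demonstrated on the first row only, so its result is not expected to equal the whole-gene figure reported in the table.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in the sequence (assumed definition)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied verbatim from the record.
first_row = "ATGCACTATA GCAGCGGAAA CTATGAAGCA TTCGTCCGCC CCCGAAAATC GGAGAGCGCG"

print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the complete gene sequence to `gc_percent` in the same way would give the value to compare against the reported GC content.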
 
Protein sequence
MHYSSGNYEA FVRPRKSESA DKKTAWLVGS GLAGLAGAAF LIRDGGVAGE RITILEELDV 
PGGALDGLDV PEKGFVIRGG REMEEHFECL WDLYRSIPSL EVEDASVLDE FYRLNKDDPN
FSLQRATQNQ GQDTPDKALL TLNDRAQKDL LSVFLATREE MENKRINEVF SEDFLKSNFW
LYWRTMFAFE EWHSALEMKL YLHRFIHHIG NLADFSSLKF NRYNQYESMV LPLVKWLQDH
GVRFRYGIEV TDVDFDIHAE GKQATRIHWT EKGVEGGVDL GPDDLVLITI GSLTENSDNG
DHHTPAKLRE GPAPAWDLWR RIAAKDPAFG RPDVFGAHVT ETKWASATIT ALDQRIPQYV
EKIAKRNPFT GKIVTGGIVT VKDSSWLMSW AVHRQPHFKK QPKDQMIAWL YALFVDRPGD
YVKKPMLDCT GEEITQEWLY HLGVPVEDIP ELAAAGANTV PVMMPYITAF FMPRQAGDRP
DVVPEGAVNF AFIGQFAESK QRDCIFTTEY SVRTPMEAVY TLMNVERGVP EVFNSTYDIR
TLLAAIGPLR DGKGIDLPGP SFLHKLLMKK LEGTEIAELL KEFHLISE
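
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last rows of the gene sequence. It assumes the codon-to-residue assignments of table 11 match the standard genetic code (the table differs mainly in which codons may act as start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# first base varying slowest and third base fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGCACTATA GCAGCGGAAA CTATGAAGCA"))  # MHYSSGNYEA
# Last row of the gene sequence, which is in frame and ends with TGA.
print(translate("AAGGAGTTTC ACCTAATCTC GGAATGA"))     # KEFHLISE
```

Both outputs match the first and last residues of the protein sequence listed above.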