Gene Rru_A2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2037 
Symbol 
ID3835463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2351712 
End bp2354978 
Gene Length3267 bp 
Protein Length1088 aa 
Translation table11 
GC content67% 
IMG OID637826138 
Producthypothetical protein 
Protein accessionYP_427124 
Protein GI83593372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.86999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTCATA GTGGGGTGAG GGTGTTGTTC CGCGTTCTGG GCGCGCTGGG ATTGCTGACG 
CTCGCCCTTT TGGCCTTTTT CGGTTGGCGG CTGGAGCAGG GGCCGCTGTC GCTGTCCCTG
CTGACCCCTT ATATCGAAGA CGCCCTTGCC GACCCCGAGG GTGATTTCGA GGTGGTGATC
GACGGCACCG AACTCGCCTG GGGTGGCTGG CAGCGTCCGC TGAATTTGCG GGTGGTCGGC
GTGGTGGTGC GCGACGCCGA CCACCGGTTG ATCGCCCGGC TGCCGATGGT GGCGGTAACC
CTGGCGCCGC GCGCTCTGCT TGATGGCGAG GTGGCGCTGC GCCGGGTCGA AATCCTGTCG
CCCAGCCTTC ATATCACCCG CCTTGAAAGC GGCGAGATCT CGCTGGGCGG CGAGGCGCCT
TTGCCCATCG TTCCCTCCAT CGGATCCCCG GTCCACTCCC CGGGCGACGG CCAAAACCCG
GTCGGGCCGT CAGCGGTCGC CGGCGGGGAT GCCTCGGCCG GGGAGCGGGC CGGCCGCACC
CCCTTGTCCG AGCGCCTGGA TCGCTGGGTG GCGATGCTGA CCGATCAGGC CGGCGGCGTC
GCCCGGCTGA CCCTCCGCGA TGGGGCGATC GATTTCGATG ATCTGCGCAG CGGCGTCAGC
ATGCATATGC CCCGCGTCAA TGCGACGCTG ATCGCCGACA GCCAGGGGGT GGCCGGTGAT
ATCGCAGCCC ATCTTCACCT CGCCCAATCC CTGGCGCGCA TCGATCTATC GATCAGCCAC
CGGGTGGGCG AGGGGACCGT CGAGGCCGAG GCGACGATTT CCGGCCTGGA CGCCCGGCAA
CTGGCCCAGG CGGTGCCGGC CCTGGCCGCC GAGGGGACTT TGGATCTCGC CCTGTCGGGA
TCGGCGCGCG CCGTTCTCGA TCTCGACCTG CTCGAGGGTC CGGCGCCGCT GAGCGGCGTG
CTTGACGCCG GATTCGCCCT GGAAGGCCGC GATGGCGCCC TGGCCCTGCC CGAGCCGATG
GGAACCACCT ATCCTTTGCC GCGGATTTCC CTGAGTGGTG GCATCGAAGA CGGCTTTTCG
GCGGTTTTTC TTGATAAGGT CGATGTCGAT CTTGGCGGGC CGACGGTCCA TGCCACCCTG
CGCGCCGAGG ACCCGCTGAC CGCCCCCCGG GTGGCGATCA CCACCTCTGT CCTGGGGCTT
TCGATCGAGG ATTTGAAGCG CTTCTGGCCC CACGACAAGG TCGATGGCGC CCGGGAGTGG
ATCGCCCTCA ATCTGTCGGA GGGGACGATC CGCCAGGGTG ATTTCACCTT CGATCTGGCC
GGTCCGTCGT TGCAGGACAT CGACATCACC GCCCTGCAGG GCCTGGCCAT GGTCGAGGGT
ATTGCCGTCG ACTACCGCAA CCCCCTGCCG CCGCTTGAAC AGATGTCGGG CGAGATCGTC
TTCGGGCTCA AGGAGATCGC CGTCAAGGTG GCGACCGGCC GCCTGCGCGG AGTCGAGGGG
CTGGAGGTGG TCAAGGGTAC GGTGGTCTTC GGCGGCCTTG ACGCCGAGGA CCAGACGGCG
GCCATCGCTG TGAGCACCAA GGGGCCACTG GCCTCGGTGA TGGCGGTCAT CGATCACCAG
CCGCTCAATT ACGCCAAGGC CGTCGGCATC GATCCCAAAA CCGCCAAGGG CGCGGTCAAG
GCCGATCTGT CCTTCGCCTT TCCCTTGCTC AAGGATCTGA CCTTCGAGCA GATGGCGATC
CGTGTCGAAG GCGATGTCGA AGGCGTGGCC CTGCCAAAGG CCGCCTTCGG CCGCGATGTC
AGCGACGGCA AGCTGAAGGT GGTGCTTGAC CAGAACGGCA TGGATATCAG CGGCACGGCG
ACGCTGGCGA GCGTGCCGGT CGCCCTGGTC TGGCGCGAGA ACTTCGTCGA TGCGCCGTTT
TTGAGCCGCT ACAAGGTGAA GGGAACCCTC GATGCCAAGG GCATGGCCTC GCTGGGGATC
GATCCCGCGC TGATTTCACC CCCCTATGGC GGTGGCGTGG CCAAGGCCGA TCTGACCTAT
ATCCGCCTGC CCGGCGGGCG GGCGACGCTC GAGGGCCGGC TGGATCTGAC GGAGACGGTG
CTTGACCTCG AGGCGTTCGG CTGGCGCAAG CCCGCCGGCA GCCTTGGCGC CGCCAAGGTT
TCGGCAAGGC TTGGCGGCAA GGACGACCGC CTCGACATCG ACATGACGGC CGAGCCGCAG
ATGACGGCCA AGGCCAGCGT TTTCCTGAAC GCCAAGGGCG ACGAGATCAA GCGCGTCGAT
GTCGACAGCT TCGTGGTCGG CAATACCCGC CTTGCCGGTT CGGTCTCCCT GGGGGGATCG
GCGGGGCCCG ATATCCGCAT CGGCGATGGC GTTCTCGATC TGCGCCCCTA TTTCGATCGC
CGGCGCGGCG CGGCGACCTC GCCGACGCCG GCCGAAGGCG TGGCCGCCAA GCCGACCAAG
GATTTGCCGG CCTTTTCGGC GACGGTGACG CTTCGCCAGA TCATGCTGGC CAATGATGTG
GTCTTCGAGC ATGTGACGGC GCGGGCCGCC CGCGATGCCC ATCATTGGCG CCAAGCGGTC
GTCAATGCGG CGGTGCGCGG GTCATCGCCG CTGGCCCTGG CCCTGGAGCC CAAGGGGACG
CAGCGGCGGT TTACGCTTTC GGCGGCCGAT GCCGGAACGA TCCTGCGCGG GTTTGATGTC
ATTGAAACCG TGGCCGGCGG AAATCTGTCG GTCGAGGCGC TCAGCGACCC GGGCGGGGTG
GCGCGCGGCC GGGTTCTGGT CAAGGACTTC CGCCTGCAAA AGGCCCCGGT GCTGGCCCAG
ATCCTGTCGG TGGCCGGATT GACCGGCATC CTTGATGTGC TGGGCGGGCA GGGTATCGCC
TTTTCGACGC TGGTGGTGCC CTTCGTCTAT GACGATCCGG TGGTGACGGT CTCCGAGGCC
CAGGCCTATG GCAACGCCAT CGGCCTAACC GCCGAGGGGG CGATCAATCT TGACGACGAG
CGGCTTTCGC TGAAGGGCAC GGTGGTTCCC GCCTATGCGA TCAACAGCCT GCTCGGCAAG
ATCCCCTTGC TTGGCAGCAT CATCGTTGGC GGCAAAGGGC AGGGGGTGAT CGGGGTGAAT
TACGCCGTCA GTGGCGATGT GTCCAAGCCG GCGATTTCGG TCAATCCGCT ATCGGCGCTG
ACTCCGGGCT TTCTGCGCGG TATCTTCAAG ATCTTCGACC AGCCCGACCA GACGGCGACC
GAGAAGGTCG GCGGCACCGG CGGATAA
 
Protein sequence
MVHSGVRVLF RVLGALGLLT LALLAFFGWR LEQGPLSLSL LTPYIEDALA DPEGDFEVVI 
DGTELAWGGW QRPLNLRVVG VVVRDADHRL IARLPMVAVT LAPRALLDGE VALRRVEILS
PSLHITRLES GEISLGGEAP LPIVPSIGSP VHSPGDGQNP VGPSAVAGGD ASAGERAGRT
PLSERLDRWV AMLTDQAGGV ARLTLRDGAI DFDDLRSGVS MHMPRVNATL IADSQGVAGD
IAAHLHLAQS LARIDLSISH RVGEGTVEAE ATISGLDARQ LAQAVPALAA EGTLDLALSG
SARAVLDLDL LEGPAPLSGV LDAGFALEGR DGALALPEPM GTTYPLPRIS LSGGIEDGFS
AVFLDKVDVD LGGPTVHATL RAEDPLTAPR VAITTSVLGL SIEDLKRFWP HDKVDGAREW
IALNLSEGTI RQGDFTFDLA GPSLQDIDIT ALQGLAMVEG IAVDYRNPLP PLEQMSGEIV
FGLKEIAVKV ATGRLRGVEG LEVVKGTVVF GGLDAEDQTA AIAVSTKGPL ASVMAVIDHQ
PLNYAKAVGI DPKTAKGAVK ADLSFAFPLL KDLTFEQMAI RVEGDVEGVA LPKAAFGRDV
SDGKLKVVLD QNGMDISGTA TLASVPVALV WRENFVDAPF LSRYKVKGTL DAKGMASLGI
DPALISPPYG GGVAKADLTY IRLPGGRATL EGRLDLTETV LDLEAFGWRK PAGSLGAAKV
SARLGGKDDR LDIDMTAEPQ MTAKASVFLN AKGDEIKRVD VDSFVVGNTR LAGSVSLGGS
AGPDIRIGDG VLDLRPYFDR RRGAATSPTP AEGVAAKPTK DLPAFSATVT LRQIMLANDV
VFEHVTARAA RDAHHWRQAV VNAAVRGSSP LALALEPKGT QRRFTLSAAD AGTILRGFDV
IETVAGGNLS VEALSDPGGV ARGRVLVKDF RLQKAPVLAQ ILSVAGLTGI LDVLGGQGIA
FSTLVVPFVY DDPVVTVSEA QAYGNAIGLT AEGAINLDDE RLSLKGTVVP AYAINSLLGK
IPLLGSIIVG GKGQGVIGVN YAVSGDVSKP AISVNPLSAL TPGFLRGIFK IFDQPDQTAT
EKVGGTGG