Gene Rru_A0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0104 
Symbol 
ID3833859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp117743 
End bp123022 
Gene Length5280 bp 
Protein Length1759 aa 
Translation table11 
GC content71% 
IMG OID637824175 
ProductAlpha-2-macroglobulin-like protein 
Protein accessionYP_425196 
Protein GI83591444 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGTCTGT CGCTCCGGGC CTTTGGGGCT TTGCCCGGCC GCGCCCGGCG GTTGGGGATC 
GCCACCCTTT TCGGGCTGAT CGCCCTGGTG TTGGCCCTGC CCTTGGTTGC CCGCGCCGCC
GATCCCGTCG CCTCGGACGC CGCCACCGCC CAGGCCCTGG CCGATCTGGG CCGCGAGGCC
GAGGGTCTGA AGGCCAATGT GCTCTATGTC GTTGAAAACG ACGGCCAGAG GGCGCAGGCC
AAGGCCGCCG AGGCGCGCGG CAAGGCCGAG CGCGGAGAAT GGGATGGCGC CGTCGACGCC
TGGGAGCAGG CCCTGCGCGC CGGACTGAAA CAAGCCGATC GCTGGCTGAT TCTCGCCGAT
TTCTATGACA AGGCCGAAGG GGCCGAACCG CGCGCCGTGG CCGCCGCCTT TCTGGCGATC
GCCGAGGCCG CCGGCGGGGA AGCCCGCGCC CTGGCCCAGA CGCGGACCGC CGAATTGCTG
ATGGCCGCCG AGCGCGATGA CGTGGCCTTG GCGTTGTTCC GCCGCGCCCA GGCCAGCCAT
CCCACCCCGC GCGCCGCCAA CGGCATCGCC GCCCTCGGCG GAACCGGTTT GGCGATCCGG
CGGACTCTGG TTGAAGCCGA CCGCGATCTG CCCCGCTTGT GTCTGGATCT GACCGCCGCC
CCCGCCGCCA TCGATCTCGC GCCCTATGTC AAGATCGAGC CCGCCGTGCC GGTGGCGGTT
AGCGCCACCG GCAAGCGCCT GTGCATCGAC GGCCTGCCCC ACGGCCAGAC GGTCGGCGTG
ACGCTGCGCG AAGGGCTGCC GACGCTGACC GGCGCCACCT TGGCCCGCTC CTTGACCCTT
GAGGCGCGTA TCGCCAACCG CGCCCCCCGC GTCGCCTTCG CCGGCAACGC CTATATCCTG
CCGGCGCGCG GGGCCCGCGA TGTCGCGGTG CGCACGGTCA ATGTCGATCA ACTGTGGTTG
ACCCTGCTGC GGGTCAACGA CCGGGGCTTG GCCGGACTGC TGCGCGACCA TGATTTGAAT
TCGATGGTGT CGGGCGGCGA TCTGCGCGAC ATCGCCGAGG ATTCGGGCGA AAAGATCTGG
GAAGGCAAGC TCGATATCGA CGCCGCCCTG AACCGCGAGA CGGCGACGGC CATTCCCCTC
GCCACCATCC TGCCCGATCC CCGCCCCGGT CTTTACCTGC TGGCCGCCGA GGACCGTTTG
CGCGGCTCGA CGCCCTGGTG GAACCAGCCT TCGCAATGGC TGCTGGTCAC CGATATCGGC
TTGCAGAGCG CGGTCGGCCT GGATGGGTTG TCGGTTTTCG CCCGCTCGCT GTCGAGCGGC
CGGGCCCTGG TCGGGGCCTC GGTCACCCTG GTCGCGCGCA ACAACGCCGA GGTCGCCAGC
GCCGTTACCG ATCGCAACGG CATGGTCCGC TTCCAGCCCG CCCAGTTGAG CGGCCGCGAC
GGCAAGACGC CGACCTGGGT GATGGTCTAT GGCCCGGCCG GCGATTTCGC CTATCTCGAT
GTCACCGGTC CGGCCTTCGA TCTGTCCGAT CGCGGCGTCG GCGGCCGCGA GGCCCCCGGT
CCGCTCGACG CCTTCCTCTA TACCGAGCGC GGCATCTACC GGCCGGGCGA GACGGTTCAT
CTCGCCGGGT TGTTGCGCGA TTCGGGGGCG CGCGGGCTGT CGGGCCTGCC TTTGACCGTG
AAGATCCTGC GTCCCGATGG CGTTCAGGTC GACCGCCAAG TCTTGAGCGA TCAGGGGGTG
GGAGCCTATG GCGCCGATAT TCCCCTTCTC AAGGAAGCGC GCAGCGGACG GTGGGAGATC
ACCGCCCACG TCGATCCCGC CGCGCCGCCG ATCGGCCGGG TGTCGTTCGT GGTCGAGGAT
TTCGTGCCCC AGACCATGGA GGTCACCCTC AAGACCACCG CCACGGCCCT TGATCTGACG
ACGGGCGAGC CGTCGGCAGC GACGGCTTCG GTCGCGGTGC AGGCCGATTA TTTCTACGGC
GCCCCGGTGG CGTCGCAGCC GGTGACCTCG GAAGTGGTGC TGAGCGCCGA TCCCGAGCCC
TTCGCCGCCT TCAAGGGCTT CACCTTCGGT CTGGCCCAGG AGGAGGTCGA ACCCCGGCGC
TTCGATCTGG AGCCGACGCT AACCAACGCC AAGGGCGCGG CCACCCTGGC CGTCGCCCTG
GATGGCGCGC CCGATACCTC GCACCCCTTG CGCGCCCTGG TGCGCACCAG CGTGATCGAT
GCCGGGGGCC GGGGGGCGGC GCGCACCCTG GTGCTGCCGG TTCGCCATCA GCCGATGGCC
GTCGGCATCA AGGCCCGTTT CGAGGGGGAC TCCCTGGCCG AGGGGGCGAC CCCGGCCTTT
GATCTGGCCG CCGTGAACCC GGCCGGCGCC CCCGTCGGCC AGCGGCCGCT CGATTGGGTG
CTCTATGAAG AGGTCAGCCG CTACACCTGG TATGAAAACG GCGGCGCCTG GAACTACCGC
CGCACCGTCA GCGACGAGGT GCGCGCCTCG GGACGGGTGA TGACCGCCGC CAATGGTCTG
GCCGAGGTCG CCGTGCCCGT CGGCTTTGGC GCCTTCCGCC TTGAGGTGTC CGATGGCGGC
GGGTCGCTCG CCGCCTCGTC CCAGCGCTTC CGCGCCGGCT GGTGGGTGGC CGGGTCCGAC
AGCCGCGATA CCCCCGATAC GCTGAAGGTC ACCCGCGAGG CCGAGGCCTA TCGGCCCGGC
GAGCGGGCGC GTCTGCGCCT GGAGGCGCCT TTCGCCGGTC ACGCCCTGGT TTCGGTGGTT
ACCGATCGTT TGCTTGATGT GCTTGACGTG CCGCTGCCCG AAGGCGGGCT GACGGTGGAA
TTGCCGGTCA CCGAGGCCTG GGGGGTGGGC GCCTATGTGG TGGTCACCGC CTTCCGCCCC
GGGGAGAACG CCGGATTGCG CGGTCCCGGG CGGGCGATGG GCGTGGCCTG GGTGCCCGTC
GACATGACGC GGCGAACCCT GTCGGTTGCG CTCGAGGCGC CGCAAACCGT GTTGCCGCGC
CAGCGTCTGG AGGTGCCGGT TCAGGTGGCG GGCGGCCAGG GTCCGGTGTT CCTGACCCTG
GCCGCCGTCG ATGAGGGCAT CTTGCAACTG ACCGATTTCC AAAGCCCTGA TCCGGTCGCT
TATTATTATG GCAAGCGCAG CCTGGGGCTG GCCTACCGGG ATGCTTATGG CCGGTTGCTT
GAAGGCACCA ACGCCCGCCC GGGCCGTTTG CGCGAAGGCG GCGACGCCTC GGGGCGTCAC
CTGATGGGGC TGCCGCGCAG TTCGGTTGAA ACCGTCTCCC TGTTCTCGGG GATCGTCGCC
GTCGGCAAGG ATGGGCGGGC GCTGGTGCCC GTCGATATCC CCGATTTCAC CGGCCGGCTG
CGGCTGATGG CCGTGGCCTT TTCCGGCGCC GGCGTCGGTC ACGGCGAAGC GGCGGTAACG
GTGCGCGATC CGATGGTGGC GATGGCGACG CTGCCGCGCT TCCTGGCGCC GGGCGATCGC
TCCTTCCTTA GCGTCTCGCT TGATAACATC GATGGCCCGG CCGGCGTCTG GACGCTTGCC
GCCGTGGGCG AGGGCGCCTT GAGCGTGCCC GAGGGGCCGC GCGCCGTCGA TCTGGCCAAG
GGCGGCCGCG ATCTGGTGCG TCTGGCCGTG GAGGGGCTCA ACCCGGGCCC GGGCACCTTG
CGGCTGGCGA TCACCGCGCC GACCGGCGAG CGCCGGGAAA AGACCTGGAG CTTCGACGTG
CGCCCCGGCG CGCCCCGGAT GACGCTGGCC CATCGCCTCG ATCTCGGCGG TCACGGCCGC
GCCCGTCTGG ACGGCGGCCT GCTTGAGGGC TTTTTCCGCC AGGGCGTCGA AGGGACTTTG
ACCCTGTCGT CGGTGCCCAA CCTCGATCCG ATCGGCAATG CCTCGGCCCT GCGGGCTTAT
CCCTATGGCT GCCTGGAACA GACCGTCAGC CGGGCCTGGG TGGCGCTTTA TGGCCATGAC
CTGGATACCC CGGCGCTGTG GAAGGGCTGG AGCGGCAAGG ACGTGCTGGC GACCGAGATC
GCCCGGGTGA TCGCCCTCCA GCGCCCCGAT GGCGGCTTCG CCCTGTGGTC CTCGTCGGGG
GGGCTTGATC CCTGGCTGTC GGCCTATGCC CTGGAATTCC TGCTGCGCGC CCGCGAGGAA
AAGGCCGCCG TGCCCGATTT CGTCGTGGAA CGCGGTTTGG CCTATCTCGA TGACGCCATC
GCTGATGGCG ATTTCTCCGA TGCCGGCCTG CCGGCGGCGG CCTATGCCCA TTACGTTCTG
GCCCGCGCCG GGGCGATGGA CCTGGGGCGC TTGCGCTATT TCGCCGATAC CTATCTGGCG
CGGATGCCCA CGCCCTTGGC TCGGGTGCAA ACCGCCGCCG CCCTGTCGCT GCTGGGCGAC
GGACCGCGCG CCGTCGCCGC CCTGAAACAG GGGCTGGGCG ACCGGGCGCG GTTGGTCAGC
GATCGCGACG ATTGGGATTA TGGCTCGGCC CTGCGCGACC GCGCCGCTAC CCTGGCGCTC
GGCATCGAGA CCGGCCTGCT CGGCGATGAT GCCCTGGCGA CGGCCGATAC CCTGGCCGAC
ACCCTGGCCC GCACCCCCCA TCCGTCGACC CAGGAACGCG GCTGGCTGGT GCTCGCCGCC
CGCGCCCTGG CCCGCCAGCA GGGATCGGTC GCCGCGACGG TGGACGGTGT GGGCGTCGAT
AGCGGCCGCA CCACGGTGAC CCTGCCCCTG AGCATCCAAG ACCTGGGCGA AGGGGTGGAT
ATCGTCAATG GCGGCGACCG GGCGCTGCGC GTCGTCGCCT CGCTTGACGG CCAGCCCGAG
GCGCCGCCGC CGGCGGCCAG CGAGGGGTTG ACCATCGACC GTCGCTTCCT GACCCTGGCC
GGCGAGGCGG TGGATCCGAC CAAGGTGCGC CAGAACGATC TGCTGGTGGC GCTGATCACC
GGCTCGGCCA CCGATCCCAA GGTCATCGCC CGTGAAAGGG GCAATCGCCT GCTGGTGGTC
GATCTGCTGC CCGCCGGGGT CGAGATCGAA AACCCGCGGA TCGGCGCCGG GGCGCCGGGG
AGCGAGGGGC TGTCCTGGCT GCCGGAGCTG ACGGACACCA CCCACGTCGA AGCCCGCGAC
GATCGCTATG TCGCCTCCGT CGATATCGAC GCCAAGACGC CGACGTTCAC CCTGGCCTAT
GTGGTGCGCG CCGTCAGCCC GGGAAGCTTC GTTGTTCCCG GCGCCGCCGT CGAGGATATG
GATCGCCCGG CCTTGCGGGC TAATCAGGGG GCGGGGCGGC TGTCGATCGC CGCCCGCTGA
 
Protein sequence
MGLSLRAFGA LPGRARRLGI ATLFGLIALV LALPLVARAA DPVASDAATA QALADLGREA 
EGLKANVLYV VENDGQRAQA KAAEARGKAE RGEWDGAVDA WEQALRAGLK QADRWLILAD
FYDKAEGAEP RAVAAAFLAI AEAAGGEARA LAQTRTAELL MAAERDDVAL ALFRRAQASH
PTPRAANGIA ALGGTGLAIR RTLVEADRDL PRLCLDLTAA PAAIDLAPYV KIEPAVPVAV
SATGKRLCID GLPHGQTVGV TLREGLPTLT GATLARSLTL EARIANRAPR VAFAGNAYIL
PARGARDVAV RTVNVDQLWL TLLRVNDRGL AGLLRDHDLN SMVSGGDLRD IAEDSGEKIW
EGKLDIDAAL NRETATAIPL ATILPDPRPG LYLLAAEDRL RGSTPWWNQP SQWLLVTDIG
LQSAVGLDGL SVFARSLSSG RALVGASVTL VARNNAEVAS AVTDRNGMVR FQPAQLSGRD
GKTPTWVMVY GPAGDFAYLD VTGPAFDLSD RGVGGREAPG PLDAFLYTER GIYRPGETVH
LAGLLRDSGA RGLSGLPLTV KILRPDGVQV DRQVLSDQGV GAYGADIPLL KEARSGRWEI
TAHVDPAAPP IGRVSFVVED FVPQTMEVTL KTTATALDLT TGEPSAATAS VAVQADYFYG
APVASQPVTS EVVLSADPEP FAAFKGFTFG LAQEEVEPRR FDLEPTLTNA KGAATLAVAL
DGAPDTSHPL RALVRTSVID AGGRGAARTL VLPVRHQPMA VGIKARFEGD SLAEGATPAF
DLAAVNPAGA PVGQRPLDWV LYEEVSRYTW YENGGAWNYR RTVSDEVRAS GRVMTAANGL
AEVAVPVGFG AFRLEVSDGG GSLAASSQRF RAGWWVAGSD SRDTPDTLKV TREAEAYRPG
ERARLRLEAP FAGHALVSVV TDRLLDVLDV PLPEGGLTVE LPVTEAWGVG AYVVVTAFRP
GENAGLRGPG RAMGVAWVPV DMTRRTLSVA LEAPQTVLPR QRLEVPVQVA GGQGPVFLTL
AAVDEGILQL TDFQSPDPVA YYYGKRSLGL AYRDAYGRLL EGTNARPGRL REGGDASGRH
LMGLPRSSVE TVSLFSGIVA VGKDGRALVP VDIPDFTGRL RLMAVAFSGA GVGHGEAAVT
VRDPMVAMAT LPRFLAPGDR SFLSVSLDNI DGPAGVWTLA AVGEGALSVP EGPRAVDLAK
GGRDLVRLAV EGLNPGPGTL RLAITAPTGE RREKTWSFDV RPGAPRMTLA HRLDLGGHGR
ARLDGGLLEG FFRQGVEGTL TLSSVPNLDP IGNASALRAY PYGCLEQTVS RAWVALYGHD
LDTPALWKGW SGKDVLATEI ARVIALQRPD GGFALWSSSG GLDPWLSAYA LEFLLRAREE
KAAVPDFVVE RGLAYLDDAI ADGDFSDAGL PAAAYAHYVL ARAGAMDLGR LRYFADTYLA
RMPTPLARVQ TAAALSLLGD GPRAVAALKQ GLGDRARLVS DRDDWDYGSA LRDRAATLAL
GIETGLLGDD ALATADTLAD TLARTPHPST QERGWLVLAA RALARQQGSV AATVDGVGVD
SGRTTVTLPL SIQDLGEGVD IVNGGDRALR VVASLDGQPE APPPAASEGL TIDRRFLTLA
GEAVDPTKVR QNDLLVALIT GSATDPKVIA RERGNRLLVV DLLPAGVEIE NPRIGAGAPG
SEGLSWLPEL TDTTHVEARD DRYVASVDID AKTPTFTLAY VVRAVSPGSF VVPGAAVEDM
DRPALRANQG AGRLSIAAR