Gene Rru_A3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3520 
Symbol 
ID3836974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp4052063 
End bp4053367 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content65% 
IMG OID637827642 
Producthypothetical protein 
Protein accessionYP_428601 
Protein GI83594849 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.365519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGGG CCGAAAAGGG CCCACAGCCG CGCCGCCCGG ATTTCACGGA AGCAGGGGTT 
ATACAGAACC GCGCCGCGGG GCGCAACAGG AGTCCTGATC CTTCGCCGGA CGGGGCGCAA
GCAAGACGCC GGAGCTGGCG CAACCAAGGG GCTTGGCTAT CGTTTGTGGC GTGGCCGCAC
CGCAGGGGTT CGGCCACAGT GGAGCCCCCC CCTTTGACCC TTCGCCAACA ACTGACAACG
GTCCGCTGGC TGTTGACCGG CCTGTTCGCC ATGATGCTGC TGGTCTTTTT GTACTTCGGC
CGCGACGTGC TGATGCCGAT CGTTCTCGCC TTGCTGCTCG CCCTGATGCT GAGTCCGATC
GTCCGCCTGC TGCGCCGGCG CAAGGTGCCC GAGGGGGTGG CGGCGGTGGT GGTCACCCTG
GGCTTCGCCG CCGGCATGCT GCTGATCGGC TTCACGATCA GCGGTCCGGT GGCGACGTGG
CTCGAGGATG CGCCCTCGAT CGGCAAGCGC ATCGCCCACA AGCTGTCGTC TTTGCGCGAA
TCCTTCGATG TGGTGCTCGA TGCCAGCCGT CAGGTCGAAG AGGCGGCCGA TACCTCGTCC
AAGGACAATA CCGATCGCGT GGTGATGGCC CAGCCCGGTC TGCTGGTCAA AGCCGCCGAT
AGTCTGGCCT CGGGCTTCAC CACCTTTGGC GTGACCATCG TCATCACCCT GTTCTTCCTG
GCCTCGGGCA GCATGTTCAC CGAGAAGATC GTTCACGTGA TGCCGCTGTT TCGCGACAAG
CTGCGCGTGC TCCGCGTCGT GCGCGATATC GAGACCGAGG TGTCGTCCTA TCTGCTGACC
GTGGCGATGA TCAACGCCGG TCTGGGCGTG GCCGTCGGCT GCGCCCTGTG GCTGGCGGGG
ATGCCCAACG CCTTCCTGTG GGGCGGCATG GCCATGGTGC TTAACTTCCT GCCCTATATC
GGCTCGATGA TCGGTATTTC CCTGGTCGCC CTGGTGTCCT TCGTCACCTT CGATTCCCTG
GGCCAAGCCC TGATCCCGCC GCTTAGCTAT CTGGCGGTGA CCTCGGTCGA GGGGCAGTTC
CTTACCCCAG CGATCGTCGG CCGCCGGCTG GAGCTCAACG CCTTGTCGGT GATGCTGGCC
GTGTTGTTCT GGGCGTGGCT CTGGGGGATC GTCGGCGCCC TGATCGCCGT GCCGCTGCTG
GTCGTCGTCA AGGTCGTGTG TTCCCATTTC GAAGGACTGG CCGTGGTCGG CGAGTTCCTC
TCGGCCCGCC GGCCGTTTTC CGATCAGGTC AAGGACGGCG CATAG
 
Protein sequence
MPGAEKGPQP RRPDFTEAGV IQNRAAGRNR SPDPSPDGAQ ARRRSWRNQG AWLSFVAWPH 
RRGSATVEPP PLTLRQQLTT VRWLLTGLFA MMLLVFLYFG RDVLMPIVLA LLLALMLSPI
VRLLRRRKVP EGVAAVVVTL GFAAGMLLIG FTISGPVATW LEDAPSIGKR IAHKLSSLRE
SFDVVLDASR QVEEAADTSS KDNTDRVVMA QPGLLVKAAD SLASGFTTFG VTIVITLFFL
ASGSMFTEKI VHVMPLFRDK LRVLRVVRDI ETEVSSYLLT VAMINAGLGV AVGCALWLAG
MPNAFLWGGM AMVLNFLPYI GSMIGISLVA LVSFVTFDSL GQALIPPLSY LAVTSVEGQF
LTPAIVGRRL ELNALSVMLA VLFWAWLWGI VGALIAVPLL VVVKVVCSHF EGLAVVGEFL
SARRPFSDQV KDGA