Gene Rru_A1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1967 
Symbol 
ID3835391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2274402 
End bp2275793 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content70% 
IMG OID637826066 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_427054 
Protein GI83593302 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCCAT CCCTGCTTTT TGAAGCCGTT CTCGCCCAGC CCGGACAGGG TGTCGGCCGC 
GCCTGCGCCC TGCTCTGCGC CCTTGGCGCC GAAGTCTTGT TGCCCGGGGC GGTCTCGATC
AGCCTGCGCT GCTCCAAAGC CTTGGCCGAT CGCCTGGGGC CCGACCGCCT GATCCAGGCC
GATGCCGTGG TGCGCAAGCG CTATGACCTT GCCCAGGATC AACCCTGGCT GTGCTCGGAT
GGCCCTGATC CGACGCTAAT GCGCGTGGCC TTGCCCGCGC TTGCCGAGGG GGTCAGCGGC
TTTGTGCGCA TTCCGCCCTG CGCCGCCCAG CCGCCGCTGC CCGGCCGGCC GATCCACGCG
CCGCGCCCGC CGCTGTCGCC GCTGCCCCCG GACGTACCCT ATTTCCATTT ACAGGCACCC
GAGGATCTGA GCGCCCTGCT CGGCGCCCGA GACCTTCACC AGCGCGGGGA GACCGGCCGG
GGGGTGCGGG TGGCGGTGGT CGATAGCGGC TTTTACGCCC ATCCCTGGTT CGGCAGTCGC
GATCTTGGCG TTCGCGTGTT GCTCGCCCCC GGGGCCCGGG CGCCGGCTTG CGACGAAATC
GGCCATGGCA CCATGGTCTG CGCCGCCTTG TTGTCGCTGG CCCCGGGCGC CCAGGTCACC
CTGGTCAAAC AGAACGGCGA CGACGAGGTT TTCGTCGCCT TCAAACTGGC GCTGAGCCTA
TCGCCCGACA TCGTCCAGAA TACCTGGGGC TACAGCTTGG CCGAGGGACG GCTGGGAGCG
GCCGAAGACC TGATCGCCGC CACCTTGGAA GACGCCATCG CCCGGGGGAT CCTGGTGGTT
TTCGCCGGTG GCAACGGCGG TCTGCTGTAT CCGTCGCAGC GGCCCGAGGT TCTGGCCATC
GGCGGCGTCT TCCAGGGGGA AGACGGCGCG CGACAGGCGG CGAGCTACGC CAGCGGCTAT
CTCAGCAAAC TGTTCCCCGG CCGTCGGCTG CCCGATTTCT GCGCCCTGGT CGGCATGGCG
CCGATGGGGG TCTATCTGGC GATGCCGACC CAACCGGGCG GACAGATCGA CCGCGCCTTC
GCCCAACAGC CCTATCCCGA GGGCGACGAT ACGCCGCCGA CCGACGGTTG GGTGGTGATC
AGCGGCACCT CGGCGGCTTC GGCCCAGGTC TCGGGCCTCC TCGCCCTGCT GCGCGCCCGC
GCTCCCGGCT TGTCCCAGGA ACGGGCGCGG GCGCTTCTGG CCAAAACGGC GCGCGCCGTT
CACCACGGCG CCTCGGCCCA GGGCAACCCC GCCGGACCGG GCCAACCCAA TGTGGCGACC
GGCTATGGTC TGGTCGACGC CCAGGCCGCC TTTGAGGCGC TTCCCTCGGT CATCGATACG
ATCCCCCCTT GA
 
Protein sequence
MAPSLLFEAV LAQPGQGVGR ACALLCALGA EVLLPGAVSI SLRCSKALAD RLGPDRLIQA 
DAVVRKRYDL AQDQPWLCSD GPDPTLMRVA LPALAEGVSG FVRIPPCAAQ PPLPGRPIHA
PRPPLSPLPP DVPYFHLQAP EDLSALLGAR DLHQRGETGR GVRVAVVDSG FYAHPWFGSR
DLGVRVLLAP GARAPACDEI GHGTMVCAAL LSLAPGAQVT LVKQNGDDEV FVAFKLALSL
SPDIVQNTWG YSLAEGRLGA AEDLIAATLE DAIARGILVV FAGGNGGLLY PSQRPEVLAI
GGVFQGEDGA RQAASYASGY LSKLFPGRRL PDFCALVGMA PMGVYLAMPT QPGGQIDRAF
AQQPYPEGDD TPPTDGWVVI SGTSAASAQV SGLLALLRAR APGLSQERAR ALLAKTARAV
HHGASAQGNP AGPGQPNVAT GYGLVDAQAA FEALPSVIDT IPP