Gene RSP_2972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2972 
Symbol 
ID3720386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1657164 
End bp1660448 
Gene Length3285 bp 
Protein Length1094 aa 
Translation table11 
GC content73% 
IMG OID640071159 
Producthypothetical protein 
Protein accessionYP_353034 
Protein GI77463530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGA GCGGCGAGCG GCAGGCATCG GAGCGGAAGG GGCGGTCGCG GCGCGGCCGG 
GCGTCGCTGT GGTTGCTGCT CAGTCTCGCG CTGGTGGCGG CCGTCGCGTG TTTTGCCACG
CTGGCCTTCA CGGGGCGGCC GCTGCCCTTG CCCGGCTGGG CGGTGACCGA GGCCGAGACG
CGGATCAACC GGGCGCTGGA GCCTGCGCTG TCGGTCTCGC TCGGGGGGCT CGTGCTGACG
GTGGAGCCGA ACTGGATCCC GCGGCTCCGG CTCGACGATG TGCGCCTGCG GCAGGCCGAC
GGGCGCACGC TCGTCACGCT GCCCGAGGCG CGGGTGGTGT TCGACCGGGG CGCCGCGCTG
CGCGGCGCGC TCCATCCCAA GACCATCACC CTCTCGGGCG CGCGCATCGC GCTGCGCCGG
CTGGCCGACG GGCGCTTCGA CTTCGCCATG GGCGAGCAGG GCGGCACCTT CCGCCTCGCA
AGCTATGCCG AACTGCGCGA GACGGTGGAC CGGCTCTTCG CCCAGCCGGT GCTGGCGGAC
CTCCGGCGGA TCGAGGCGGA GGGCACGACG CTGACGCTCG ATGACCGCCG CGCGGGCCGG
ACGTGGCAGG CGGGGGACGG GCGTCTCACG CTGGTGAACG GGCCCGACCG GCGCGCGCTC
GAGATCGGCC TGACACTTCT GAATTCCAAG GGCCGCGCGC CCGCGCAGGC GCTCGTGACC
TTCGTCACCA CGCCCGGCAG CCCCGAGGCG CGGATCTCGG CCACGGTCGA TCATGTGGCG
GCTGCCGACA TCGCCGCGCA GGCCGCCCCC GTGGCCTGGC TCGAGGTGCT GGATGCGGCG
CTCTCGGGCC GCTTCTCGGC GGCGCTCGAC GGCGAGGGGC GGCTCACGCG GCTCGAGGCG
GGGCTCGACA GCGGCGAGGG GGCGCTGCAG CCGCGGCCCG AGACCAGACC CATCGCCTTC
GACAAGGCCG GGCTCGCCTT CGCCTACGAT CCGGGACGCG AGCGGTTGAA CCTCACCCGG
CTCGAGGTGC AGGGCCGCTC GATCCGGCTT TCGGCGCAGG GGCACGCCTA TCTGCCGGGG
GTCTCGCGCG GGTTGCCGAG CGAGATCCTG GCGCAGATCC GGGTCGAGGA TGCCAGCGCC
GACCCCGAGG GCCTGTTCGA GACGCCGGTC CATTTCTCGG AAGGGGCGCT CGATCTGCGG
ATGCGGCTCG ATCCGTTCCG GGTGGACATC GGGCAGCTCG CGCTGGTCGA GCAGGGGCGG
CGCCTGTCGG GGCGCGGCCA TGCCTCGGCC GAACCGGGCG GCTGGCGCGT GGCCTTCGAC
CTCGGGCTGA ACGAGATCTC GCATTCCGAC CTGCTCGCGC TCTGGCCGCT GTCGCTGGTG
TCGAAGACCC GGGAATGGCT GGAGGAGAAT GTGCAGGAGG GGCGGCTGTT CGAGGTCGAG
GCCGGCCTGC GCATGAGCCC CGGCCACGAG ACGCGCCTGT CGCTCGGCTA CGAGTTCCGC
GACGGGGACG TGCGCTATCT CAAAACCCTG CCGCCTATCG AGAAAGGGTC GGGCTATGCC
TCGATCGAGG ATCGCCGCTA TCTGATGGTG CTGGAGGAGG GGCAGGTTAC GCCCCCCGCA
GGCGGGCCGA TCCGGGTCAC GCGCTCGGTC TTCGAAGTGC CGGACGTGAC CGAGAAACCG
GCGCAGGCGC GGATCTCGCT CAACAGCGAG AGCGGCGTCA CGGCGGCCCT GTCGCTGCTG
GACCAGCCGC CGTTCCGCTT CCTCGAAAAG GCCGGCCGCT CGGTCGATCT GGGAGAGGGC
GTCGCGGTGA TGGAGACGGC GCTCTCCCTG CCGCTGAAGC GCAAGGTCGA GCCCGAGGAT
GTCGAGTTCT CGGTCCGCGG CACGATCACC GACTTCCGCT CCGACACGCT GGTGCCCGGC
CGCCGCATCG TGGCGCCCCG CCTTGCGCTC GAGGCGGAGC CCGAGGGTCT GACCGTGACC
GGCGCGGGCA GCTTCGGCCG GGTGCCGTTC GATGCGACCT ACCGGCTGGC CTTCGGCCGC
GAGGCGGAAG GACGGTCGTC GGTCGAGGGC ACCGCCACCC TCTCGCCTGC CGCCGTCGAG
GAGTTCAAGC TGGGTCTGCC CGCAGGCACC GTCGAGGGCC GCGCGCCCGG CCGCTTCCGG
GTGGAGATGG AGAAGGGCCG CGATCCGCGG CTCACGCTCT CGTCGGACCT CGTAGGGCTG
CGCACGGGTC TGGCCGCGAT CGGCTGGTCG AAGCCCGCGA ACCGCGCGGG CCGGCTCGAG
GTCGAGGCGT CGCTGGGCAA GCCGGTCACG GTCGGCAAGC TCGTGCTGGA GGGGGGCGGG
CTCGCGGCCT CGGGCCGGGT CGATCTGCGC GCCGACGGCG GGCTGGATGC CGTGCGCTTC
GACCGGGTGC GGCTGAACGG CTGGCTCGAT GCGCCGGTGA CGCTGGTCGG GCGGGGCGCG
AACCAGCCGC CCGAGGTGCA GCTGCGGGGC GGGTCGGTCG ATCTCACGCG GCTGGGCGAT
CTGGGCGGCG GGGGCGGCAG CGGGGGCACG CCGGTGCCGA TCCTCGTCGC GCTCGACCGG
CTGCAGGTCA CGTCGGGCAT CGCGCTGACC GGCGTCGAGG GGCGCTTCGG CACCCGCGGC
GGGTTCAACG GCGCCTTCCG CGGACGGGTG AACGGGCGCG CCGTCGTCGA AGGGTCGGTG
GTCCCGATGA GCGGGCGCAG CGCCGTGCGG CTGAGATCCC GGGACGCGGG CGGCGTGATC
GCCTCGGCGG GGATCTTTCC CGATGCGCGG GGCGGCGATC TCGACCTGAG CCTCATGCCC
GAGGGACGCG ACGGCTACCG CGGACGGGCG GCGGTCTCGA ACTTCCGCGT GACGAATGCG
CCGGTGCTGG CGGCACTCCT CAATGCGATT TCGGTGGTGG GCCTTCTCGA GCAGCTGAAC
GGCGACGGGC TGCTCTTCGC CGAGGGCGAT GTCCGGTTCC GCGTCCAGCC GGGGGCGGTC
GAGATCTCCG AGGCCTCGGC GGTGGGGGCC TCGATGGGGG TCACGCTGGA GGGGCTCTAT
CGCACCGCCG ACCGGCGGCT CGACCTGCAG GGGGTGATCT CGCCCATCTA TCTGCTGAAC
GGCATCGGCT CGGTGCTGAC CCGGCGCGGC GAGGGGCTGT TCGGTTTCAA CTATTCCGTG
ACCGGATCGG CCGACCGCCC GGCCGTCTCG GTGAACCCGC TCTCGATCCT CACGCCCGGC
ATGTTCCGCG AGATCTTTCG CCGACCGGTG CCGGTTCTGC CCTGA
 
Protein sequence
MAESGERQAS ERKGRSRRGR ASLWLLLSLA LVAAVACFAT LAFTGRPLPL PGWAVTEAET 
RINRALEPAL SVSLGGLVLT VEPNWIPRLR LDDVRLRQAD GRTLVTLPEA RVVFDRGAAL
RGALHPKTIT LSGARIALRR LADGRFDFAM GEQGGTFRLA SYAELRETVD RLFAQPVLAD
LRRIEAEGTT LTLDDRRAGR TWQAGDGRLT LVNGPDRRAL EIGLTLLNSK GRAPAQALVT
FVTTPGSPEA RISATVDHVA AADIAAQAAP VAWLEVLDAA LSGRFSAALD GEGRLTRLEA
GLDSGEGALQ PRPETRPIAF DKAGLAFAYD PGRERLNLTR LEVQGRSIRL SAQGHAYLPG
VSRGLPSEIL AQIRVEDASA DPEGLFETPV HFSEGALDLR MRLDPFRVDI GQLALVEQGR
RLSGRGHASA EPGGWRVAFD LGLNEISHSD LLALWPLSLV SKTREWLEEN VQEGRLFEVE
AGLRMSPGHE TRLSLGYEFR DGDVRYLKTL PPIEKGSGYA SIEDRRYLMV LEEGQVTPPA
GGPIRVTRSV FEVPDVTEKP AQARISLNSE SGVTAALSLL DQPPFRFLEK AGRSVDLGEG
VAVMETALSL PLKRKVEPED VEFSVRGTIT DFRSDTLVPG RRIVAPRLAL EAEPEGLTVT
GAGSFGRVPF DATYRLAFGR EAEGRSSVEG TATLSPAAVE EFKLGLPAGT VEGRAPGRFR
VEMEKGRDPR LTLSSDLVGL RTGLAAIGWS KPANRAGRLE VEASLGKPVT VGKLVLEGGG
LAASGRVDLR ADGGLDAVRF DRVRLNGWLD APVTLVGRGA NQPPEVQLRG GSVDLTRLGD
LGGGGGSGGT PVPILVALDR LQVTSGIALT GVEGRFGTRG GFNGAFRGRV NGRAVVEGSV
VPMSGRSAVR LRSRDAGGVI ASAGIFPDAR GGDLDLSLMP EGRDGYRGRA AVSNFRVTNA
PVLAALLNAI SVVGLLEQLN GDGLLFAEGD VRFRVQPGAV EISEASAVGA SMGVTLEGLY
RTADRRLDLQ GVISPIYLLN GIGSVLTRRG EGLFGFNYSV TGSADRPAVS VNPLSILTPG
MFREIFRRPV PVLP