Gene RSP_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3854 
Symbol 
ID4796448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_009007 
Strand
Start bp20963 
End bp25648 
Gene Length4686 bp 
Protein Length1561 aa 
Translation table11 
GC content75% 
IMG OID640102968 
ProductICE nucleation protein 
Protein accessionYP_001033817 
Protein GI125654623 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACT TCAGCAGCAA GACCACAGCA GAGATCGCGG CCCTGACGAG CGCCGAGGTC 
GCCAGCATGT CGAGCCAGGA TCTGGCCGCC CTCTCCACCG CCCAGATCGC GGCGCTCACC
GCCCAGCAGA TCGGCTGGGT CAAGGCGGCG TCGCTGAAGG GGTTGGGGGA TGCGCAGGTG
GTGGCGCTGA CGACGGCGCA GGCGGCGGCG CTCGGCTCGG CGCAGCTGGC CGCGCTGACG
ACGGCGCAGG TGGCGGCGAT GGAGACGGCC GATCTCGCGG CGCTCTCGGC CACGGGGGTG
GCGGGGCTGA CTTCGGCGCA GCTCGGGGGG CTCTCGACCG GTCAGGTGGC GGCCCTCACC
ACGGCGCAGG TCGCGGCCCT GTCCAGCGTG GCGGTCAAGG GTCTGGGCTC GGTCCAGGCC
TCGGGTCTCA CGACGGCCCA GGTGGCCGCC CTGTCGACCG CCCAGCTCAA AGCCTTCTCG
ACCGCGGGCA TGACGGGGCT CGGCACGGCG CAGATCGTGG CGCTCTCGAG CGCGCAGGCG
GCGGTGCTCG GCTCGGCACA GGTCGCGGCG CTCACGACGG CGCAGGCGGC GGCGATGGAG
ACGGCCGATC TCGCGGCCCT CACCAGCGTG GCGGTGAAGG GGCTGAGCTC GACCCAGGTG
GGCGCGCTGA CCACGGCGCA GGTGGCGGCG CTGACCACGG GACAGCTCGG CGCGCTCTCG
ACCGGGGCGC TGAAGGGCCT GACCACGGCG CAGGTGGTTG CCCTGACCAC GGCGCAGGCG
GCCGGGCTCG GCTCGGCGCA GGTGGCGGGC CTCTCGAGCA CGCAGATCGC GGCGCTGGAG
ACGGCGGATC TGGCCGCCCT CTCCTCGGCG GGGCTGAAGG GGCTCGGATC GGCGCAGGCC
GCGGGCTTGA CGACGGCGCA GGTGGCGGCG CTCACGACGG CTCAGGTGGG CCAGCTTTCG
AGTGCCGCGC TGAAAGGGCT CGGCACGGCG CAGGTGGTGG CACTGACGAC CGCGCAGGCG
GCGGCGCTCG GCACGGCGCA GGTGGGCGCG CTCTCGACCG CACAGGTGGC GGCGCTCGAG
ACCGTCGATC TCGCGGCGCT CTCGACGGCG GCAGCGAATG CCCTGACCTC GGCTCAGGCC
GCGAGCCTCA CGACGGCGCA GGTGGCCGCG CTGACGACGG CGCAGGTTGC GGCGCTCTCG
ACGGGGGCGG TGAAGGGGCT GAGCTCGACC CAGGCGGGCG CGCTGACCAC GGCACAGGTG
GCGGCGCTGA CCACAGGACA ACTGGGCGCA CTCACGACGG CGGCGCTGAA GGGCCTGACC
ACGGCGCAGG TGGTGGCACT GACCACGGCG CAGGCGGCGG GGCTCGGCTC GGCGCAGGTG
GCGGGGCTCT CGAGCACGCA GATCGCGGCG CTGGAGACGG CGGATCTGGC CGCGCTTTCC
ACGACGGGGC TGAAGGGTCT GGGCTCGGCG CAGGCCGCGG GCCTGACCAC GGCGCAGGTG
GCGGCGCTCA CCACGGCTCA GGTGGGCCAG CTCTCGAGTG CCGCGCTGAA AGGGCTCGGG
ACGGCGCAGA TCGTGGCGCT GACGACGGCG CAGGCGGCGG CGCTGGGGTC GACGCAGGTG
GCCGGGCTCT CGACCGCGCA GGTGGCGGCG CTGGAGACGG CCGATCTCGC GATGCTCTCG
ACCGCGGGGG TGAAGGCGCT GAGCTCGACG CAGGTGGGCG CGCTGACGAC GGGGCAGGTG
GCGGCCCTGA CCACGGCGCA GGCCGCCCAG CTCTCGACGG CGGCGCTGAA GGGCCTGAGT
TCGACGCAGG TGGCGGCCCT GACGACGGGG CAGGTGGCGG CCCTGACCAC GGCCCAGCTC
GGCGCGCTGA CGACGGCGGC GCTGAAGGGC GTGACCACGG CGCAGGTGGT GGCGCTGACC
ACGGCGCAGG CGGCGGGGCT CGGCTCGGCG CTGCTGGCGG GCCTGTCGAG CACGCAGATC
GCAGCGATCG AGACGGCGGA TCTGGCCGCG CTCTCCACCA CCGGGCTGAA GGGTCTGGGC
TCGGCGCAGG CGGCGGGCCT GACCACGGCG CAGGTGGCCG CCTTCACCAC GGCGCAGGTG
GGGCAGCTTT CGACGGCGGC GCTGAAGGGG CTCGGCACGG CGCAGGTGGT GGCGCTGACC
ACGGGCCAGG CGGGGGCGCT CGGCTCGGCG CAGGTGGCGG GTCTCTCGAC CGCGCAGGTG
GCGGCGCTCG AGACGGCCGA TGTCGCGGCG CTCTCGACGG CGGGGGTGAA GGGCGTGGGC
TCGGCGCAGG CGGCGGCCCT CGGCTCGGCG CAGGTGGCAG CGCTGACGAC GGCGCAGGTG
GGCCAGCTTT CGACCACGGC CCTGAAGGGC TTCGGCTCGG TGCAGGCTTC GGGTCTCACC
ACGGCGCAGG TGGCGGCGCT GACCACGGCG CAGCTCTCGC AGCTCTCGAC GGCGGCGGTG
AAGGGGCTCG GCACCGCGCA GATCGTGGCG CTGACCACGG GCCAGACGGC AGCGCTCGGC
TCGGCGCAAC TGGGCGCCCT CTCGACGGCG CAGGTGGCGG CCTTCGAGAC GGCGGATGCC
GCGGCGCTGA CCACGACGGC GCTGAAGGGG CTGACCACGG CGCAGGTGGT GGCGCTGACG
ACGGGTCAGG CGGCGGCGCT CGGCTCGGCG CAGGTCGCGG GCCTGTCGAG CACGCAGATC
GCGGCGCTCG AGACGGCGGA TCTCGCGGCC CTGACCACCA CGGCGGTGAA GGGCCTGGGC
TCGACGCAGG TTTCGAGCCT GACGACGGGG CAGGTGGCGG CGCTCACCAC CGTGCAGGTG
GCGGCGCTGA GCACGGCGGC CGTGAAGGGC GTGGGCTCGG TGCAGGCCTC GGGGCTGACG
ACGGCGCAGG TGGCGGCGCT GACCACGGCC CAGGTGGCCC AGCTCTCGAC GGCGGCGCTG
AAGGGGCTCG GCACGGCGCA GATCGTGGCG CTGACCACGG CCCAGGCGGC CAAGCTCGGC
TCCGATCAGG TCGCCGCCCT CTCGACGGCG CAGGTGGCGG CGCTGGAGAC GGCGGATCTG
GCGACCCTCT CGGCCACGGG CGTGAAGGGC TTCGGATCGG CACAGGCGGC GGCCCTCGGC
TCGGCACAGG TGGCGGCGTT CACCACGGCG CAGGTGGCGG CGCTGACCAC GGCGGCGGTG
AAGGGCTTCG GCTCGGTGCA GGCCTCGGGC CTCACCACCG CGCAGGTGGC CGCGCTGACC
ACGGCGCAGC TCTCGCAGCT CTCGACGGCG GCGGTGAAGG GGCTCGGCAC GGCGCAGATC
GTGGCGCTGA CCACGGGCCA GACGGCGGCG CTCGGCTCGG CGCAGCTGGG TGCCCTCTCG
ACGGCGCAGG TGGCGGCCTT CGAGACGGCG GATGCCGCGG CGCTGACCAC GACGGCGCTG
AAGGGGCTGA CCACGGCGCA GGTCGTGGCG CTGACCACGG GTCAGGCGGC CGCCCTCGGC
TCGGTGCAGG TGGCGGGTCT CACGACCGCG CAGATGGCGG CGCTCGAGAC GGCGGATCTC
GCGGCCCTCA CCACCACGGC GGTGAAGGGG ATCACCACCG CCCAGATGGG GGCGCTGACG
ACGGGACAGG TGGCAGCCCT CACCACGGCG CAGGTGGCCG CGCTTGCGGG CACGGCGGTG
AAGGGACTGT CCTCGACCCA GGCGGGGGCG CTGACGACGG CACAGGTGGC GGCGCTGACC
ACGGCCCAGG TGCCCCAGCT CTCGACGGCG GCGCTGAAAG GGCTCGGCTC GGCCCAGATC
GTGGCGCTGA CCACGGCCCA GGCGGCCGTC CTCGGCTCGG CGCAGCTGGG CGCCCTCTCG
ACGGTGCAGG TGGCGGCGCT CGAGACGGTC GATCTCGCGG CCCTGACCAC CGCGGCCGTG
AAGGGCCTCG GCTCGGCCCA GGTCGCGGGC CTGACCACGG GCCAGGTGGC GGCCCTCACC
ACGGCGCAGA TGGCCCAGCT CTCGACGGCG GCGATCGCGG GTCTGGGATC GGTGCAGGCC
TCCGGCCTGA CCACGGGCCA GGTGGCGGCC CTCACCACCG ATCAGCTCGC CCGGATCACC
ACCGCGGCGG TGAAGGGGCT CGGCACGGCG CAGATCGTGG CTCTGACCAC GGCGCAGGCG
GCGACGCTCG GCTCGGCGCA ACTGGCCGCG CTTTCGACGG CGCAGGTGGC GGCCTTCGAG
ACGGCGGATG TGGCGGCGCT GACCACCGCA GCGGTGAAGG GCTTCGGGAC AGCGCAGGTG
GCGGCGCTGA CCACCGGGCA GGCGGCCGCC CTTGGCTCGC GTCAGGTGGG CGCGCTTTCC
ACGGCGCAGG TGGCGGCGCT CGAGACGGCG GATCTCGCAG CCCTCACCAC CGCGGCCGTG
AAGGGTCTGG GATCGGCGCA GGCGAAAGTC CTGACGGCGG CGCAGATGGC CGCGCTCACC
TCGGCTCAGG TGGCGGCCCT CACCACGACC GCCGTGGCAG GCTTCGGCTC GGTGCAGGCG
GCGGCGCTCA CCACGGCGCA GATGACGGCG CTCACCACCG CGCAGATCCC CACCCTGACC
ACGGCCGCCA TCAAGGGTCT CGAAACCGCC GATATCGCGG CGCTCACCAC GACGCAGGCG
TCGGCCTTCA CGGCCACGCA ACTGGGGGCC ATGTCGAGCG CCCAGATCGC GGCTCTCTTC
CTCTGA
 
Protein sequence
MTDFSSKTTA EIAALTSAEV ASMSSQDLAA LSTAQIAALT AQQIGWVKAA SLKGLGDAQV 
VALTTAQAAA LGSAQLAALT TAQVAAMETA DLAALSATGV AGLTSAQLGG LSTGQVAALT
TAQVAALSSV AVKGLGSVQA SGLTTAQVAA LSTAQLKAFS TAGMTGLGTA QIVALSSAQA
AVLGSAQVAA LTTAQAAAME TADLAALTSV AVKGLSSTQV GALTTAQVAA LTTGQLGALS
TGALKGLTTA QVVALTTAQA AGLGSAQVAG LSSTQIAALE TADLAALSSA GLKGLGSAQA
AGLTTAQVAA LTTAQVGQLS SAALKGLGTA QVVALTTAQA AALGTAQVGA LSTAQVAALE
TVDLAALSTA AANALTSAQA ASLTTAQVAA LTTAQVAALS TGAVKGLSST QAGALTTAQV
AALTTGQLGA LTTAALKGLT TAQVVALTTA QAAGLGSAQV AGLSSTQIAA LETADLAALS
TTGLKGLGSA QAAGLTTAQV AALTTAQVGQ LSSAALKGLG TAQIVALTTA QAAALGSTQV
AGLSTAQVAA LETADLAMLS TAGVKALSST QVGALTTGQV AALTTAQAAQ LSTAALKGLS
STQVAALTTG QVAALTTAQL GALTTAALKG VTTAQVVALT TAQAAGLGSA LLAGLSSTQI
AAIETADLAA LSTTGLKGLG SAQAAGLTTA QVAAFTTAQV GQLSTAALKG LGTAQVVALT
TGQAGALGSA QVAGLSTAQV AALETADVAA LSTAGVKGVG SAQAAALGSA QVAALTTAQV
GQLSTTALKG FGSVQASGLT TAQVAALTTA QLSQLSTAAV KGLGTAQIVA LTTGQTAALG
SAQLGALSTA QVAAFETADA AALTTTALKG LTTAQVVALT TGQAAALGSA QVAGLSSTQI
AALETADLAA LTTTAVKGLG STQVSSLTTG QVAALTTVQV AALSTAAVKG VGSVQASGLT
TAQVAALTTA QVAQLSTAAL KGLGTAQIVA LTTAQAAKLG SDQVAALSTA QVAALETADL
ATLSATGVKG FGSAQAAALG SAQVAAFTTA QVAALTTAAV KGFGSVQASG LTTAQVAALT
TAQLSQLSTA AVKGLGTAQI VALTTGQTAA LGSAQLGALS TAQVAAFETA DAAALTTTAL
KGLTTAQVVA LTTGQAAALG SVQVAGLTTA QMAALETADL AALTTTAVKG ITTAQMGALT
TGQVAALTTA QVAALAGTAV KGLSSTQAGA LTTAQVAALT TAQVPQLSTA ALKGLGSAQI
VALTTAQAAV LGSAQLGALS TVQVAALETV DLAALTTAAV KGLGSAQVAG LTTGQVAALT
TAQMAQLSTA AIAGLGSVQA SGLTTGQVAA LTTDQLARIT TAAVKGLGTA QIVALTTAQA
ATLGSAQLAA LSTAQVAAFE TADVAALTTA AVKGFGTAQV AALTTGQAAA LGSRQVGALS
TAQVAALETA DLAALTTAAV KGLGSAQAKV LTAAQMAALT SAQVAALTTT AVAGFGSVQA
AALTTAQMTA LTTAQIPTLT TAAIKGLETA DIAALTTTQA SAFTATQLGA MSSAQIAALF
L