Gene Rsph17029_2735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2735 
SymboluvrC 
ID4897201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2876158 
End bp2878029 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content68% 
IMG OID640113337 
Productexcinuclease ABC subunit C 
Protein accessionYP_001044609 
Protein GI126463495 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.296974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0935486 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGATA ACGAGACGGA CGGCCGCGAT GTGCCGACCG GACATGCGGT GATTCAGGGC 
TACCTGAAGA CGCTCGACGG CTCGCCGGGC GTCTACCGGA TGCTCGATGC CCAGAGCCAG
GTGCTCTATG TGGGAAAGGC GCGCAACCTC CGGGCGAGGG TGTCGAACTA CGCGCGCCCC
TCGGGCCATT CCGGTCGCAT CGCCCGGATG ATCCGCGAGA CCGCCTCGAT GATGTTTCTC
ACAACGAGAA CGGAGACCGA GGCCCTCCTG CTCGAACAGA ACCTCATCAA GCAGCTCAAG
CCGCGCTACA ACGTGCTCCT GCGCGACGAC AAGAGTTTCC CGAACATCCT GATCGCCAAG
GACCATCCCT TCCCGATGCT CAAGAAGCAC CGGGGCAAGA AGTCCGAGAA GGGCAGCTAC
TTCGGTCCCT TCGCCAGCGC CGGCGCCGTG AACCGGACGC TGAACCAGCT TCAGCGGGTC
TTTCTTCTGC GCACCTGCTC GGATGCCACC TTCGAAAGCC GCACTCGGCC CTGCCTTCTC
TTCCAGATCA AGCGATGCTC GGCCCCCTGC GTGGGCCGGG TACCGGCCGA GGATTATGCC
GAGCTGATCG GCGATGCCGA ACGCTTCCTG CAGGGGCGGA CGACGAAGGT TCAGGCCAAC
CTCGCGGAGC AGATGCAGGC GGCCTCGGAG GCCATGGAAT TCGAGCGCGC CGCGGCTCTC
CGCGACCGGA TCAAGGCGCT GACGCAGGTG CAGTCCTCGC AGGGGATTAA CCCCCGCGGC
GTGGCCGAGG CCGATGTGAT CGCCGTCCAT CTGGAGGGCG GTCAGGCCTG CGTGCAGGTC
TTCTTCATCC GGGCGAACCA GAGCTGGGGC AACCGCGACT TCTTCCCCCG CACCGGCGCC
GGGGCGGAAG AACCGGAGAT TCTCGAGGCC TTCCTCGCGC AGTTCTACGA CGACAAGGAG
CCGCCGCGGA TGATCCTGCT CTCGCATCCG GTGGACAATC CCGATCTCGT GGGGCAGCTC
CTGTCCGAGA GGGCGGGGCG CAAGGTGACC CTCGGGGTGC CGCAGCGGGG CGAGAAGGCC
GAGCTGGTCG AGAATGCGGC GCGCAATGCG CGCGAAAGCC TCGCCCGCCG CATGGCCGAG
AGCGCCACGC AGAACAGGCT GCTCGCGGGG CTCGCCGAGG CCTTCGAGCT CGATGCGGCC
CCGAAGCGGA TCGAGGTCTA CGACAACTCG CACATCCAGG GCACCAATGC GGTCGGCGGC
ATGATCGTCG CCGGACCCGA GGGTTTCCTG AAAAGCCAGT ATCGCAAGTT CAACATCCGC
GGCGCGGCGG GGGCACAGGG CGACGACTTC GGCATGATGA AGGAGGTGCT GACCCGCCGC
TTCGAGCGCC TGCTGAAGGA GGACCCCGAG CGCAAGACCG ACGCCTGGCC GGATCTTCTG
CTGATCGACG GCGGCGCGGG GCAGGTCTCG GCCGTGCAGG AGATCCTGCA GGAGCTGGGG
GTGGACGACG TGCCCTTCAT CGGCGTGGCC AAGGGCATCG ACCGCGACGC GGGCAAGGAG
GAGTTCCACC GCCCCGGCGA GCCGCCCTTC GCACTGCGCA TGAACGATCC GGTGCTCTAT
TTCGTGCAGC GCCTGCGCGA CGAGGCCCAC CGCTGGGCCA TCGGCGCCCA CCGGGCGAAG
CGGGCCAAGG CCGTCAGCGC CACGCCGCTC GACGAGATCC CGGGGGTGGG GGCCGCGCGC
AAGCGCGCGC TGCTGGCGCA TTTCGGCTCG GCCAAGGCGG TGGCCCGGGC GGGCGTGCCC
GATCTCTGCG CGGTCGAGGG CATCTCGGAG ACGATGGCGC AGTCCATCCA CGACTTTTTC
CACGGGAGCT GA
 
Protein sequence
MQDNETDGRD VPTGHAVIQG YLKTLDGSPG VYRMLDAQSQ VLYVGKARNL RARVSNYARP 
SGHSGRIARM IRETASMMFL TTRTETEALL LEQNLIKQLK PRYNVLLRDD KSFPNILIAK
DHPFPMLKKH RGKKSEKGSY FGPFASAGAV NRTLNQLQRV FLLRTCSDAT FESRTRPCLL
FQIKRCSAPC VGRVPAEDYA ELIGDAERFL QGRTTKVQAN LAEQMQAASE AMEFERAAAL
RDRIKALTQV QSSQGINPRG VAEADVIAVH LEGGQACVQV FFIRANQSWG NRDFFPRTGA
GAEEPEILEA FLAQFYDDKE PPRMILLSHP VDNPDLVGQL LSERAGRKVT LGVPQRGEKA
ELVENAARNA RESLARRMAE SATQNRLLAG LAEAFELDAA PKRIEVYDNS HIQGTNAVGG
MIVAGPEGFL KSQYRKFNIR GAAGAQGDDF GMMKEVLTRR FERLLKEDPE RKTDAWPDLL
LIDGGAGQVS AVQEILQELG VDDVPFIGVA KGIDRDAGKE EFHRPGEPPF ALRMNDPVLY
FVQRLRDEAH RWAIGAHRAK RAKAVSATPL DEIPGVGAAR KRALLAHFGS AKAVARAGVP
DLCAVEGISE TMAQSIHDFF HGS