Gene Caul_4277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4277 
SymboluvrC 
ID5901738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4646569 
End bp4648473 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content68% 
IMG OID641564796 
Productexcinuclease ABC subunit C 
Protein accessionYP_001685896 
Protein GI167648233 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.471865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCG TGAACGACGC GACCACCGAC ATTCCCGCCG CCAACGTCCT GAAGGGCGCC 
GCCCTGATCA AGGACGAGGT TTCGCGCCTG CCTGACAGTC CGGGCGTCTA CCGGATGATC
GGCGAGGACG ACGAGGTCCT CTATGTCGGC AAGGCGAAAA GCCTGAAGAA GCGCGTCGTC
CAGTACGCCC AGGGCCGCTT TCACACCAAC CGCATCGCCC ACATGGTCGA CGCCACGCGG
TCGATGGAGT TCGTCACCAC CCGCACCGAA GCCGACGCCC TGCTGCTCGA GATCAACCTG
ATCAAGTCGC TGAAGCCGCG TTTCAACGTG CTGCTGCGCG ACGACAAGAG CTTTCCCGAG
ATCATGATCC GCCGCGATCA CGACGCGCCC CAGCTGCGCA AGCACCGCGG CGCCCACACC
ATCAAGGGCG ACTATTTCGG GCCGTTCGCC AGCGCCTGGG CGGTGAACCG AACGCTGAAC
ACCCTGCAGA AGGCGTTCCT GCTGCGCTCG TGCAGCGACA GCGTCTACGA AACCCGCTCA
CGGCCCTGCA TGCTGCACCA GATCAAGCGC TGCGCCGCGC CCTGCACCGG CCTGATCGGC
AAGGACGACT ACCAGGCGCT GGTCGACCAG GCCGAGGATT TCCTGCGCGG CAAGTCCCGG
GCGGTGATGG CGACCATGGC CAAGGCGATG GAGGAAGCGG CGGAGGACCT GGAGTTCGAA
CGCGCCGCCC GGCTGCGCGA CCGGATCCGC GCCCTGGCCG CCGTGGCCCA GGAGAGCCAG
ATCAATCCCG AGACCGTGGA CGAGGCCGAC GTCGTCGCCC TGCACATCGA GGGCGGCCAG
GCCTGCGTGC AGGTGTTCTT CTTCCGGGCC GGCCAGAACT GGGGCAACCG CGCCTATTTC
CCCCGCATCA CGGGCGCGGC GGAGGAGGAA GGGGTCAGCG AGGAAGCCCA GGCGATGACC
GCGTTCCTGG GCCAGTTCTA TGACGACAAG CCGATCCCGC GGCTGATCCT GACCAATATC
GAGCCGGCCG AGCGCGACCT GCTGGCCGAG GCCTTCTGCC TGAAGAGCGG CCGCAAGGTC
GAGATCGCCA CCCCCAAGCG CGGCGAGAAG GCCGACCTGG TGGGCCACGC CCTGACCAAC
GCCCGCGAGG CGCTGGGCCG GAAAATGGCC GAGGGCAGCG CCCAGACCAA GCTGCTGGCC
GGGGTCGGCG AGGCGTTCGG CCTGGACGGG CCGCCCGAGC GGATCGAGGT CTACGACAAC
AGCCACATCC AGGGCACCAA CGCCGTCGGT GGGATGATCG TGGCCGGCCC CGAGGGCTTC
ATGAAGGGCC AGTACCGCAA GTTCAACATC AAGAGCACCG AGCTCACCCC CGGCGACGAC
TACGGCATGA TGAAGGAGGT GCTGAAGCGC CGCTTCGCCC GCCTGGTCAA GGAAGAGGAA
GAGGGCGACA GCAGCGCCCG CCCCGACCTG GTGCTGGTCG ACGGCGGCAA GGGCCAGCTG
GACGCGGTGA TCGAGGTGAT GGCCGACCTG GGCGTCGATG ACATCGCCGT GGTGGGCGTG
GCCAAGGGGC CGGACCGCGA CGCCGGCCTG GAGCGGTTCT TCATGCCGGA CAAGACGCCG
TTCATGCTCG AGCCCAAATC GCCGGTGCTC TACTACCTGC AGCGCCTGCG CGACGAGGCC
CACCGCTTCG CGATCGGGGC GCACCGCACG CGCCGCTCGA TGGACCTGAA GAAGAACCCG
CTCGACGAGA TCGAGGGCGT GGGTCCCGGC CGCAAACGAG CCCTGCTGAA CGCCTTCGGT
TCGGCCAAGG GCGTATCCCG GGCGGGGGTC GAGGACCTGA TGAAGGTGGA GGGCGTGAGC
CAACCGCTGG CCGAGCGGAT CCACGCGTTT TTCAGGAAGA GCTGA
 
Protein sequence
MSPVNDATTD IPAANVLKGA ALIKDEVSRL PDSPGVYRMI GEDDEVLYVG KAKSLKKRVV 
QYAQGRFHTN RIAHMVDATR SMEFVTTRTE ADALLLEINL IKSLKPRFNV LLRDDKSFPE
IMIRRDHDAP QLRKHRGAHT IKGDYFGPFA SAWAVNRTLN TLQKAFLLRS CSDSVYETRS
RPCMLHQIKR CAAPCTGLIG KDDYQALVDQ AEDFLRGKSR AVMATMAKAM EEAAEDLEFE
RAARLRDRIR ALAAVAQESQ INPETVDEAD VVALHIEGGQ ACVQVFFFRA GQNWGNRAYF
PRITGAAEEE GVSEEAQAMT AFLGQFYDDK PIPRLILTNI EPAERDLLAE AFCLKSGRKV
EIATPKRGEK ADLVGHALTN AREALGRKMA EGSAQTKLLA GVGEAFGLDG PPERIEVYDN
SHIQGTNAVG GMIVAGPEGF MKGQYRKFNI KSTELTPGDD YGMMKEVLKR RFARLVKEEE
EGDSSARPDL VLVDGGKGQL DAVIEVMADL GVDDIAVVGV AKGPDRDAGL ERFFMPDKTP
FMLEPKSPVL YYLQRLRDEA HRFAIGAHRT RRSMDLKKNP LDEIEGVGPG RKRALLNAFG
SAKGVSRAGV EDLMKVEGVS QPLAERIHAF FRKS