Gene Caul_3697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3697 
SymboluvrA 
ID5901153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3989747 
End bp3992656 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content67% 
IMG OID641564208 
Productexcinuclease ABC subunit A 
Protein accessionYP_001685322 
Protein GI167647659 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAC AACTGAACTT CATCCGCGTG CGCGGCGCGC GGGAGCACAA TCTCAAGGAC 
GTCAGCGTCG ACATCCCCCG GGGCGAGCTC GTCGTGCTGA CCGGGCTGTC GGGCTCGGGC
AAGAGCTCGC TGGCGTTCGA CACCATCTAC GCCGAGGGCC AGCGGCGCTA CGTCGAGAGC
CTGTCGGCCT ATGCCCGCCA GTTCCTGGAA CTGATGAGCA AGCCAGACGT CGATCTGATC
GAGGGCCTGT CGCCGGCCAT CTCGATTGAG CAGAAGACCA CCTCGCGCAA CCCGCGCTCG
ACGGTGGGCA CGGTCACCGA GATCCACGAC TACATGCGCC TGCTGTGGGC GCGGGTGGGC
GTGCCTTACA GCCCCGCCAC CGGCCTGCCG ATCGAGAGCC AGACCATCAG CCAGATGGTC
GACAAGATCA CCGCCCTGCC GGAAGGCACC CGTCTCTACC TGCTGGCCCC CGTGGTCCGC
GACCGCAAGG GCGAGTACCG CAAGGAGATC GCCGAGTGGC AGAAGACCGG CTTCCAGCGG
CTGAAGATCG ACGGCGAGTT CTACCCGATC GAGGACGCCC CCGTCCTCGA CAAGAAGTTC
AAGCACGACA TCGACGTGGT GGTGGACCGC ATCGTCACCA AGGCCGGCAT GGAGCAGCGC
CTGGCCGACT CGATGGAGCA GGCCCTGCGC CTGGCCGACG GTCTGGCCGT GGCCGAGTGG
GCCAATGTCG AGGAGGGCGA GAAGGAGCCG CGCCGGCTGC TGTTCTCCGA ACGCTTCGCC
TGCCCGGTCA GCGGCTTCAC GATCGCCGAG ATCGAGCCGC GCCTGTTCTC GTTCAACAAC
CCGGCGGGCG CCTGCCCGGC CTGCGACGGC CTGGGCGCCA AGCTGGCCTT CGACGCCGAC
CTGATCGTTC CGGACAAGGA CAAGACCCTG CACAAGGGCG CGGTGGCCCC GTGGTCGCGT
GGCCCCTCGC CGCTCTACAC CCAGACCCTG CAGGCCCTGG CCCGCCACTA CGGCTTCTCG
ATGGACGAGC CCTGGCACAA GCTGCCGGAA AGCGCGCGCC AGGTGGTGCT GCAAGGCTCC
AAGGGCGAGA AGATCAAGTT CGTCTATGAC GACAACGCCC GCAAATACGA GGTGGCCAAG
CCTTTCGAGG GCGTGCTGCC GAACCTGGAG CGCCGCTGGC GCGAGACGGA TTCCAGTTGG
GTCCGCGAAG AGCTGGGCCG CTACCAGTCC GACACCCCTT GCGAGGTCTG CCACGGCAAG
CGCCTGAAGC CCGAAGCCCT GGCCGTGAAG ATCGCCGGCA TGGACATCGC CCAGGTGTCG
ATGCTGGCCA TTCGCCCGGC CAAGGAGTGG TTCGCGGGCC TGGAGGCCCA GTTCACCGAC
AAGCAGATGG AGATCGCCCG GCGGATCCTG AAGGAGATCA ACGACCGCCT GCGGTTCCTG
GTCGACGTCG GCCTGGACTA CCTGAACCTG TCGCGCGGCT CTGGCACCCT GTCGGGCGGC
GAGAGCCAGC GCATCCGCCT GGCCAGCCAG ATCGGCTCGG GCCTGACCGG CGTGCTCTAC
GTGCTGGACG AGCCGTCGAT CGGCCTGCAC CAGCGCGACA ACACCCGGCT GCTGCAGTCG
CTGCAGGGCC TGCGCGACCT GGGCAATTCG GTGCTGGTGG TCGAGCACGA CGAGGAAGCC
ATCCTCACCG CCGACTATGT GATCGACATG GGTCCGGCGG CGGGCGTGCA CGGCGGCGAG
ATCGTCGCCG AGGGCAAGCC GGCCGACATC ATGGCCAACC CCAACAGCAT CACCGGCCAA
TATCTGACTG GCGTGCGCGA GATCGCCGTC CCGGAAGAGC GCCGGCCGAT CAACAAGAAG
AAGATGCTGA AGGTCATCGG GGCCACCGGC AACAACCTGA AGGGCGTCAC GGGCGAGATC
CCGGTCGGGA CCTTCACCTG CATCACGGGC GTGTCGGGCG GCGGCAAGTC GACCTTCACG
ATCGAGACCC TGTACAAGGC CGCCGCGCGC CGTCTGAACA ACGCCAGCGA CGCGCCCGCT
GCTCACGAGC GGATCGAGGG GCTGGAGAAC TTCGACAAGG TCATCGACAT CGACCAGTCG
CCGATCGGCC GCACCCCGCG CTCGAACCCG GCCACCTATA CCGGCGCCTT CGGGCCGATC
CGCGACTGGT TCGCCCAATT GCCGGAGTCC AAGGTGCGCG GCTACGGCCC GGGCCGCTTC
AGCTTCAACG TCAAGGGCGG GCGCTGCGAG GCCTGCCAGG GCGACGGGCT GATCAAGATC
GAGATGCACT TCCTGCCCGA CGTCTACGTC ACCTGCGATA TCTGCAAGGG CAAGCGCTAC
AACCGCGAGA CCCTCGACAT CCTGTTCAAG GGCAAGACCA TCGCCGACGT GCTGGACATG
ACGGTGGAAG AGGCCGCCGA CTTCTTCAAG GCCGTGCCGC CGATCCGCGA CAAGATGGAG
ACCCTCAAGC GGGTGGGCCT CGGCTATATC AAGGTCGGCC AGCAGGCCAC GACCCTGTCG
GGCGGCGAGG CCCAGCGCGT GAAGCTGTCC AAGGAGCTCT CCAAGCGCGC CACTGGCCGC
ACGCTCTATA TCCTCGACGA GCCGACCACC GGTCTGCACT TCGAGGACAC CAAGAAGCTG
CTGGAAGTGC TGCACGAACT GGTCGACCAG GGCAACACCG TGGTCGTCAT CGAGCATAAC
CTCGACGTGG TGAAGACCGC CGACTGGCTG CTGGACTTCG GTCCCGAAGG CGGCGACGGC
GGCGGCGAGA TCGTCGCCGT GGGCTCGCCC CAGGATGTGG CCAAGAGCGA GGCCAGCTGG
ACCGGCCGCT ATCTCAAGGA CGTGCTGGCC CGCCACGAAG ACCGGCGGCT GGAGCGCGTC
GCGGCGTTGA AGGCGGCCAA GCGGGCTTAG
 
Protein sequence
MAEQLNFIRV RGAREHNLKD VSVDIPRGEL VVLTGLSGSG KSSLAFDTIY AEGQRRYVES 
LSAYARQFLE LMSKPDVDLI EGLSPAISIE QKTTSRNPRS TVGTVTEIHD YMRLLWARVG
VPYSPATGLP IESQTISQMV DKITALPEGT RLYLLAPVVR DRKGEYRKEI AEWQKTGFQR
LKIDGEFYPI EDAPVLDKKF KHDIDVVVDR IVTKAGMEQR LADSMEQALR LADGLAVAEW
ANVEEGEKEP RRLLFSERFA CPVSGFTIAE IEPRLFSFNN PAGACPACDG LGAKLAFDAD
LIVPDKDKTL HKGAVAPWSR GPSPLYTQTL QALARHYGFS MDEPWHKLPE SARQVVLQGS
KGEKIKFVYD DNARKYEVAK PFEGVLPNLE RRWRETDSSW VREELGRYQS DTPCEVCHGK
RLKPEALAVK IAGMDIAQVS MLAIRPAKEW FAGLEAQFTD KQMEIARRIL KEINDRLRFL
VDVGLDYLNL SRGSGTLSGG ESQRIRLASQ IGSGLTGVLY VLDEPSIGLH QRDNTRLLQS
LQGLRDLGNS VLVVEHDEEA ILTADYVIDM GPAAGVHGGE IVAEGKPADI MANPNSITGQ
YLTGVREIAV PEERRPINKK KMLKVIGATG NNLKGVTGEI PVGTFTCITG VSGGGKSTFT
IETLYKAAAR RLNNASDAPA AHERIEGLEN FDKVIDIDQS PIGRTPRSNP ATYTGAFGPI
RDWFAQLPES KVRGYGPGRF SFNVKGGRCE ACQGDGLIKI EMHFLPDVYV TCDICKGKRY
NRETLDILFK GKTIADVLDM TVEEAADFFK AVPPIRDKME TLKRVGLGYI KVGQQATTLS
GGEAQRVKLS KELSKRATGR TLYILDEPTT GLHFEDTKKL LEVLHELVDQ GNTVVVIEHN
LDVVKTADWL LDFGPEGGDG GGEIVAVGSP QDVAKSEASW TGRYLKDVLA RHEDRRLERV
AALKAAKRA