Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3697 |
Symbol | uvrA |
ID | 5901153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3989747 |
End bp | 3992656 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564208 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001685322 |
Protein GI | 167647659 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAC AACTGAACTT CATCCGCGTG CGCGGCGCGC GGGAGCACAA TCTCAAGGAC GTCAGCGTCG ACATCCCCCG GGGCGAGCTC GTCGTGCTGA CCGGGCTGTC GGGCTCGGGC AAGAGCTCGC TGGCGTTCGA CACCATCTAC GCCGAGGGCC AGCGGCGCTA CGTCGAGAGC CTGTCGGCCT ATGCCCGCCA GTTCCTGGAA CTGATGAGCA AGCCAGACGT CGATCTGATC GAGGGCCTGT CGCCGGCCAT CTCGATTGAG CAGAAGACCA CCTCGCGCAA CCCGCGCTCG ACGGTGGGCA CGGTCACCGA GATCCACGAC TACATGCGCC TGCTGTGGGC GCGGGTGGGC GTGCCTTACA GCCCCGCCAC CGGCCTGCCG ATCGAGAGCC AGACCATCAG CCAGATGGTC GACAAGATCA CCGCCCTGCC GGAAGGCACC CGTCTCTACC TGCTGGCCCC CGTGGTCCGC GACCGCAAGG GCGAGTACCG CAAGGAGATC GCCGAGTGGC AGAAGACCGG CTTCCAGCGG CTGAAGATCG ACGGCGAGTT CTACCCGATC GAGGACGCCC CCGTCCTCGA CAAGAAGTTC AAGCACGACA TCGACGTGGT GGTGGACCGC ATCGTCACCA AGGCCGGCAT GGAGCAGCGC CTGGCCGACT CGATGGAGCA GGCCCTGCGC CTGGCCGACG GTCTGGCCGT GGCCGAGTGG GCCAATGTCG AGGAGGGCGA GAAGGAGCCG CGCCGGCTGC TGTTCTCCGA ACGCTTCGCC TGCCCGGTCA GCGGCTTCAC GATCGCCGAG ATCGAGCCGC GCCTGTTCTC GTTCAACAAC CCGGCGGGCG CCTGCCCGGC CTGCGACGGC CTGGGCGCCA AGCTGGCCTT CGACGCCGAC CTGATCGTTC CGGACAAGGA CAAGACCCTG CACAAGGGCG CGGTGGCCCC GTGGTCGCGT GGCCCCTCGC CGCTCTACAC CCAGACCCTG CAGGCCCTGG CCCGCCACTA CGGCTTCTCG ATGGACGAGC CCTGGCACAA GCTGCCGGAA AGCGCGCGCC AGGTGGTGCT GCAAGGCTCC AAGGGCGAGA AGATCAAGTT CGTCTATGAC GACAACGCCC GCAAATACGA GGTGGCCAAG CCTTTCGAGG GCGTGCTGCC GAACCTGGAG CGCCGCTGGC GCGAGACGGA TTCCAGTTGG GTCCGCGAAG AGCTGGGCCG CTACCAGTCC GACACCCCTT GCGAGGTCTG CCACGGCAAG CGCCTGAAGC CCGAAGCCCT GGCCGTGAAG ATCGCCGGCA TGGACATCGC CCAGGTGTCG ATGCTGGCCA TTCGCCCGGC CAAGGAGTGG TTCGCGGGCC TGGAGGCCCA GTTCACCGAC AAGCAGATGG AGATCGCCCG GCGGATCCTG AAGGAGATCA ACGACCGCCT GCGGTTCCTG GTCGACGTCG GCCTGGACTA CCTGAACCTG TCGCGCGGCT CTGGCACCCT GTCGGGCGGC GAGAGCCAGC GCATCCGCCT GGCCAGCCAG ATCGGCTCGG GCCTGACCGG CGTGCTCTAC GTGCTGGACG AGCCGTCGAT CGGCCTGCAC CAGCGCGACA ACACCCGGCT GCTGCAGTCG CTGCAGGGCC TGCGCGACCT GGGCAATTCG GTGCTGGTGG TCGAGCACGA CGAGGAAGCC ATCCTCACCG CCGACTATGT GATCGACATG GGTCCGGCGG CGGGCGTGCA CGGCGGCGAG ATCGTCGCCG AGGGCAAGCC GGCCGACATC ATGGCCAACC CCAACAGCAT CACCGGCCAA TATCTGACTG GCGTGCGCGA GATCGCCGTC CCGGAAGAGC GCCGGCCGAT CAACAAGAAG AAGATGCTGA AGGTCATCGG GGCCACCGGC AACAACCTGA AGGGCGTCAC GGGCGAGATC CCGGTCGGGA CCTTCACCTG CATCACGGGC GTGTCGGGCG GCGGCAAGTC GACCTTCACG ATCGAGACCC TGTACAAGGC CGCCGCGCGC CGTCTGAACA ACGCCAGCGA CGCGCCCGCT GCTCACGAGC GGATCGAGGG GCTGGAGAAC TTCGACAAGG TCATCGACAT CGACCAGTCG CCGATCGGCC GCACCCCGCG CTCGAACCCG GCCACCTATA CCGGCGCCTT CGGGCCGATC CGCGACTGGT TCGCCCAATT GCCGGAGTCC AAGGTGCGCG GCTACGGCCC GGGCCGCTTC AGCTTCAACG TCAAGGGCGG GCGCTGCGAG GCCTGCCAGG GCGACGGGCT GATCAAGATC GAGATGCACT TCCTGCCCGA CGTCTACGTC ACCTGCGATA TCTGCAAGGG CAAGCGCTAC AACCGCGAGA CCCTCGACAT CCTGTTCAAG GGCAAGACCA TCGCCGACGT GCTGGACATG ACGGTGGAAG AGGCCGCCGA CTTCTTCAAG GCCGTGCCGC CGATCCGCGA CAAGATGGAG ACCCTCAAGC GGGTGGGCCT CGGCTATATC AAGGTCGGCC AGCAGGCCAC GACCCTGTCG GGCGGCGAGG CCCAGCGCGT GAAGCTGTCC AAGGAGCTCT CCAAGCGCGC CACTGGCCGC ACGCTCTATA TCCTCGACGA GCCGACCACC GGTCTGCACT TCGAGGACAC CAAGAAGCTG CTGGAAGTGC TGCACGAACT GGTCGACCAG GGCAACACCG TGGTCGTCAT CGAGCATAAC CTCGACGTGG TGAAGACCGC CGACTGGCTG CTGGACTTCG GTCCCGAAGG CGGCGACGGC GGCGGCGAGA TCGTCGCCGT GGGCTCGCCC CAGGATGTGG CCAAGAGCGA GGCCAGCTGG ACCGGCCGCT ATCTCAAGGA CGTGCTGGCC CGCCACGAAG ACCGGCGGCT GGAGCGCGTC GCGGCGTTGA AGGCGGCCAA GCGGGCTTAG
|
Protein sequence | MAEQLNFIRV RGAREHNLKD VSVDIPRGEL VVLTGLSGSG KSSLAFDTIY AEGQRRYVES LSAYARQFLE LMSKPDVDLI EGLSPAISIE QKTTSRNPRS TVGTVTEIHD YMRLLWARVG VPYSPATGLP IESQTISQMV DKITALPEGT RLYLLAPVVR DRKGEYRKEI AEWQKTGFQR LKIDGEFYPI EDAPVLDKKF KHDIDVVVDR IVTKAGMEQR LADSMEQALR LADGLAVAEW ANVEEGEKEP RRLLFSERFA CPVSGFTIAE IEPRLFSFNN PAGACPACDG LGAKLAFDAD LIVPDKDKTL HKGAVAPWSR GPSPLYTQTL QALARHYGFS MDEPWHKLPE SARQVVLQGS KGEKIKFVYD DNARKYEVAK PFEGVLPNLE RRWRETDSSW VREELGRYQS DTPCEVCHGK RLKPEALAVK IAGMDIAQVS MLAIRPAKEW FAGLEAQFTD KQMEIARRIL KEINDRLRFL VDVGLDYLNL SRGSGTLSGG ESQRIRLASQ IGSGLTGVLY VLDEPSIGLH QRDNTRLLQS LQGLRDLGNS VLVVEHDEEA ILTADYVIDM GPAAGVHGGE IVAEGKPADI MANPNSITGQ YLTGVREIAV PEERRPINKK KMLKVIGATG NNLKGVTGEI PVGTFTCITG VSGGGKSTFT IETLYKAAAR RLNNASDAPA AHERIEGLEN FDKVIDIDQS PIGRTPRSNP ATYTGAFGPI RDWFAQLPES KVRGYGPGRF SFNVKGGRCE ACQGDGLIKI EMHFLPDVYV TCDICKGKRY NRETLDILFK GKTIADVLDM TVEEAADFFK AVPPIRDKME TLKRVGLGYI KVGQQATTLS GGEAQRVKLS KELSKRATGR TLYILDEPTT GLHFEDTKKL LEVLHELVDQ GNTVVVIEHN LDVVKTADWL LDFGPEGGDG GGEIVAVGSP QDVAKSEASW TGRYLKDVLA RHEDRRLERV AALKAAKRA
|
| |