Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51710 |
Symbol | recB |
ID | 7764010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5261178 |
End bp | 5264858 |
Gene Length | 3681 bp |
Protein Length | 1226 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807989 |
Product | exodeoxyribonuclease V, beta subunit |
Protein accession | YP_002802223 |
Protein GI | 226947150 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) |
TIGRFAM ID | [TIGR00609] exodeoxyribonuclease V, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC CGCCGACCGC CGAGCGCCCG CTGGCGCTAC GCTTCCCGCT GCACGGCAGC CGCCTGATCG AGGCCAGCGC CGGTACCGGC AAGACCTTCA CCATCTCCGC CCTTTACCTG CGCCTGGTGC TCGGCCATGG CGGCGAGGCG GGCTTTTCCC GCGAGTTGCT GCCGCCGGAG ATCCTCGTGG TGACCTTCAC CGACGCCGCC ACCCGCGAGC TGCGCGACCG CATCCGCGCG CGGCTGGTCG AGGCGGCCAA GGTGTTTCGC GAGGAGGCGC CGGGCGACGG TCTGCTGCGG CAGTTGCGCG CCGACTTTTC CGCCGAGCGC TGGCCGGCCT GCGCGCGCCG CCTGGACATC GCCGCGCAGT GGATGGACGA AGCGGCGGTG TCGACCATTC ACGGCTGGTG CCAGCGCATG CTGCGCGAGC ACGCCTTCGA CAGCGGCAGC CTGTTCACCC AGACCCTGGA GACCGACCAC GGCGAACTGC TCGCCGAGGT GGTGCGCGAC TACTGGCGCC AGCACTGCTA CCCGCTCTCC GCTGCGGCGC TGGGCTGGGT GATGGAGCAC TGGGGGCATC CGGATGCGCT GCTCAAGGAC AAGCTGGGCC CGCTGCTCGG CGAGGCGCAG CCGCTGCCGG CCGAGCCGCT CGGTGCGCTG CTGGAGCGGG CGCTGGCCGA GCAGGCCGGC GAACTGGCCC GGCTGAAGGA AGGCTGGAAC GCCTGGGCCG ACGAGCTGGA AGCGTTGCTC GATGCGGCGG TGCAGGCCAA ACGGGTGGAC GGACGCAAGA TCCAGGCGCG CTACTACCGG CCCTGGTGCG AGAAGTTGCG CGCCTGGGCC GTTGGCGACG GGACGGACCT CGACCTCGGC AGCGGCTTCG CCCGCCTGAC CCCCGAGGGC CTGGCCGAGG CCTGGAAGGG CGAGTCGCCG CGCCATCCGG CGTTGGAGGC CATGCGCGCG CTGCCGGCGC GCCTGAGGAC GCTGTCCGGC CCCGACGAGC CGGCCCGGCG CCATGCCGCC GCCTGGGTAA GCCTGCGCTT CGAGCAGGAA AAGCGCCGGC GCGCCGAAAT GGGTTTCGAC GACATGCTGA GCCGCCTCGA CGCCGCCCTG CGGGGCGACA ACGGCGGGCG CCTGGCCGAG CTGATCCGTG GCCAGTTCCC GGTGGCGCTG ATCGACGAGT TCCAGGACAC CGACCCGCTG CAGTACCGCA TCTTCGACCG CATCTACGCC ATCGCGAGCA ACCGCGCCGA CTGCGGCCTG TTCCTGATCG GCGACCCCAA GCAGGCGATC TACGCCTTCC GCGGCGCCGA TATCCATACC TACCTGCGCG CCCGCCAGGC CACCGAGGGC CGCCACTACA ACCTGGAAAC CAACTTCCGC TCCAGCCAGG CGCTGGTTTC CGCGGTCAAT CGGGTATTCC GCCTGGCCGA GCGGCGCGAA GCCGGGCACG GCGCCTTCCT GTTCCGCGAC GGCACGACCA ACCCGCTGCC CTTTCTCGAA GTGGGCGCGC AGGGCCGCGA AGAGGTCTGG CAGATCGGGG GCAATGCGCA GCCGGCACTG AATCTCTGGC ATCTGGAGGA CGAGGCGCCG CTGCCCGGCG GCGCCTACCG CGCGCAGATG GCCGAACGCG CCGCCAGCGA GATCGTCCGC CTGCTGAACC TCGGCCAGCG GGGCGGGGCG GGCTTCGTCC GGGACGGGCG GCTGCAGCCG CTGCGCCCCA GCGATATCGC CATTCTGGTG CGCGACTTCA AGGAGGCCGA GGCGATCCGT GGCCAACTGG CCGCTCGCGG GGTACGCAGC GTCTACCTGT CGGACAAGGA TTCGGTGTTC GCCAGCCCCG AGGCGCGCGA CCTCCTGCTG TGGCTGCGCG CCTGCGCCGA ACCGGACGTC GACCGCCCGC TGCGCGCCGC CCTGGCCAGC CGCAGCCTGG GCCTCGCGCT GGCCGAGCTG GAGGAACTGA ACCGCGACGA GCGCATCTGG GAGAGGCGCG TCATGCAGTT CCGCGGCTAC CGCCAGCGCT GGCAGCGCCA GGGCGTGCTG CCGATGCTGC ACCAACTGCT GCACGACTTC GATCTGCCGC AGCGGCTGAT GGCGCGCCCC GACGGCGAGC GCGCGCTGAC CAACCTGCTG CATCTGGCCG AGCTGCTGCA GCGTGCCGCC GCCGGGCTGG ACGGCGAGCA GGCGCTGATC CGCCATCTGT CCGAGCTGCT GGCCGGCGAG ATCCAGGTCG CCGAGGAGCA GGTGCTGCGC CTGGAGAGCG ACGCGGCGCT GGTGCGGGTG GTGACCATCC ACAAATCCAA GGGTCTGGAG TATCCGCTGG TGTTCCTGCC GTTCGTCTGC GCCTTCCGCG CGGTGGAGGG CGGCAGGCCG CTGTGGGTGT ACGACGGCGA GCGCCGCCGG CTGGTGCTGA CGCCGGGCGA GGACGAGGTG GCGCGTGCCG ACCGCGAGCG CCTGGGCGAG GAACTGCGCC TGCTCTACGT GGCTCTGACC CGCGCCCGCC ACGCCTGCTG GCTGGGCGTC GCCGACCTCT GCGTCGGCAA TGCCAGAAAA TCGCGCCTGC ACGACTCGGC GCTCGGCTAT CTGCTCGGCG GCGCTGTGCC GCTGGAGCGC AGCGCCGCCC TGGCCGACTG GCTCGCGCCG CTGGACGTGG CGGGTGAGAG CGCGGTGCTG GAGGCGCCCG AAGCCAGCGG CGAGCGCTTC GTCGAGCAGG CCGTCGCCCG GAGCAAGCCG CAATGGCGCA CCCCGGCGCG CAGCGCCGCC GAGCACTGGT GGATCGCCTC CTACAGCGCC CTGCGCATCG GCGCCGGCGA GGAAAGGCTG GCCGACAGCC AGATACCGGC CAGCCGCTTC GACGACGCCG CGCCGGACAG CCCGGCGGCA CAGAAGGTCG CCGACGACGA GCGCGAGAGC GTCTTCGTGC CGCTGCGCGC GGGCGAAACG CCGACCCTGC ATCGTTTTCC GCGCGGCCCC AATCCCGGCA CCTTCCTCCA CGGCCTGCTG GAAATGGCCG GCCGCGAGGG CTTCGCCGCG CTGGCCGCCG AGCCCGCGCG GCTGCGCGAG CTGATCGCCC GGCGCTGCCA GTTGCGTGGC ATGACCGAGT GGATCGATCC GCTGGCCGAC TGGCTGCTCG ACCTGCTGAC CAGGCCCCTG GCGCTCGGCG GGGACCGGTG CGTGGCGCTG GTCGGGCTCG GCCAGTACCA GCCCGAACTG GAGTTCTGGT TCGAGGCGCG CGCCGTCGAC GTGCGCCGCA TCGACGAACT GGTCCAGGAG CGGGTGCTGC CCGGCCTGTC GCGTCCGGCA CTGGCGAAGG ACCGGCTCAA CGGCATGTTC AAGGGCTTCA TCGACCTGGC GTTCGAGCAC GGCGGGCGCT ACTACCTGCT CGACTACAAG TCCAACTGGC TGGGCGAGGG CGATGCCGCC TACAGCGCCG AGGCGATGGC GGCGACCGCC GCCAGCCACC GCTACGACCT ACAGTCGGTG CTCTACCTGC TGGCCCTGCA CCGCCAGTTG CGGGCGCGCC TGCCCGGCTA CGACTACGAC GCGCACCTGG GCGGCGCGCT CTGCCTGTTC CTGCGCGGCA GCCGCGCGGC GGGGCAGGGC ATCCATTGGG TGCGCCCGCC ACGCGAGCTG ATCGAGACGC TGGATGCGCT GTTCGAGGGC AAGAAGGAGG CCGAGGCATG A
|
Protein sequence | MSQPPTAERP LALRFPLHGS RLIEASAGTG KTFTISALYL RLVLGHGGEA GFSRELLPPE ILVVTFTDAA TRELRDRIRA RLVEAAKVFR EEAPGDGLLR QLRADFSAER WPACARRLDI AAQWMDEAAV STIHGWCQRM LREHAFDSGS LFTQTLETDH GELLAEVVRD YWRQHCYPLS AAALGWVMEH WGHPDALLKD KLGPLLGEAQ PLPAEPLGAL LERALAEQAG ELARLKEGWN AWADELEALL DAAVQAKRVD GRKIQARYYR PWCEKLRAWA VGDGTDLDLG SGFARLTPEG LAEAWKGESP RHPALEAMRA LPARLRTLSG PDEPARRHAA AWVSLRFEQE KRRRAEMGFD DMLSRLDAAL RGDNGGRLAE LIRGQFPVAL IDEFQDTDPL QYRIFDRIYA IASNRADCGL FLIGDPKQAI YAFRGADIHT YLRARQATEG RHYNLETNFR SSQALVSAVN RVFRLAERRE AGHGAFLFRD GTTNPLPFLE VGAQGREEVW QIGGNAQPAL NLWHLEDEAP LPGGAYRAQM AERAASEIVR LLNLGQRGGA GFVRDGRLQP LRPSDIAILV RDFKEAEAIR GQLAARGVRS VYLSDKDSVF ASPEARDLLL WLRACAEPDV DRPLRAALAS RSLGLALAEL EELNRDERIW ERRVMQFRGY RQRWQRQGVL PMLHQLLHDF DLPQRLMARP DGERALTNLL HLAELLQRAA AGLDGEQALI RHLSELLAGE IQVAEEQVLR LESDAALVRV VTIHKSKGLE YPLVFLPFVC AFRAVEGGRP LWVYDGERRR LVLTPGEDEV ARADRERLGE ELRLLYVALT RARHACWLGV ADLCVGNARK SRLHDSALGY LLGGAVPLER SAALADWLAP LDVAGESAVL EAPEASGERF VEQAVARSKP QWRTPARSAA EHWWIASYSA LRIGAGEERL ADSQIPASRF DDAAPDSPAA QKVADDERES VFVPLRAGET PTLHRFPRGP NPGTFLHGLL EMAGREGFAA LAAEPARLRE LIARRCQLRG MTEWIDPLAD WLLDLLTRPL ALGGDRCVAL VGLGQYQPEL EFWFEARAVD VRRIDELVQE RVLPGLSRPA LAKDRLNGMF KGFIDLAFEH GGRYYLLDYK SNWLGEGDAA YSAEAMAATA ASHRYDLQSV LYLLALHRQL RARLPGYDYD AHLGGALCLF LRGSRAAGQG IHWVRPPREL IETLDALFEG KKEAEA
|
| |