Gene Avin_20100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20100 
SymboluvrB 
ID7760939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1997646 
End bp1999655 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content67% 
IMG OID643804908 
Productexcinuclease ABC subunit B 
Protein accessionYP_002799191 
Protein GI226944118 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAT TTCAGCTGGT GACCCGTTTC CAGCCCGCCG GCGACCAGCC CGATGCCATC 
CGCCAGATGG TCGAGGGGCT GGAGGCCGGC CTGTCGCACC AGACCCTGCT CGGGGTGACC
GGCTCCGGCA AGACCTTCAG CATCGCCAAC GTGATCGCCA GGGTACAGCG TCCGACCCTG
GTGCTGGCGC CCAACAAGAC CCTGGCGGCG CAGCTCTACG GCGAGTTCAA GAGCTTCTTT
CCGCACAACG CGGTGGAGTA CTTCGTTTCC TACTACGACT ACTACCAGCC GGAAGCCTAT
GTGCCGTCCT CCGACACCTA CATCGAGAAG GACGCCTCGA TCAACGACCA CATCGAGCAG
ATGCGCCTGT CGGCGACCAA GGCGCTGATG GAGCGGCCGG ACGCGATCAT CGTCGCCACC
GTCTCGTCGA TCTACGGCCT GGGCGATCCG GCCAGCTACC TGAAGATGGT GCTGCACATC
GACCGCGGCG ACCGACTCGA CCAGCGCGCC CTGCTGCGCC GCCTGGCCGA CCTGCAGTAC
ACCCGCAACG ACCTGGACTT CGCCCGCGCC ACCTTCCGCG TGCGTGGCGA CGTGATCGAC
GTCTTCCCCG CCGAATCCGA GCTGGAGGCG ATCCGCATCG AGCTGTTCGA CGACGAGGTG
GAGAGTCTGG CCGCCTTCGA TCCCTTGACC GGCGAGGTGA TCCGCAAGCT GCCGCGCTTC
ACCTTCTACC CCAAGAGCCA CTACGTCACC CCGCGCGAGA CGCTGCTGGA AGCGGTGGAG
CAGATCAAGG CCGAACTCAA GGAGCGCCTG GAGCACCTGC GCGAGCGCGG CAAGCTGGTG
GAGGCGCAGC GCCTGGAACA GCGCACCCGC TTCGACCTGG AGATGATCCT CGAACTGGGC
TACTGCAACG GCATCGAGAA CTACTCCCGC TACCTCTCCG GCCGTGCGCC GGGGCTGCCG
CCGCCGACCC TCTACGACTA CCTGCCGGAC AACGCGCTGA TGGTGATCGA CGAATCCCAC
GTCACCGTCC CGCAGGTCGG CGCCATGTAC AAGGGCGACC GTTCGCGCAA GGAGACCCTG
GTGGAATACG GCTTCCGCCT GCCGTCGGCG CTGGACAACC GGCCGATGCG CTTCGACGAA
TGGGAGCGCA TCGCCCCGCA GACCATCTTC GTCTCGGCGA CCCCCGGCCC CTACGAAGCC
GAGCACGCCG GACGGGTGAT CGAACAGGTG GTGCGCCCCA CCGGGCTGGT CGACCCCGAG
CTGGAGGTGC GCCCGGCGCT GACCCAGGTG GACGACCTGC TGTCGGAGAT CCGCAAGCGC
GTCGCCGTCG AGGAGCGCGT GCTGGTCACC ACCCTGACCA AGCGCATGGC CGAGGACCTC
ACCGATTACC TCGGCGACCA CGACGTGCGG GTGCGCTACC TGCACTCGGA CATCGACACC
GTGGAGCGCG TGGAGATCAT CCGCGACCTG CGCAGCGGCG CCTTCGACGT GCTGGTGGGC
ATCAATCTGC TGCGCGAGGG GCTGGATATG CCCGAAGTAT CGCTGGTGAC GATTCTCGAC
GCGGACAAGG AAGGTTTCCT GCGCAGCGAA CGCTCGCTGA TCCAGACCAT CGGCCGCGCC
GCGCGCAACC TCAACGGCAA GGCGATTCTC TACGCCGACA GCATCACCGG TTCGATGCGG
CGCGCCATCG ACGAGACCGA GCGGCGCCGG GCCAAGCAGA TCGCCTTCAA CGAAACCCAC
GGCATCGTGC CCAGGGGCGT CAAGAAGGAC ATCCAGGACA TCCTCGAAGG CGCCGTGGTG
CCCGGTGCGC GCGGTCGCAA GCGGGTGGCC AGGGCGGCGG AGGAGAGCGG CCAGTACGCC
GCCGAGCTGC GCTCGCCGAG CGAGATCGAC AAGCGCATCC GTCAGCTCGA GGAGAAGATG
TACGCCTTGG CCCGCGATCT GGAGTTCGAG GCCGCGGCGC GGCTGCGCGA CGAGATCCAG
GCGCTGCGCG AGCGGCGGCT GCAGGTCTGA
 
Protein sequence
MSEFQLVTRF QPAGDQPDAI RQMVEGLEAG LSHQTLLGVT GSGKTFSIAN VIARVQRPTL 
VLAPNKTLAA QLYGEFKSFF PHNAVEYFVS YYDYYQPEAY VPSSDTYIEK DASINDHIEQ
MRLSATKALM ERPDAIIVAT VSSIYGLGDP ASYLKMVLHI DRGDRLDQRA LLRRLADLQY
TRNDLDFARA TFRVRGDVID VFPAESELEA IRIELFDDEV ESLAAFDPLT GEVIRKLPRF
TFYPKSHYVT PRETLLEAVE QIKAELKERL EHLRERGKLV EAQRLEQRTR FDLEMILELG
YCNGIENYSR YLSGRAPGLP PPTLYDYLPD NALMVIDESH VTVPQVGAMY KGDRSRKETL
VEYGFRLPSA LDNRPMRFDE WERIAPQTIF VSATPGPYEA EHAGRVIEQV VRPTGLVDPE
LEVRPALTQV DDLLSEIRKR VAVEERVLVT TLTKRMAEDL TDYLGDHDVR VRYLHSDIDT
VERVEIIRDL RSGAFDVLVG INLLREGLDM PEVSLVTILD ADKEGFLRSE RSLIQTIGRA
ARNLNGKAIL YADSITGSMR RAIDETERRR AKQIAFNETH GIVPRGVKKD IQDILEGAVV
PGARGRKRVA RAAEESGQYA AELRSPSEID KRIRQLEEKM YALARDLEFE AAARLRDEIQ
ALRERRLQV