Gene EcolC_3969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3969 
SymboluvrA 
ID6064504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4359981 
End bp4362803 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content56% 
IMG OID641603382 
Productexcinuclease ABC subunit A 
Protein accessionYP_001726897 
Protein GI170021943 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0893236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAACAT CAACCTCGTT 
ATCCCCCGCG ACAAGCTCAT TGTCGTGACC GGGCTTTCGG GTTCTGGCAA ATCCTCGCTC
GCTTTCGACA CCTTATATGC CGAAGGGCAG CGCCGTTACG TTGAATCCCT TTCCGCCTAC
GCGCGGCAGT TTCTGTCACT GATGGAAAAG CCGGACGTCG ATCATATTGA GGGGCTTTCT
CCTGCCATCT CAATTGAGCA GAAATCGACG TCTCATAACC CGCGTTCTAC GGTGGGGACA
ATCACCGAAA TCCACGACTA TTTGCGTTTG TTGTTCGCCC GCGTTGGCGA GCCGCGCTGT
CCGGACCACG ACGTCCCGCT GGCGGCGCAA ACCGTCAGCC AGATGGTGGA TAACGTGCTG
TCGCAGCCGG AAGGCAAGCG TCTGATGCTA CTCGCGCCAA TCATTAAAGA GCGCAAAGGC
GAACACACCA AAACGCTGGA GAACCTGGCA AGCCAGGGCT ACATCCGTGC TCGTATTGAT
GGCGAAGTCT GCGATCTTTC CGATCCGCCA AAACTGGAAC TGCAAAAGAA ACATACCATT
GAAGTGGTGG TTGATCGCTT CAAGGTGCGT GACGATCTTA CCCAACGTCT TGCCGAGTCA
TTTGAAACCG CGCTGGAGCT TTCCGGTGGT ACCGCGGTAG TGGCGGATAT GGACGACCCG
AAAGCGGAAG AGCTGCTGTT CTCCGCCAAC TTCGCCTGCC CAATTTGCGG CTACAGTATG
CGTGAACTGG AGCCGCGACT GTTTTCGTTT AACAACCCGG CGGGGGCCTG CCCGACCTGC
GACGGCCTTG GCGTACAGCA ATATTTCGAT CCTGATCGAG TGATCCAGAA TCCGGAACTG
TCGCTGGCTG GTGGTGCGAT CCGTGGCTGG GATCGCCGCA ACTTCTATTA TTTCCAGATG
CTGAAATCGC TGGCAGATCA CTATAAGTTC GACGTCGAAG CGCCGTGGGG CAGCCTGAGC
GCGAACGTGC ATAAAGTGGT GTTGTACGGT TCTGGCAAAG AAAACATTGA ATTCAAATAC
ATGAACGATC GTGGCGATAC CTCCATTCGT CGTCATCCGT TCGAAGGCGT GCTGCATAAT
ATGGAGCGCC GCTATAAAGA GACGGAATCC AGCGCGGTAC GCGAAGAATT AGCCAAGTTT
ATCAGTAATC GTCCGTGCGC CAGCTGCGAA GGGACGCGTC TGCGTCGGGA AGCGCGCCAC
GTGTATGTCG AGAATACGCC GCTGCCTGCT ATCTCCGACA TGAGCATTGG TCATGCGATG
GAATTCTTCA ACAATCTCAA ACTCGCAGGT CAGCGGGCGA AGATTGCAGA AAAAATCCTT
AAAGAGATCG GCGATCGTCT GAAATTCCTC GTTAACGTCG GCCTGAATTA CCTGACGCTT
TCCCGCTCGG CAGAAACGCT TTCTGGCGGT GAAGCACAGC GTATCCGTCT GGCGAGCCAG
ATTGGTGCGG GCCTGGTTGG CGTTATGTAC GTGCTGGACG AGCCGTCTAT CGGCCTGCAC
CAGCGTGATA ACGAGCGCCT GTTGGGTACG CTTATCCATC TGCGCGATCT CGGTAATACC
GTGATTGTGG TGGAGCACGA CGAAGACGCA ATTCGCGCCG CTGACCATGT GATCGACATT
GGCCCGGGCG CAGGTGTTCA CGGCGGTGAA GTGGTCGCAG AAGGTCCGCT GGAAGCGATT
ATGGCGGTGC CGGAGTCGTT GACCGGGCAG TACATGAGCG GCAAACGCAA GATTGAAGTG
CCGAAGAAAC GCGTTCCGGC GAATCCGGAA AAAGTGCTGA AGCTGACAGG CGCACGCGGC
AACAACCTGA AGGACGTGAC GCTGACGCTG CCGGTGGGTC TGTTTACCTG CATCACCGGG
GTTTCAGGTT CCGGTAAATC GACGCTGATT AACGACACGC TGTTCCCTAT TGCCCAACGC
CAGTTGAATG GTGCGACCAT CGCCGAACCG GCACCGTATC GCGATATTCA GGGGCTGGAG
CATTTCGACA AAGTGATCGA TATCGACCAA AGCCCAATTG GTCGTACTCC GCGTTCTAAC
CCGGCGACCT ATACCGGCGT GTTTACACCT GTGCGCGAAC TTTTTGCGGG CGTACCGGAA
TCCCGTGCGC GTGGTTATAC GCCAGGACGT TTCAGCTTTA ACGTCCGTGG CGGACGCTGC
GAAGCCTGTC AGGGCGACGG TGTGATCAAA GTGGAGATGC ACTTCCTGCC GGACATTTAC
GTACCGTGCG ACCAGTGTAA AGGTAAACGC TATAACCGTG AAACGCTGGA AATTAAGTAC
AAAGGCAAAA CCATCCACGA AGTGCTGGAT ATGACCATCG AAGAGGCGCG TGAGTTCTTT
GATGCCGTAC CTGCACTGGC GCGTAAGCTG CAAACGTTGA TGGACGTTGG CCTGACGTAC
ATTCGCCTGG GGCAGTCCGC AACCACCCTT TCTGGTGGTG AAGCCCAGCG CGTGAAGCTG
GCGCGTGAGC TGTCAAAACG CGGCACCGGG CAGACACTGT ATATTCTCGA CGAGCCGACC
ACCGGTTTGC ACTTCGCCGA TATTCAGCAA CTGCTCGACG TGCTGCATAA ACTGCGCGAT
CAGGGCAATA CCATTGTGGT AATTGAGCAC AATCTCGACG TGATTAAAAC CGCTGACTGG
ATTGTCGACC TGGGACCGGA AGGCGGCAGT GGCGGCGGCG AGATCCTCGT CTCCGGTACG
CCAGAAACCG TCGCGGAGTG CGAAGCTTCG CATACGGCGC GCTTCCTCAA GCCGATGCTG
TAA
 
Protein sequence
MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVVDRFKVR DDLTQRLAES FETALELSGG TAVVADMDDP
KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPEL
SLAGGAIRGW DRRNFYYFQM LKSLADHYKF DVEAPWGSLS ANVHKVVLYG SGKENIEFKY
MNDRGDTSIR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLRREARH
VYVENTPLPA ISDMSIGHAM EFFNNLKLAG QRAKIAEKIL KEIGDRLKFL VNVGLNYLTL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRDLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV
PKKRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR
QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE
SRARGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY
KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL
ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHKLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML