Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1631 |
Symbol | uvrA |
ID | 3905910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1960338 |
End bp | 1963268 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637878969 |
Product | excinuclease ABC subunit A |
Protein accession | YP_480736 |
Protein GI | 86740336 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.978501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATC GTCTTGTCGT GCGTGGCGCG CGTGAACACA ATCTCCGTGA CGTCGACCTC GACCTGCCCC GCGACGGGCT GATCGTCTTC ACCGGTATGT CCGGGTCCGG CAAGTCCAGC CTCGCCTTCG ACACAATCTT CGCCGAGGGG CAGCGGCGCT ACGTGGAGTC GCTGTCGGCC TATGCGCGGC AGTTCCTCGG GCAGATGGAC AAGCCCGACG TCGACTTCAT CGAGGGGCTG TCTCCGGCCG TCTCCATCGA TCAGAAGTCG ACCTCGCGCA ACCCGCGGTC CACCGTCGGC ACGATCACCG AGGTCTACGA CTACCTGCGG CTGCTCTTCG CCCGCGCCGG CCGTCCGCAC TGCCCGAAGT GCGGGCGGCT GATCTCCCGG CAGACCCCGC AGCAGATCGT CGACCGGCTG CTGGAGTTGC CCGAAGGCAC CCGCTTCCAG GTGCTGGCCC CGGTGGTGCG TGGCCGCAAG GGCGAGTACG TCGACCTGTT CGCCGAGCTG CAGAGCTCCG GGTTCGCACG GGCCCGCGTC GACGGCACGG TGGTGCCGCT CACCGATCCG CCGAAGCTGG AGAAGCAGCG CAAGCATACG ATCGAGGTCG TCGTCGATCG GCTCGCGATC AAGTCGTCAG CGAAGCGGCG GCTCACCGAC TCGGTGGAGA CGGCCCTGAA GCTGGGCGGC GGGCTCGTCC TGGTTGACTT CGTCGACCGC GACGCCGACG ATCCCGAGCG CGAGCGGATG TACTCCGAGC ATCTGGCCTG CCTCTACGAC GACCTCTCCT TTGAGGAGCT GGAGCCGCGG TCGTTCTCGT TCAACTCGCC GTACGGCGCC TGCCCCGAGT GCACCGGGAT CGGGACCCGC AAGGAGGTCG ACCCGGAGCT CGTCGTGCCC GACCCCACCC TGTCGCTGGC GCAGGGCGCG GTGGCCCCCT GGTCGGGCGG CCACAACAAG GAGTACTTCG AACGGTTGCT CACGGCCCTC GCCGAGGACC TCAGCTTCAG GATGGACACC CCGTGGGAGG GGCTGCCCGA GCGGGCCCGC AAGGCCGTGC TGTACGGCAG CGGCGACACC GAGATCCACG TCGGCTACAC CAACCGCTAC GGCCGCAAGC GTTCCTACCA CACCTCCTTC GAGGGCGTGA TCGGTTTCCT GGGCCGCCGG CACCGCGAGG CCGAGTCGGA CAGCAGCCGG GAGCGGTACG AGGGCTACAT GCGCGACATC CCCTGCCCGG CCTGCCGCGG CGCGCGGCTA AAGCCGGAGT CACTCGCGGT GACGCTGGGT GGCCGGTCGA TCGCCGAGGT TTCGGGTCTG GCGATCGGCG AGTGCGCCGT GTTCCTGCGC GGGCTCGACC TCACCGAACG GGAGCGGACC ATTGCCGGTC GGGTGCTCAA GGAGGTCGAC GCGCGGCTGA CATTCCTGCT GGACGTGGGG CTGGACTACC TCTCCCTCGA CCGGGCCGCC GGCACCCTGG CGGGCGGCGA GGCACAGCGC ATCCGGCTGG CCACCCAGAT CGGTTCCGGG CTCGTCGGGG TGCTGTACGT GCTCGACGAG CCGTCGATCG GCCTGCACCA GCGGGACAAC CGCCGGCTCA TCCAGACCCT GCTGCGGCTG CGCGACCTGG GTAACACGCT CATCGTCGTC GAGCACGACG AGGACACGAT TCGCGCCTCC GACTGGGTGG TCGACATCGG CCCCGGCGCC GGCGAGCACG GCGGTCGGGT CGTCGTCTCC GGGCCGGTCG AGAAGCTGCT CGCCAGCGAG GAGTCGATGA CCGGCGCCTA CCTGTCCGGG CGGCGGCGGA TCCCGGTGCC CGACATCCGC CGGGTGCCGG CCAAGGGCCG GGCACTGACC GTGCACGGGG CCCGGCAGCA CAACCTGCGC GACGTCACCG TGTCGTTCCC GCTGGGCTGT TTCGTGGCCG TCACCGGGGT GTCGGGCTCG GGGAAGTCCA CCCTCGTCAA CGACATCCTC GCCGCCGTCC TGGCGAACAA GCTCAACGGC GCCCGGCAGG TGCCGGGCCG GCACCGCACT GTCAGCGGGC TCGACAATCT CGACAAGGCG GTCCGGGTCG ACCAGTCCCC GATCGGACGC ACGCCGCGGT CCAACCCCGC CACCTACACC GGCGTGTTCG ACCACATCCG CCGGCTGTTC GCCGAGACGA CCGAGGCGAA GGTCCGTGGC TATCTGCCGG GCCGGTTCTC GTTCAACGTC AAGGGCGGGC GGTGCGAGGC GTGTTCCGGT GACGGCACCC TCAAGATTGA GATGAACTTC CTGCCAGACG TGTACGTGCC GTGCGAGGTC TGCCACGGCG ACCGGTACAA CCGGGAGACG CTGGAGGTGC ACTACAAGGG CAGGAACATC GCCGAGGTGC TCGACATGCC GATTGAGGAG GCGGCGGAGT TCTTCGCCGC GGTGCCGGCC ATCGCCCGCC ACCTGCGGAC GCTGAACGAC GTCGGCCTGG GGTACGTCCG GCTCGGCCAG TCGGCGCCGA CCCTCTCCGG CGGCGAGGCG CAGCGGGTCA AGCTCGCCTC GGAGCTGCAG CGGCGCTCCA CCGGCCGGAC GGTCTACGTG CTCGACGAGC CGACGACCGG TCTGCACTTC GAGGACATCC GCAAGCTGCT CGGCGTGCTC GGCCGGCTGG TCGACGCCGG GAACACGGTC ATCGTGATCG AGCACAACCT CGATGTCATC AAGACGGCGG ACTGGATCAT TGACCTGGGT CCGGAGGGGG GCACGGGAGG CGGCCGTGTC GTCGCGCAGG GGTCGCCCGA GGCCGTGGCT GCGGTCGAGG AGAGCCACAC CGCGGTCTTC CTCCGGGAGA TCCTCAGTGA CCGGGTCGCC GAGGTGGGAC CGCTCCCGCC CTCTCAGGTC CGCAGCAGGT CCGCGAGGAC GGCGTCCAGC TCCTGTTCGG TGTGCTCCTG A
|
Protein sequence | MADRLVVRGA REHNLRDVDL DLPRDGLIVF TGMSGSGKSS LAFDTIFAEG QRRYVESLSA YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLFARAGRPH CPKCGRLISR QTPQQIVDRL LELPEGTRFQ VLAPVVRGRK GEYVDLFAEL QSSGFARARV DGTVVPLTDP PKLEKQRKHT IEVVVDRLAI KSSAKRRLTD SVETALKLGG GLVLVDFVDR DADDPERERM YSEHLACLYD DLSFEELEPR SFSFNSPYGA CPECTGIGTR KEVDPELVVP DPTLSLAQGA VAPWSGGHNK EYFERLLTAL AEDLSFRMDT PWEGLPERAR KAVLYGSGDT EIHVGYTNRY GRKRSYHTSF EGVIGFLGRR HREAESDSSR ERYEGYMRDI PCPACRGARL KPESLAVTLG GRSIAEVSGL AIGECAVFLR GLDLTERERT IAGRVLKEVD ARLTFLLDVG LDYLSLDRAA GTLAGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN RRLIQTLLRL RDLGNTLIVV EHDEDTIRAS DWVVDIGPGA GEHGGRVVVS GPVEKLLASE ESMTGAYLSG RRRIPVPDIR RVPAKGRALT VHGARQHNLR DVTVSFPLGC FVAVTGVSGS GKSTLVNDIL AAVLANKLNG ARQVPGRHRT VSGLDNLDKA VRVDQSPIGR TPRSNPATYT GVFDHIRRLF AETTEAKVRG YLPGRFSFNV KGGRCEACSG DGTLKIEMNF LPDVYVPCEV CHGDRYNRET LEVHYKGRNI AEVLDMPIEE AAEFFAAVPA IARHLRTLND VGLGYVRLGQ SAPTLSGGEA QRVKLASELQ RRSTGRTVYV LDEPTTGLHF EDIRKLLGVL GRLVDAGNTV IVIEHNLDVI KTADWIIDLG PEGGTGGGRV VAQGSPEAVA AVEESHTAVF LREILSDRVA EVGPLPPSQV RSRSARTASS SCSVCS
|
| |