Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2057 |
Symbol | uvrA |
ID | 5670458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2477637 |
End bp | 2480534 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240979 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001506400 |
Protein GI | 158313892 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0577341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.730418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACC GTCTTGTCGT CCGCGGAGCA CGTGAGCACA ACCTTCGTGA CGTCGATCTC GACCTGCCCA GGGACGGCCT GGTGGTCTTC ACCGGGCTGT CGGGCTCGGG CAAGTCGAGC CTCGCCTTCG ACACGATCTT CGCCGAGGGG CAGCGCCGGT ACGTCGAGTC ACTTTCGGCC TACGCCCGCC AGTTCCTCGG CCAGATGGAC AAGCCGGACG TCGACTTCAT CGAGGGCCTG TCACCCGCGG TCTCGATCGA CCAGAAGTCG ACCTCGCGCA ACCCGCGCTC GACGGTGGGC ACGATCACCG AGGTCTACGA CTACCTGCGG CTGCTGTACG CGCGGGTCGG CCACCCGCAC TGCCCGAAGT GCGGCCGCCC GATCGCCCGG CAGACCCCGC AGCAGATCGT CGACCGGCTC CTGGAGCTCC CCGAGGGCAC CCGCTTCCAG GTCCTGGCGC CGGTCGTCCG TGGCCGCAAG GGCGAGTACG CCGACCTCTT CGCCGAGCTG CAGAGCTCCG GCTTCGCCCG GGTCCGGGTG GACGGCACCG TCGTCGCGCT CACCGAGTGC CCCAAGCTCG AGAAGCAGCG CAAGCACACG ATCGAGGTCG TGGTCGACCG GCTCGCGGCG AAGGAGTCGG CCAAGCGCCG CCTCACCGAC TCGATCGAGA CGGCGCTCAA GCTCGGCAGC GGCCTGGTGC TGCTCGACTT CGTCGACCGG GATCCGGCCG ACCCCGACCG CGAGCGGATG TACTCCGAGC ACCTGGCCTG CATGTACGAC GACCTCTCCT TCGAGGAGAT GGAGCCGAGG TCGTTCTCGT TCAACTCGCC GTTCGGGGCC TGCCAGGAGT GTTCGGGTCT GGGAACCCGC AAGGAGGTCG ACCCCGACCT CGTCGTGCCC GACCCGACGC TCTCGCTCGC CGAGGGCGCG ATCCAACCGT GGGCCGGCGG GCACAACAAG GAGTACTTCG AGCGGCTGCT GACGGCGCTC AGCGAGGACC TCAGCTTCCG GATGGACACC CCGTGGGAGG GGCTGCCGGA GCGCGCCCGC AAGGCGATCC TGCACGGCAG CGGCGAGACC GAGATCCACG TCGGCTACAC GAACCGCTAC GGCCGCAAGC GCTCCTACTA CACGTCCTTC GAAGGCGTGA TGGCCTTCCT GCGCCGCCGC CACTCGGACG CCGAGTCCGA CAGCAGCCGC GAGCGCTACG AGGGCTACAT GCGCGACGTG CCCTGCCCGG CGTGCCGGGG CGCCCGGCTC AAACCGGAGT CGCTGGCCGT CACGCTCGGC GGCCGCTCCA TCGCCGAGGT CTCCGGGATG TCGATCGGGG AGTGCGCGGC ATTCCTGCGC GGCGTCGAGC TCAGCGAGCG GGAGCAGGCC ATCGCCGGGC GGGTCCTCAA GGAGATCGAC GCGCGGCTCG CCTTCCTGCT GGACGTCGGC CTCGACTACC TGTCCCTGAA CCGTTCCGCC GGCACGCTCG CCGGTGGGGA GGCCCAGCGG ATCCGCCTCG CCACCCAGAT CGGCTCGGGG CTGGTCGGCG TGCTCTACGT GCTCGACGAG CCGTCGATCG GCCTCCACCA GCGGGACAAC CGCCGCCTCA TCCAGACCCT CGTCCGGCTG CGCGATCTCG GCAACACGCT GATCGTCGTG GAACACGACG AGGACACGAT CCGTGCCGCC GACTGGGTGG TGGACATCGG CCCGGGGGCC GGCGAGCACG GCGGCCGGGT GGTCGTCTCC GGCCCGGTCG AGGAGCTGTT GAGCAGCGAG GAGTCGATGA CCGGTGCCTA CCTCTCCGGT CGGCGGCAGA TCCCGGTGCC CGACATCCGC CGCGCGCCGA CGAAGTCGCG CTCGCTGACC GTGCACGGCG CCCGGGAGCA CAACCTCCGG GACGTGACGG TGTCGTTCCC GCTCGGCTGC CTGGTGGCGG TCACCGGCGT CTCCGGCTCG GGCAAGTCGA CGCTGGTCAA CGACATCCTC GCCAGCGTGC TCGCGAACCA TCTCAACGGC GCGCGCGAGG TCCCGGGCCG GCACCGCACG GTGTCCGGCC TCGAGCACCT CGACAAGGCG GTGCGGGTCG ACCAGTCGCC GATCGGCCGG ACCCCGCGGT CCAACCCGGC GACGTACACG GGCGTGTTCG ACCACATCCG CCGGCTGTTC GCGGAGACGA CCGAGGCGAA GGTCCGCGGG TACCTGCCGG GCCGCTTCTC GTTCAACGTC AAGGGCGGCC GGTGCGAGGC ATGCTCCGGC GACGGCACCA TCAAGATCGA GATGAACTTC CTGCCGGACG TCTACGTCCC GTGCGAGGTC TGCGAGGGGG CCCGGTACAA CCGGGAGACG CTGGAGGTGC ACTTCAAGGG CCGGAACATC GCCGAGGTGC TCGACATGCC GATCGAGGAG GCCGCGGAGT TCTTCGCGGC GGTCCCGGCG ATCGCCCGGC ACCTGCGCAC GCTCAACGAC GTCGGTCTCG GCTACGTCCG GCTGGGCCAG TCGGCGCCCA CCCTCTCGGG CGGTGAGGCG CAGCGGGTGA AGCTCGCCTC GGAGCTGCAG CGACGCTCCA CCGGGCGGAC CGTCTACGTG CTGGACGAGC CCACCACCGG CCTGCACTTC GAGGACATCC GGAAACTGCT CGGCGTGCTC GGCCGGCTCG TCGACGCCGG CAACACGGTG ATCGTCATCG AGCACAACCT CGACGTCATC AAGACCGCCG ACTGGATCGT CGACATGGGC CCGGAGGGCG GGACGGGCGG CGGCCGGGTG ATCGCCGAGG GAGCTCCCGA GACGGTCGCC ACGGTGTCCG AGAGCCACAC CGGCGCCTTC CTGCGGGAGA TCCTCGGCGA CCGGGTCGAC GCCCGTCCGA AGTCACGGCC GAAGCTCGCC GGCGCGGCCG CGCTGTGA
|
Protein sequence | MADRLVVRGA REHNLRDVDL DLPRDGLVVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLYARVGHPH CPKCGRPIAR QTPQQIVDRL LELPEGTRFQ VLAPVVRGRK GEYADLFAEL QSSGFARVRV DGTVVALTEC PKLEKQRKHT IEVVVDRLAA KESAKRRLTD SIETALKLGS GLVLLDFVDR DPADPDRERM YSEHLACMYD DLSFEEMEPR SFSFNSPFGA CQECSGLGTR KEVDPDLVVP DPTLSLAEGA IQPWAGGHNK EYFERLLTAL SEDLSFRMDT PWEGLPERAR KAILHGSGET EIHVGYTNRY GRKRSYYTSF EGVMAFLRRR HSDAESDSSR ERYEGYMRDV PCPACRGARL KPESLAVTLG GRSIAEVSGM SIGECAAFLR GVELSEREQA IAGRVLKEID ARLAFLLDVG LDYLSLNRSA GTLAGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN RRLIQTLVRL RDLGNTLIVV EHDEDTIRAA DWVVDIGPGA GEHGGRVVVS GPVEELLSSE ESMTGAYLSG RRQIPVPDIR RAPTKSRSLT VHGAREHNLR DVTVSFPLGC LVAVTGVSGS GKSTLVNDIL ASVLANHLNG AREVPGRHRT VSGLEHLDKA VRVDQSPIGR TPRSNPATYT GVFDHIRRLF AETTEAKVRG YLPGRFSFNV KGGRCEACSG DGTIKIEMNF LPDVYVPCEV CEGARYNRET LEVHFKGRNI AEVLDMPIEE AAEFFAAVPA IARHLRTLND VGLGYVRLGQ SAPTLSGGEA QRVKLASELQ RRSTGRTVYV LDEPTTGLHF EDIRKLLGVL GRLVDAGNTV IVIEHNLDVI KTADWIVDMG PEGGTGGGRV IAEGAPETVA TVSESHTGAF LREILGDRVD ARPKSRPKLA GAAAL
|
| |