Gene Franean1_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2057 
SymboluvrA 
ID5670458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2477637 
End bp2480534 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content71% 
IMG OID641240979 
Productexcinuclease ABC subunit A 
Protein accessionYP_001506400 
Protein GI158313892 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0577341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.730418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACC GTCTTGTCGT CCGCGGAGCA CGTGAGCACA ACCTTCGTGA CGTCGATCTC 
GACCTGCCCA GGGACGGCCT GGTGGTCTTC ACCGGGCTGT CGGGCTCGGG CAAGTCGAGC
CTCGCCTTCG ACACGATCTT CGCCGAGGGG CAGCGCCGGT ACGTCGAGTC ACTTTCGGCC
TACGCCCGCC AGTTCCTCGG CCAGATGGAC AAGCCGGACG TCGACTTCAT CGAGGGCCTG
TCACCCGCGG TCTCGATCGA CCAGAAGTCG ACCTCGCGCA ACCCGCGCTC GACGGTGGGC
ACGATCACCG AGGTCTACGA CTACCTGCGG CTGCTGTACG CGCGGGTCGG CCACCCGCAC
TGCCCGAAGT GCGGCCGCCC GATCGCCCGG CAGACCCCGC AGCAGATCGT CGACCGGCTC
CTGGAGCTCC CCGAGGGCAC CCGCTTCCAG GTCCTGGCGC CGGTCGTCCG TGGCCGCAAG
GGCGAGTACG CCGACCTCTT CGCCGAGCTG CAGAGCTCCG GCTTCGCCCG GGTCCGGGTG
GACGGCACCG TCGTCGCGCT CACCGAGTGC CCCAAGCTCG AGAAGCAGCG CAAGCACACG
ATCGAGGTCG TGGTCGACCG GCTCGCGGCG AAGGAGTCGG CCAAGCGCCG CCTCACCGAC
TCGATCGAGA CGGCGCTCAA GCTCGGCAGC GGCCTGGTGC TGCTCGACTT CGTCGACCGG
GATCCGGCCG ACCCCGACCG CGAGCGGATG TACTCCGAGC ACCTGGCCTG CATGTACGAC
GACCTCTCCT TCGAGGAGAT GGAGCCGAGG TCGTTCTCGT TCAACTCGCC GTTCGGGGCC
TGCCAGGAGT GTTCGGGTCT GGGAACCCGC AAGGAGGTCG ACCCCGACCT CGTCGTGCCC
GACCCGACGC TCTCGCTCGC CGAGGGCGCG ATCCAACCGT GGGCCGGCGG GCACAACAAG
GAGTACTTCG AGCGGCTGCT GACGGCGCTC AGCGAGGACC TCAGCTTCCG GATGGACACC
CCGTGGGAGG GGCTGCCGGA GCGCGCCCGC AAGGCGATCC TGCACGGCAG CGGCGAGACC
GAGATCCACG TCGGCTACAC GAACCGCTAC GGCCGCAAGC GCTCCTACTA CACGTCCTTC
GAAGGCGTGA TGGCCTTCCT GCGCCGCCGC CACTCGGACG CCGAGTCCGA CAGCAGCCGC
GAGCGCTACG AGGGCTACAT GCGCGACGTG CCCTGCCCGG CGTGCCGGGG CGCCCGGCTC
AAACCGGAGT CGCTGGCCGT CACGCTCGGC GGCCGCTCCA TCGCCGAGGT CTCCGGGATG
TCGATCGGGG AGTGCGCGGC ATTCCTGCGC GGCGTCGAGC TCAGCGAGCG GGAGCAGGCC
ATCGCCGGGC GGGTCCTCAA GGAGATCGAC GCGCGGCTCG CCTTCCTGCT GGACGTCGGC
CTCGACTACC TGTCCCTGAA CCGTTCCGCC GGCACGCTCG CCGGTGGGGA GGCCCAGCGG
ATCCGCCTCG CCACCCAGAT CGGCTCGGGG CTGGTCGGCG TGCTCTACGT GCTCGACGAG
CCGTCGATCG GCCTCCACCA GCGGGACAAC CGCCGCCTCA TCCAGACCCT CGTCCGGCTG
CGCGATCTCG GCAACACGCT GATCGTCGTG GAACACGACG AGGACACGAT CCGTGCCGCC
GACTGGGTGG TGGACATCGG CCCGGGGGCC GGCGAGCACG GCGGCCGGGT GGTCGTCTCC
GGCCCGGTCG AGGAGCTGTT GAGCAGCGAG GAGTCGATGA CCGGTGCCTA CCTCTCCGGT
CGGCGGCAGA TCCCGGTGCC CGACATCCGC CGCGCGCCGA CGAAGTCGCG CTCGCTGACC
GTGCACGGCG CCCGGGAGCA CAACCTCCGG GACGTGACGG TGTCGTTCCC GCTCGGCTGC
CTGGTGGCGG TCACCGGCGT CTCCGGCTCG GGCAAGTCGA CGCTGGTCAA CGACATCCTC
GCCAGCGTGC TCGCGAACCA TCTCAACGGC GCGCGCGAGG TCCCGGGCCG GCACCGCACG
GTGTCCGGCC TCGAGCACCT CGACAAGGCG GTGCGGGTCG ACCAGTCGCC GATCGGCCGG
ACCCCGCGGT CCAACCCGGC GACGTACACG GGCGTGTTCG ACCACATCCG CCGGCTGTTC
GCGGAGACGA CCGAGGCGAA GGTCCGCGGG TACCTGCCGG GCCGCTTCTC GTTCAACGTC
AAGGGCGGCC GGTGCGAGGC ATGCTCCGGC GACGGCACCA TCAAGATCGA GATGAACTTC
CTGCCGGACG TCTACGTCCC GTGCGAGGTC TGCGAGGGGG CCCGGTACAA CCGGGAGACG
CTGGAGGTGC ACTTCAAGGG CCGGAACATC GCCGAGGTGC TCGACATGCC GATCGAGGAG
GCCGCGGAGT TCTTCGCGGC GGTCCCGGCG ATCGCCCGGC ACCTGCGCAC GCTCAACGAC
GTCGGTCTCG GCTACGTCCG GCTGGGCCAG TCGGCGCCCA CCCTCTCGGG CGGTGAGGCG
CAGCGGGTGA AGCTCGCCTC GGAGCTGCAG CGACGCTCCA CCGGGCGGAC CGTCTACGTG
CTGGACGAGC CCACCACCGG CCTGCACTTC GAGGACATCC GGAAACTGCT CGGCGTGCTC
GGCCGGCTCG TCGACGCCGG CAACACGGTG ATCGTCATCG AGCACAACCT CGACGTCATC
AAGACCGCCG ACTGGATCGT CGACATGGGC CCGGAGGGCG GGACGGGCGG CGGCCGGGTG
ATCGCCGAGG GAGCTCCCGA GACGGTCGCC ACGGTGTCCG AGAGCCACAC CGGCGCCTTC
CTGCGGGAGA TCCTCGGCGA CCGGGTCGAC GCCCGTCCGA AGTCACGGCC GAAGCTCGCC
GGCGCGGCCG CGCTGTGA
 
Protein sequence
MADRLVVRGA REHNLRDVDL DLPRDGLVVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA 
YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLYARVGHPH
CPKCGRPIAR QTPQQIVDRL LELPEGTRFQ VLAPVVRGRK GEYADLFAEL QSSGFARVRV
DGTVVALTEC PKLEKQRKHT IEVVVDRLAA KESAKRRLTD SIETALKLGS GLVLLDFVDR
DPADPDRERM YSEHLACMYD DLSFEEMEPR SFSFNSPFGA CQECSGLGTR KEVDPDLVVP
DPTLSLAEGA IQPWAGGHNK EYFERLLTAL SEDLSFRMDT PWEGLPERAR KAILHGSGET
EIHVGYTNRY GRKRSYYTSF EGVMAFLRRR HSDAESDSSR ERYEGYMRDV PCPACRGARL
KPESLAVTLG GRSIAEVSGM SIGECAAFLR GVELSEREQA IAGRVLKEID ARLAFLLDVG
LDYLSLNRSA GTLAGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN RRLIQTLVRL
RDLGNTLIVV EHDEDTIRAA DWVVDIGPGA GEHGGRVVVS GPVEELLSSE ESMTGAYLSG
RRQIPVPDIR RAPTKSRSLT VHGAREHNLR DVTVSFPLGC LVAVTGVSGS GKSTLVNDIL
ASVLANHLNG AREVPGRHRT VSGLEHLDKA VRVDQSPIGR TPRSNPATYT GVFDHIRRLF
AETTEAKVRG YLPGRFSFNV KGGRCEACSG DGTIKIEMNF LPDVYVPCEV CEGARYNRET
LEVHFKGRNI AEVLDMPIEE AAEFFAAVPA IARHLRTLND VGLGYVRLGQ SAPTLSGGEA
QRVKLASELQ RRSTGRTVYV LDEPTTGLHF EDIRKLLGVL GRLVDAGNTV IVIEHNLDVI
KTADWIVDMG PEGGTGGGRV IAEGAPETVA TVSESHTGAF LREILGDRVD ARPKSRPKLA
GAAAL