Gene Francci3_1631 details


Gene Information

Locus tag: Francci3_1631
Symbol: uvrA
ID: 3905910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1960338
End bp: 1963268
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 70%
IMG OID: 637878969
Product: excinuclease ABC subunit A
Protein accession: YP_480736
Protein GI: 86740336
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
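
The gene and protein lengths above follow from the listed coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the function names are hypothetical, chosen for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1960338, 1963268)
print(length, protein_length_aa(length))  # prints "2931 976"
```

Both values agree with the Gene Length and Protein Length fields of this record.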


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.978501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGATC GTCTTGTCGT GCGTGGCGCG CGTGAACACA ATCTCCGTGA CGTCGACCTC 
GACCTGCCCC GCGACGGGCT GATCGTCTTC ACCGGTATGT CCGGGTCCGG CAAGTCCAGC
CTCGCCTTCG ACACAATCTT CGCCGAGGGG CAGCGGCGCT ACGTGGAGTC GCTGTCGGCC
TATGCGCGGC AGTTCCTCGG GCAGATGGAC AAGCCCGACG TCGACTTCAT CGAGGGGCTG
TCTCCGGCCG TCTCCATCGA TCAGAAGTCG ACCTCGCGCA ACCCGCGGTC CACCGTCGGC
ACGATCACCG AGGTCTACGA CTACCTGCGG CTGCTCTTCG CCCGCGCCGG CCGTCCGCAC
TGCCCGAAGT GCGGGCGGCT GATCTCCCGG CAGACCCCGC AGCAGATCGT CGACCGGCTG
CTGGAGTTGC CCGAAGGCAC CCGCTTCCAG GTGCTGGCCC CGGTGGTGCG TGGCCGCAAG
GGCGAGTACG TCGACCTGTT CGCCGAGCTG CAGAGCTCCG GGTTCGCACG GGCCCGCGTC
GACGGCACGG TGGTGCCGCT CACCGATCCG CCGAAGCTGG AGAAGCAGCG CAAGCATACG
ATCGAGGTCG TCGTCGATCG GCTCGCGATC AAGTCGTCAG CGAAGCGGCG GCTCACCGAC
TCGGTGGAGA CGGCCCTGAA GCTGGGCGGC GGGCTCGTCC TGGTTGACTT CGTCGACCGC
GACGCCGACG ATCCCGAGCG CGAGCGGATG TACTCCGAGC ATCTGGCCTG CCTCTACGAC
GACCTCTCCT TTGAGGAGCT GGAGCCGCGG TCGTTCTCGT TCAACTCGCC GTACGGCGCC
TGCCCCGAGT GCACCGGGAT CGGGACCCGC AAGGAGGTCG ACCCGGAGCT CGTCGTGCCC
GACCCCACCC TGTCGCTGGC GCAGGGCGCG GTGGCCCCCT GGTCGGGCGG CCACAACAAG
GAGTACTTCG AACGGTTGCT CACGGCCCTC GCCGAGGACC TCAGCTTCAG GATGGACACC
CCGTGGGAGG GGCTGCCCGA GCGGGCCCGC AAGGCCGTGC TGTACGGCAG CGGCGACACC
GAGATCCACG TCGGCTACAC CAACCGCTAC GGCCGCAAGC GTTCCTACCA CACCTCCTTC
GAGGGCGTGA TCGGTTTCCT GGGCCGCCGG CACCGCGAGG CCGAGTCGGA CAGCAGCCGG
GAGCGGTACG AGGGCTACAT GCGCGACATC CCCTGCCCGG CCTGCCGCGG CGCGCGGCTA
AAGCCGGAGT CACTCGCGGT GACGCTGGGT GGCCGGTCGA TCGCCGAGGT TTCGGGTCTG
GCGATCGGCG AGTGCGCCGT GTTCCTGCGC GGGCTCGACC TCACCGAACG GGAGCGGACC
ATTGCCGGTC GGGTGCTCAA GGAGGTCGAC GCGCGGCTGA CATTCCTGCT GGACGTGGGG
CTGGACTACC TCTCCCTCGA CCGGGCCGCC GGCACCCTGG CGGGCGGCGA GGCACAGCGC
ATCCGGCTGG CCACCCAGAT CGGTTCCGGG CTCGTCGGGG TGCTGTACGT GCTCGACGAG
CCGTCGATCG GCCTGCACCA GCGGGACAAC CGCCGGCTCA TCCAGACCCT GCTGCGGCTG
CGCGACCTGG GTAACACGCT CATCGTCGTC GAGCACGACG AGGACACGAT TCGCGCCTCC
GACTGGGTGG TCGACATCGG CCCCGGCGCC GGCGAGCACG GCGGTCGGGT CGTCGTCTCC
GGGCCGGTCG AGAAGCTGCT CGCCAGCGAG GAGTCGATGA CCGGCGCCTA CCTGTCCGGG
CGGCGGCGGA TCCCGGTGCC CGACATCCGC CGGGTGCCGG CCAAGGGCCG GGCACTGACC
GTGCACGGGG CCCGGCAGCA CAACCTGCGC GACGTCACCG TGTCGTTCCC GCTGGGCTGT
TTCGTGGCCG TCACCGGGGT GTCGGGCTCG GGGAAGTCCA CCCTCGTCAA CGACATCCTC
GCCGCCGTCC TGGCGAACAA GCTCAACGGC GCCCGGCAGG TGCCGGGCCG GCACCGCACT
GTCAGCGGGC TCGACAATCT CGACAAGGCG GTCCGGGTCG ACCAGTCCCC GATCGGACGC
ACGCCGCGGT CCAACCCCGC CACCTACACC GGCGTGTTCG ACCACATCCG CCGGCTGTTC
GCCGAGACGA CCGAGGCGAA GGTCCGTGGC TATCTGCCGG GCCGGTTCTC GTTCAACGTC
AAGGGCGGGC GGTGCGAGGC GTGTTCCGGT GACGGCACCC TCAAGATTGA GATGAACTTC
CTGCCAGACG TGTACGTGCC GTGCGAGGTC TGCCACGGCG ACCGGTACAA CCGGGAGACG
CTGGAGGTGC ACTACAAGGG CAGGAACATC GCCGAGGTGC TCGACATGCC GATTGAGGAG
GCGGCGGAGT TCTTCGCCGC GGTGCCGGCC ATCGCCCGCC ACCTGCGGAC GCTGAACGAC
GTCGGCCTGG GGTACGTCCG GCTCGGCCAG TCGGCGCCGA CCCTCTCCGG CGGCGAGGCG
CAGCGGGTCA AGCTCGCCTC GGAGCTGCAG CGGCGCTCCA CCGGCCGGAC GGTCTACGTG
CTCGACGAGC CGACGACCGG TCTGCACTTC GAGGACATCC GCAAGCTGCT CGGCGTGCTC
GGCCGGCTGG TCGACGCCGG GAACACGGTC ATCGTGATCG AGCACAACCT CGATGTCATC
AAGACGGCGG ACTGGATCAT TGACCTGGGT CCGGAGGGGG GCACGGGAGG CGGCCGTGTC
GTCGCGCAGG GGTCGCCCGA GGCCGTGGCT GCGGTCGAGG AGAGCCACAC CGCGGTCTTC
CTCCGGGAGA TCCTCAGTGA CCGGGTCGCC GAGGTGGGAC CGCTCCCGCC CTCTCAGGTC
CGCAGCAGGT CCGCGAGGAC GGCGTCCAGC TCCTGTTCGG TGTGCTCCTG A
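
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation such as the GC content field. A minimal sketch, assuming plain A/C/G/T input; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here for brevity:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence above; paste the full block to check
# the record's GC content field.
first_row = "ATGGCCGATC GTCTTGTCGT GCGTGGCGCG CGTGAACACA ATCTCCGTGA CGTCGACCTC"
seq = clean_sequence(first_row)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```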
 
Protein sequence
MADRLVVRGA REHNLRDVDL DLPRDGLIVF TGMSGSGKSS LAFDTIFAEG QRRYVESLSA 
YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLFARAGRPH
CPKCGRLISR QTPQQIVDRL LELPEGTRFQ VLAPVVRGRK GEYVDLFAEL QSSGFARARV
DGTVVPLTDP PKLEKQRKHT IEVVVDRLAI KSSAKRRLTD SVETALKLGG GLVLVDFVDR
DADDPERERM YSEHLACLYD DLSFEELEPR SFSFNSPYGA CPECTGIGTR KEVDPELVVP
DPTLSLAQGA VAPWSGGHNK EYFERLLTAL AEDLSFRMDT PWEGLPERAR KAVLYGSGDT
EIHVGYTNRY GRKRSYHTSF EGVIGFLGRR HREAESDSSR ERYEGYMRDI PCPACRGARL
KPESLAVTLG GRSIAEVSGL AIGECAVFLR GLDLTERERT IAGRVLKEVD ARLTFLLDVG
LDYLSLDRAA GTLAGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN RRLIQTLLRL
RDLGNTLIVV EHDEDTIRAS DWVVDIGPGA GEHGGRVVVS GPVEKLLASE ESMTGAYLSG
RRRIPVPDIR RVPAKGRALT VHGARQHNLR DVTVSFPLGC FVAVTGVSGS GKSTLVNDIL
AAVLANKLNG ARQVPGRHRT VSGLDNLDKA VRVDQSPIGR TPRSNPATYT GVFDHIRRLF
AETTEAKVRG YLPGRFSFNV KGGRCEACSG DGTLKIEMNF LPDVYVPCEV CHGDRYNRET
LEVHYKGRNI AEVLDMPIEE AAEFFAAVPA IARHLRTLND VGLGYVRLGQ SAPTLSGGEA
QRVKLASELQ RRSTGRTVYV LDEPTTGLHF EDIRKLLGVL GRLVDAGNTV IVIEHNLDVI
KTADWIIDLG PEGGTGGGRV VAQGSPEAVA AVEESHTAVF LREILSDRVA EVGPLPPSQV
RSRSARTASS SCSVCS
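
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and it assumes the input is in frame. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# Last row of the gene sequence above (in frame, ends with the TGA stop codon).
last_row = "CGCAGCAGGT CCGCGAGGAC GGCGTCCAGC TCCTGTTCGG TGTGCTCCTG A"
print(translate_cds(last_row))  # prints "RSRSARTASSSCSVCS"
```

The printed residues match the final row of the protein sequence in this record.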