Gene Franean1_3777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3777 
Symbol 
ID5672142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4478201 
End bp4481512 
Gene Length3312 bp 
Protein Length1103 aa 
Translation table11 
GC content79% 
IMG OID641242658 
Productexonuclease SbcC 
Protein accessionYP_001508078 
Protein GI158315570 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00618704 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCGG TGCAGCTGGA CCTCGCCGGT TTCGGATCGT TCCGGGAACC GGCGGTGCTG 
GACCTCACCG ACGCCGACTA CTTCGCGCTG GTGGGCCCGA CCGGCGCCGG CAAGTCGACG
CTGATCGACG CGCTGACGTT CGCGCTGTTC GGCTCGGTGC CGCGCTGGAA CAACCGGGCG
ACGGTACATC TGGCGCTCGC GCCGACCGCG GCCCGCGGCA CGGTACGTCT CGTCTTCGAC
GCGGCCGGCG CCCGCTACGT GGTGGCGCGG GAGCTGCGCC GCGCCGCCCG CGGGGGTGTG
ACCGTGCGCA ACGCCCGGTT CGAGCGGCTG GTCGACCCGG CCGGCGCGGG CGGGCCGGGG
GAGCCGACCG ACGTGCTCGC CGCGGGCGCG CCCGCGGTCA CGGCGGCGGT CGAGGAACTG
CTCGGGCTCA CCTTCGAGCA CTTCTGCACC TGCGTGGTGC TTCCGCAGGG CGAGTTCGCC
GAGTTCCTGC GGGCGAAGCC GAGCGAGCGG CAGGCCATCC TCACCCGGCT GCTCGGGCTG
GGCGTCTACG ACACGATCCG CGCGGCGGCG AACGCCCGCG CGAGCGACCA GCGCCAGCGC
GCGGACGTCC TCGCCGAGCA GCTCGCCGGG TACGCGGACG CGACCGCCGC TGCCGAGGCC
GCCGCGGCCG GCCGGGTCGG CGCGCTCACC CGGCTCGCCG CCGAGGTGGC CGCCGCCCTC
GCCGAGGTGG CGCGGCTGGA CACCGCCGCC GCCGAAGCCG CCGCCCGGAC CGCCAGACTG
GGCGCCGAGC GCGACCAGCT CGCCGCCGTC GAGCCGCCGG CCGGGCTGGT GGACCTCACC
GGGCGGCTGC GCGCTGCCCG GGCCGCCGAC GCGGACGCCC GTCGGCGCCG CCAGGAGGCC
GAGCGGGCGG ACAGCGCCGC GCGGGGAGAT CGCGCCGCCG CGCAGCCCCG TGCTCCGCTC
GACCGGTACG CCCGCGACCA CCGGGAGCTC GCCGACCTGC TCGCGACCAG ACCGGCGGCC
GTCACCGAGC ACGCCGAGGC CGACGCCGCG CTCGCCCGGG CGGACGCCGA ACTCGGGGCG
GCGCGCGCCG CGCTGGCGGC GGCGACCGCC GCCGAACGCG ACGCGGAGCA GGCCGCTGTA
TCCGCTGCCG CGGAGTTGAG CCGGCTGGAC TCCGAGCTGG GGCTGCTCGC CGATGTCCGG
GCGCCGGCCG GGCTGGCGGC CCTGACCGGC CGCCTGCGCG GGGCCGAGGC GGCGCGGGCC
GAGGCCTTGG AGCGCCGGGA GGCGGCCGAG CGGGCTGACA CCGCCGTCCG CGAGGAGCGG
GCCGCCGCAC CGTCACGCGG CCCGCTGGAG CGGGCGCTGC GCGATCATGA GCTGCTCGTC
GAGCTGCTGG CCGGCCAGGC CGCTGCGGCG GCCGCGCACG CCGACGTCCA GGCGGCGCTC
GGCCGGGCCG ACGCCGCACT GGAAGAGGCC CGGCACACCC GGCACCGGGC GGCCGGAGCC
CGGGATGATG CCGCCCGCGC CGACCTGGCC GCCGCGTTGC GGCCGCACCT TGCCGTGGGG
GAGGACTGCC CGGTCTGCGC GCGGCCCGTC GCGACGCTGC CGCCCCCGCT GCCGGCCGCC
GGCCTCGGTA CCGCGGACGA GGCGGTCACC GCCGCCGAAC GCGAGGTGAC CGCCCGCGCC
GCCGAGGCGA CCGCCGCGAC CCGGGCGGCG GCGGAGTCCG AGGCCCGGCT CGCGGACCTG
ACCAGGCGGG TCGAGGAGCT GCGCCGCGCG GTCGCCGACG CGCCGGACGC CCGCCGCGCG
GCCGGCCTCC TCGCGGACGT CGAACGGCTC GACGGCGCCG TCGCCCGCGC GGACCTGGCC
CTGCGCCGGG CCCGCGACGC CGCGGCCGCC GCCGACACGG TCGTCACCCG CTCCCGGGAC
GGGGCGGCCG GGGCGCGGCG GGCGCTTGAC GCCGTCCGCG ACCCGCTGGT CTCCCTCGGC
GCGCCCGGGC TGGCGGGCAC CGACCTGGAG GCGGACTGGG CCACGCTGGT CACCTGGGCC
GCCGAGCTCG CCCGCGACCG GACCGGTCAG CGTGACGCCG CGGCCGAGCG TTCGGCCGCC
GCCACGGCCT CCCTGGCCGC CGCGCGGCAG GCGCTGGCGC TCGCCTCCGC CGGGCAGGCG
AGGGCGGACG AGAACCGGAT CGCCGAGGCC CGCGCCCAGC AGCGGGCCGC CACCCGGGTC
GCGACCCTCG ACGACCGGGC CGAGGCCCTG CGTGTCGTGC TCGCCAGCGC GCCGACCGCC
GTCGAGGTCG CGGCCGCGCT CGCCGAGCTG GACCGGCTCG CCGCTACCGC CGACCAGGCC
GACCAGCGGC TGCGCGCCGC CCGGGCGGAG GCCGACGACG CCGAGCGGGC GTTCGAGCGG
GCCCGGGACG CGCTCGGGGC GGCCGGCGGC GCGCTGGCCG CGGCCCGCGA CGGCCTCGTC
CCGCTCGGCG CGCCCGCCGT CGACAGCGAG GACGTCCTGG CTGGCTGGAC GGCGCTGACC
GACTGGTCCC GCACGCAGGC CACGGCCCGT GACCGCCTGC TCGCGGCGGC CACCGCCGAG
GCCGAGGACG CCCGCCGACG GCACGACGAG GCGGACACGG CGCTGGTCGA GCTGCTCGCC
GCCCACGACC TGCCGCCTGG CCCCGGTTCC AAGGCCGCCG ACGGTGCGTC CGTCGCAGTG
GCGGACGGGC TGGCGGGGGC CCGCGCGGAG CTCGCCCGAG TCGTCGAGCG CCGTGCCGAG
GCCGGCCGGC TCACCGCCGA GGTCGCGACG GCGCGCGAGG CGGACCATGT CGCCCGCCAG
CTCGGCCAGC TGCTGCGCTC CGACCGGTTC CCGCGCTGGA TGGTGGCCGC CGCGCTGGAC
ATCCTGGTCG AGGAGGCGTC GGTGACACTG TCGGAGCTGT CCGGTGGGCA GTTCGAGTTG
ACCCACGAGG ATGGCGAGTT CGTGGTCGTC GACCATGCCG ACGCCGACTC GCGCCGGCCG
GTGCGGACCC TGTCCGGCGG TGAGACGTTC CAGGCCAGCC TGGCGTTGGC TCTCGCGTTG
TCCTCCCAGC TGCGGGCGAT GGCGGCCGGC GGCGCAGCCA CGCTCGACTC GCTGTTCCTC
GACGAGGGCT TCGGCACCCT CGACGAGGCC ACCCTGGACG TCGTCGCGAG CACCCTGGAG
AGCCTCGCGA GCGGCGGCGG CCGGATGGTC GGCCTGGTCA CCCACGTCCA GGCGCTGGCC
GAACGGGTCC CCGTACGCTT CGCGGTGAAC CGGGACCAGC GGACGTCGAC CGTCGTCAGG
GAGCGCACTT GA
 
Protein sequence
MRPVQLDLAG FGSFREPAVL DLTDADYFAL VGPTGAGKST LIDALTFALF GSVPRWNNRA 
TVHLALAPTA ARGTVRLVFD AAGARYVVAR ELRRAARGGV TVRNARFERL VDPAGAGGPG
EPTDVLAAGA PAVTAAVEEL LGLTFEHFCT CVVLPQGEFA EFLRAKPSER QAILTRLLGL
GVYDTIRAAA NARASDQRQR ADVLAEQLAG YADATAAAEA AAAGRVGALT RLAAEVAAAL
AEVARLDTAA AEAAARTARL GAERDQLAAV EPPAGLVDLT GRLRAARAAD ADARRRRQEA
ERADSAARGD RAAAQPRAPL DRYARDHREL ADLLATRPAA VTEHAEADAA LARADAELGA
ARAALAAATA AERDAEQAAV SAAAELSRLD SELGLLADVR APAGLAALTG RLRGAEAARA
EALERREAAE RADTAVREER AAAPSRGPLE RALRDHELLV ELLAGQAAAA AAHADVQAAL
GRADAALEEA RHTRHRAAGA RDDAARADLA AALRPHLAVG EDCPVCARPV ATLPPPLPAA
GLGTADEAVT AAEREVTARA AEATAATRAA AESEARLADL TRRVEELRRA VADAPDARRA
AGLLADVERL DGAVARADLA LRRARDAAAA ADTVVTRSRD GAAGARRALD AVRDPLVSLG
APGLAGTDLE ADWATLVTWA AELARDRTGQ RDAAAERSAA ATASLAAARQ ALALASAGQA
RADENRIAEA RAQQRAATRV ATLDDRAEAL RVVLASAPTA VEVAAALAEL DRLAATADQA
DQRLRAARAE ADDAERAFER ARDALGAAGG ALAAARDGLV PLGAPAVDSE DVLAGWTALT
DWSRTQATAR DRLLAAATAE AEDARRRHDE ADTALVELLA AHDLPPGPGS KAADGASVAV
ADGLAGARAE LARVVERRAE AGRLTAEVAT AREADHVARQ LGQLLRSDRF PRWMVAAALD
ILVEEASVTL SELSGGQFEL THEDGEFVVV DHADADSRRP VRTLSGGETF QASLALALAL
SSQLRAMAAG GAATLDSLFL DEGFGTLDEA TLDVVASTLE SLASGGGRMV GLVTHVQALA
ERVPVRFAVN RDQRTSTVVR ERT