Gene Franean1_0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0892 
Symbol 
ID5669306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1038925 
End bp1041597 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content77% 
IMG OID641239819 
Producthypothetical protein 
Protein accessionYP_001505254 
Protein GI158312746 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.608011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCG ACTCGGTACT CGACCTGCTC GCCGCGCAGA TGGCGCAGGA CGCCCTGCCG 
GCGCGGGTCC GTGACCTGCT ACTGGCCGCC CTGGACGGCG ACGGCCCGCT CGCCGTGGAG
CTCGCCCGAC GCGGCGGGAC GCCTGGTACC AACGAGCCGG GCGATGGCGC CACCTCAGAT
GACGGCACCG CCCCAGGCGG CGGCACCGCC CGGCTGGGCG GGACGGGGTC GGGTGAGCCG
GCCGTCCACC TGGAGTCGAT CGGCGTGCAG GGGTTCCGTG GGATCGGGCC GCTCGCGGTG
CTCCCCCTCC GGCCCGGGCC GGGTCTGACT CTCGTCACGG GACGTAACGG CTCGGGCAAG
TCCAGCTTCG CGGAGGCCGC CGAGATCGCG CTGACAGGCG ACACCCGCCG CTGGTCACGG
CGGGCGGCCG TCTGGCGCGG CGGCTGGCGC AATCTGCACA GCACCGGGCC CGCGCGGGTC
GAGATCACCT GCACGGTGGA GGGTCGCCCC GAGCCGGTCA CCGTCACCCG GACCTGGCCG
GAGGGCGCCG GTCTGGACGA GGGCGCCGGC GAGATCCGCG CCTCGGACGG CGGGGCGTGG
TCACCGTCCG GGTGGGACTG GGCTCTGTCC ACCTACCGGC CGTTCCTGTC CTACGCCGAA
CTGGGCGATC TCCTCGGCGG ACGTCCCAGC GCGATGTTCG ATGCGCTGCA CGCGATCCTC
GGGCTGGACG AGATGGTCGA CACGGCGCGA CGCCTGAGCG CGGCGCGCAG CCGGTTCACC
GAGGCCGCGC GCCGCTCGCG GGAGGAGCAC ACCGCCCTGC TCGCGGCGCT GCGGCGCAGC
ACGGACCCGC GGGCAGTCGA GGCCGCCGAC GCGTTGCGGG ACGCCGCACG TCCCGATCTC
GACCGAGCCT GGGCCGTCGC GGCGGGCCTG CCGGCGCCGA GCACCGCGCC GCCCGCGAAG
CCGGCGGTGC TGCGGGAGCC GGCGGTGCTG CGGGAACTGC TGGGGCTGCG GGAACTGCTG
GGGCTGCGGG TGCCGGCGGA GTCCGACGTC CGGGAGCTCG CCGTGCGACT GCGCGCGCTC
GCGACCGAGG AGACGAGGCT GGCGGCGACG GCGGCCGGCG ACGCGGCCCG CCTCACCGAC
CTGCTGGCCG CCGCGCTCGA CCACCACGAG CATCAGAACG CCGTCCACGC GGGTGGTCCG
TCATCTGAGG GTGCCGAGGA CGGCGATCGG GGCCGGGGTG GGAGCGGGAG CCGGCCGTGC
CCGGTCTGCG GGGTCGGGCG GCTCGACGAG CGGTGGTGGC ACGCGACACG GGCGGAGGTC
ACCCGCCAGC GGGAGCGGGC CGCGGCCGTG CTGGTCGTGC GCGCCGGCCT CACCGCCGCG
CTGGCCGACG TCGCGCTGCT GCTCGCACCC GCCCCCGAGC CGTTGACCAC CCCTGATCCC
CTGGCTCCCT CCGGGCCCCT GGCTCCCTCC GGGCCGCTGA CCGCCGGTGG GAGCACGGGC
GTGCTGGCCG CGCTGGCCGC GGCCGCGGGC ATGGCCTGGC GCCGGTGGGC ACGGCTGCGC
GACGACTCCG ACCCGCTGGC CGTCGCGGAC GGGCTGGAGT CCGCCCACCC TCCGCTGCTG
CGGGCGGTCG AGGCCCTGCG GGCGGCCGCC CGCGACGAGC TCGACCGCGC CGACGCCGAC
TGGCGTCCGC TCGCTGCCCG GGTCACCGCC TGGGTCGGGG CGGCGCGGGC GGTCGATGCC
GACGCCGAGG CGCTGGCCGA CGTCCGACAG GCTCTTGACT GGCTGCGGGA GGCGACCCAA
CGGCTGCGGG ACGAGCGGAT GCGGCCCTTC GCGCAGCGGT CGGCCCAGAT CTGGTCGATG
CTGCGCCAGG AGAGCAACGT CGATCTGGGC CCGGTGCGCC TCACCGGCAG TGCCAACCAG
CGGCGCGTCG ACCTCGACGT CACCGTCGAC GGGGTGGACG GCGCCGCGCT CGGCGTGATG
AGCCAGGGCG AGCTGCACGC GCTGGGGCTG GCGCTGTTCC TGCCCCGCGC GACCAGCGAC
GCGAGCCCGT TCCGCTTCCT CATCATCGAT GACCCGGTGC AGTCGATGGA CCCGGCGAAG
GTCGACGGGC TGGCGCGCGT ACTCGCCCAG GTTGCCGAGA CTCGGCAGGT CGTCGTCCTC
ACCCATGATG ATCGGCTGGC CGACGCGGTG CGCCGGCTGC GGCTGCCGGC GACGGTGCTC
GACGTGGTCC GTCGCGAGGG GTCGCTGGTG CGGCTGCGCG GGAACCTCGA TCCGGTCGGG
CGCCATCTCG CGGACGCGCG GGCGCTCGCC CGGACCGGTG ACCTGCCCCG CGACCTGGCG
ATGATGCTCG TCCCGGCGAT GTGCCGCTCG GCGGTGGAGA CTGCGTGCAA CGAGGTGGTC
CGGCGGCGCC GGCTCGGTGC GGGAGCGCGC CACGCCGAGG TCGAGGCGGC GCTCGCGGCG
GCGCACTCGG TGAGCGAGAA GGCCGCCCTG GCCCTGTTCG ACGACGCGCG CCGGGCCCGC
GGCGTCCTGC GCCGCCTCGA CGGCCACGCG CCGTGGGCCG CCGACACGTT CCGGGCCGTC
CGGGACGGCG TGCACGTCGG GTACGACGGC AACCTGCTGA GCCTGGTCCG CGACACCGGC
CGGCTCACGG ACCACATCCG GACCCTGGCT TGA
 
Protein sequence
MAPDSVLDLL AAQMAQDALP ARVRDLLLAA LDGDGPLAVE LARRGGTPGT NEPGDGATSD 
DGTAPGGGTA RLGGTGSGEP AVHLESIGVQ GFRGIGPLAV LPLRPGPGLT LVTGRNGSGK
SSFAEAAEIA LTGDTRRWSR RAAVWRGGWR NLHSTGPARV EITCTVEGRP EPVTVTRTWP
EGAGLDEGAG EIRASDGGAW SPSGWDWALS TYRPFLSYAE LGDLLGGRPS AMFDALHAIL
GLDEMVDTAR RLSAARSRFT EAARRSREEH TALLAALRRS TDPRAVEAAD ALRDAARPDL
DRAWAVAAGL PAPSTAPPAK PAVLREPAVL RELLGLRELL GLRVPAESDV RELAVRLRAL
ATEETRLAAT AAGDAARLTD LLAAALDHHE HQNAVHAGGP SSEGAEDGDR GRGGSGSRPC
PVCGVGRLDE RWWHATRAEV TRQRERAAAV LVVRAGLTAA LADVALLLAP APEPLTTPDP
LAPSGPLAPS GPLTAGGSTG VLAALAAAAG MAWRRWARLR DDSDPLAVAD GLESAHPPLL
RAVEALRAAA RDELDRADAD WRPLAARVTA WVGAARAVDA DAEALADVRQ ALDWLREATQ
RLRDERMRPF AQRSAQIWSM LRQESNVDLG PVRLTGSANQ RRVDLDVTVD GVDGAALGVM
SQGELHALGL ALFLPRATSD ASPFRFLIID DPVQSMDPAK VDGLARVLAQ VAETRQVVVL
THDDRLADAV RRLRLPATVL DVVRREGSLV RLRGNLDPVG RHLADARALA RTGDLPRDLA
MMLVPAMCRS AVETACNEVV RRRRLGAGAR HAEVEAALAA AHSVSEKAAL ALFDDARRAR
GVLRRLDGHA PWAADTFRAV RDGVHVGYDG NLLSLVRDTG RLTDHIRTLA