Gene Franean1_1746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1746 
Symbol 
ID5675686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2092698 
End bp2095937 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content77% 
IMG OID641240667 
Producthypothetical protein 
Protein accessionYP_001506090 
Protein GI158313582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000886792 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGAACG ACGAGTCGCC GAGGTCCCAA CCGCCGTCCA GGGCTGGCCG TGCTGCCACC 
GGCGGTGGCG GCGCGCGGCG CACCGAGGGC CGCCGTGACG CGCCGGGTGC CCGGCGTGGG
CCGTCCTCGT CCGGCTCGTC CGGGCGGGAT GCCGGTGGCG AGGCCCGGCG CTCCGGTCGG
GTCACCAGAG GCGATGCGGG CCGGACGGGG TTCGTCCGGC GTGCTGACGG TGGGCGAGGC
CCCGCGCCAG CTGGCGCCTC GGCGGCGGGG ACGGGGGCGG GTGGTCGTGG TGCCCCCGCG
TGGCAGGGCA CCGGGCGTGC CGGCGGTCCG TCCCGGCAGG GAGGTCCCGG CCAGGGCGCC
GGACGTCCGA AGGCCGCCTG GCGTCCCGCT GGCACAGGCC AGGGGCGTGG TGGCGCTGGC
GGTTCCGGTG GACCCGGCGG CGAGCGGTGG GGTCGTCCCG CCTCGTCGCG TGAGGGTGGT
CCGCGCGAAG CGAGCGGCGG GCGGCCTGCT GGAGACCGTT GGTCCGGCGG TGCGGGCCGG
CCGGGAGCGC ACGGTCGGCC GGAAGGGCAG GGCCGGTCCG AGGGGGGCCG GCCGGCCCGC
CGCTGGGAGG GCTCCGGCCG GAGCGCGCCG GCCTCGGATT CGCGGGGCCG GCAGTGGTCG
GGGAGCCCGT CCCGCGAGGC GGGAGACCGC CCCAGACAGA GTGACAGGCC GCCCTATGAC
GCGGCCCGCG GCACCACATC CCGGCACGAC GGCCCTCGTC ACGACGGGTC CCGCCATGAC
GGCCCTCGTC ACGACGGGCC TCGGCATGAC GGGCCCCGCC ACGGCGGAGG TCGTGACGAG
AGTGGCCGCG GTGATGCTCG GCGGGATGGC GGTGGCTGGG CCGGCGGCGG CTCTCGAGCC
GGTTCGTCAC CCGACCGCGG TGGGCAGCGC AGCTACGGAG ACGGCGGTGG ACGACCGTCC
GGTGGACGAC CGACGTCGGG CCAGACGGCG CGGGGCTCCT CCGGGCGTCC GTACGGTGCC
GGGCGGCCGG TCGGCGGTGG GCGTGACGGT TCCGCGGGCC CCGGGAACGG CGGCTCGGGC
GAGAACCGGT TCGAGCGGCG CTCACGGGCG CAGGGCTCGG GCCAGTGGCG ACCCTCCTCG
GGGGACGCGG CGACACGGGG CGCGGCAGGT CCGCGTAGTG GGATTCGACG TGACGGCCCG
GGCAGGTCCG ACGGGACGGC GGCCGGCCGC GGCCCGGGTG CCGACCGGCG TCCCCCGTCG
GCCGGGTCGC GGCCCTGGCA AAGCGGTGCG ACCGGCGATG CCCGGACCTC CAACGGCTCC
GGGTACGACC GTGGTCACCG GTCTGATCGC GCCTCCGGAC CCGATCGTGG TTACCGGCCC
GATCGTGGCG CCGGGTCCGA CCGTGGTTAC CGGTCTGATC GTGGCGCCGG GTCCGATCGC
GGTGACAGCC GTCGGAGTGG CCCGCCCAGC GGGGGCTCGT CCGCACGGCC CTACACGCGG
CCGACGCACC GTGAGCAGCC CGCGTCCGGC ACGGACCGGG TAGATCGGGC GGAGCGCACG
GACCGACCGG ACGCTCCGGG CACCGGTGCG GATCGGGCCC GTCCCGACCA GACCCGCTCC
TATGAGGCCC GTGCGAACCA GGGCCGCCCG GACGAGGGCC GTGCGGAGCA GGGGGGCCCG
GACCGGGGCC GCCCGGACGT GGGCCGCTCC GGTCAGGGCC GTGCGGACAC GGGACGTCCG
TATCAGGAAC GGCCGGATCG GGCGGCTCGG CCGGATCGAT CCGGTGGGAG CCGGTTCGAC
GCGAACCGTG CCGGCGCAGG GGGCTCCTAT GCGGGTCGCC GTGGTCCGGA CGCCCGCTCG
GGATCGAGCG GTGGACCTCG GGCCGGCCGT GGTGGGGTGA GTGGCGGGCC CAGCCGTTCG
GGCCCCGGTA CGCGTCCGGC CGGCGGAGCC GGGGGGCGAT CCTCCGCGGG GTCGGCCGGC
CGGCCGAACA GGGACGACGT CACGGCGCGC CGCAGGCCGC CCGCACCTGC GCTGCCGGAC
GAGGCCAAGG CCGAGCTGCT TGATCGCGAC ATTCGCCGTG ACCTGCGCAG CGTGCCCGCG
CCGCTGGCCG AGACGATCGC CCGGCACCTC GTCGCGACCG CGCTGCTCGT CGACACCGAC
CCGGTGCAGG CGCTGGCCCA CGCGCGTGCC GCGGCGGCCC GGCTGCCCAG GATGGCCGCG
GTGCGGGAGG CCGTGGGAGT CGCGGCCTAC CACGCCGGCG AGTTCGCCTC CGCGCTGCTG
GAGCTGCGCG CGGCCCGCCG CATCGACGGC TCGTCCCACA ACCTCCCGCT GATGGCTGAC
GCCGAACGTG GCCTCGGTCG GCCCGAACGC GCGATCGACT ACCTCTCGGA CCCGGGTGTC
GCGGCGCTGG ACGCCGCCGG CCGGGCCGAG CTCCTGATCG TCGTGTCCGG TGCCCGGCGC
GACATGGGTC AGCCGGAGGC GGCCGCGGTG CTGCTGCGCG ACGAGGTGAC CGCCAGAACG
GAGCCGAAGC CGTGGACGCC CCGTCTCTGG TACGCCTACG CCGAGGCGCT GCTGGCGGCC
GGTCGCACGA TGGAGGCGCT GCGCTGGTTC ACGGCCACGG CCGGCATCGA CGAGGACACG
ACCGACGCGG CCGAGCGCGT CTACGAGCTC ACCATCGATG ACGAGACCGA GATCGACAAC
GGGGACGGCG GCGCCGAGGA CAACGGGCCG GAGAGCAATC GGCTCGAGGA CGAGCTGCTC
GGTGACGAGC GACTCGAAGA CGACGGGCCC GCCGCCGAAA CCCACGACGG CGCGACCGTA
GACGAACCGG ACCCTGACGG CCTGATCGCC TCCGGCACAG GTGCGGCCGT TGACGCCGTG
GCAGATGACG CCGCGCTGGC CGACGCCGAC GATGCCGGTG ACGCGGCCGA CGATGCCGGT
GACGCGGACG CGGAAGCTGA GGCCGCCGAC CATGGCGACA CGGGTGGCGC CAGTGACACC
GGGGCAGCGG TCGATGTGAC CGTCGACGGC GAGCCGGCGG CACCGGTCGA GCAGCCGGCG
CCGGCGGAGC CCGCCGCGGC CGTGGAGCCC ACCGAGGTCG GCTTCTCCGC GGCCGAGGAC
ACGAGCGCGG TGCCTGCCGC GGAGGTGACC GCCGACACCC CGCCGATTCC AGAGATCATT
TTCTCGGACG CCCCGGGCGG TGCGGACCAG GCCGGAGCGC ACCGGACATC CGAGGACTGA
 
Protein sequence
MPNDESPRSQ PPSRAGRAAT GGGGARRTEG RRDAPGARRG PSSSGSSGRD AGGEARRSGR 
VTRGDAGRTG FVRRADGGRG PAPAGASAAG TGAGGRGAPA WQGTGRAGGP SRQGGPGQGA
GRPKAAWRPA GTGQGRGGAG GSGGPGGERW GRPASSREGG PREASGGRPA GDRWSGGAGR
PGAHGRPEGQ GRSEGGRPAR RWEGSGRSAP ASDSRGRQWS GSPSREAGDR PRQSDRPPYD
AARGTTSRHD GPRHDGSRHD GPRHDGPRHD GPRHGGGRDE SGRGDARRDG GGWAGGGSRA
GSSPDRGGQR SYGDGGGRPS GGRPTSGQTA RGSSGRPYGA GRPVGGGRDG SAGPGNGGSG
ENRFERRSRA QGSGQWRPSS GDAATRGAAG PRSGIRRDGP GRSDGTAAGR GPGADRRPPS
AGSRPWQSGA TGDARTSNGS GYDRGHRSDR ASGPDRGYRP DRGAGSDRGY RSDRGAGSDR
GDSRRSGPPS GGSSARPYTR PTHREQPASG TDRVDRAERT DRPDAPGTGA DRARPDQTRS
YEARANQGRP DEGRAEQGGP DRGRPDVGRS GQGRADTGRP YQERPDRAAR PDRSGGSRFD
ANRAGAGGSY AGRRGPDARS GSSGGPRAGR GGVSGGPSRS GPGTRPAGGA GGRSSAGSAG
RPNRDDVTAR RRPPAPALPD EAKAELLDRD IRRDLRSVPA PLAETIARHL VATALLVDTD
PVQALAHARA AAARLPRMAA VREAVGVAAY HAGEFASALL ELRAARRIDG SSHNLPLMAD
AERGLGRPER AIDYLSDPGV AALDAAGRAE LLIVVSGARR DMGQPEAAAV LLRDEVTART
EPKPWTPRLW YAYAEALLAA GRTMEALRWF TATAGIDEDT TDAAERVYEL TIDDETEIDN
GDGGAEDNGP ESNRLEDELL GDERLEDDGP AAETHDGATV DEPDPDGLIA SGTGAAVDAV
ADDAALADAD DAGDAADDAG DADAEAEAAD HGDTGGASDT GAAVDVTVDG EPAAPVEQPA
PAEPAAAVEP TEVGFSAAED TSAVPAAEVT ADTPPIPEII FSDAPGGADQ AGAHRTSED