Gene Franean1_6781 details


Gene Information

Locus tag: Franean1_6781
Symbol:
ID: 5675094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8253492
End bp: 8257184
Gene Length: 3693 bp
Protein Length: 1230 aa
Translation table: 11
GC content: 62%
IMG OID: 641245630
Product: WD-40 repeat-containing protein
Protein accession: YP_001511021
Protein GI: 158318513
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
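
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 8253492
end_bp = 8257184
gene_length_bp = 3693
protein_length_aa = 1230

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue
# plus one terminal stop codon.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 3693 bp, which is 1231 codons, or 1230 residues plus a stop codon.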


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.57679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGGG TGACCAGCGG TGTGCGGGAG CGCTCCGGGC CCATCGCCGT GGTAGGCGCC 
TCCGGAACTG GCAAGTCATC ACTGCTGCAT GCGGGACTGC TGCCCGCGCT GGCCAGTGGC
GAGCCGGGCG TTCCTGCTGC GGCTTCCTGG CCTTACCTGG TGATAACCCC GGGTGCCAAT
CCGATCGGTA ATCTCGCGGC CAGAATCTCT ATGTCGGTCG GAGTTTCTTC CCATGAGGCT
GAACAGATTC TCCGGTCTTC GCCAGCAGGC CTGGAGCGAC TGGTTGATCA GCTTCTCGTG
GACTCGGGAG GTCAACCGCC TCGGCAAGCC CGCCTCCTGC TCATAGTAGA CCAGTTTGAA
GAACTTTTTC TGCTTTGTGA CGACGAGGAT GATCGTAAAC TATTCATCAG GGCGATCACA
GCTGGCAGTC CGCAAGCTTG CGAGAATGAG GAACCAGTTG AGCTACCGAT GTCGGATCGT
GATGAATGTT TATTCACTAC GATACTCGGC ATTCGCGCCG ACGTCTTCGG TGCCTGTTCA
CGGTATCCTG AGCTGGCCGA GTCGCTGCGG CGTCCACTTT TGGTGGGCCC GATGTCTGAG
GAAGACATCC GACGGACTCT CGAAGGTCCC GCCCAAGTGG CCGGCCTGCG CTGGGAGTCG
GGGCTAATCG AACACATCCT CGCTGAGTTT CTGGAACGAG GTCGGTCCGA GGCTGGGACC
TTGCCACTCT TGTCGCACGC AATGCGGCAA ACCTGGATCC ATAGTGACCG TAGCACGCTA
CTCTTCGCCG ACTACCATGA GTCAGGCGGA GTGCGACGCG CGATCGCCAC GACTGCAGAA
TCCTGCTATG CCGGAATGGG CGATCTCGAA CAGGATGTTG CTCGCAATAT TTTCCTGCGA
ATGGTGCGGC TTCAGTCGAC CGACGAGCCC ACCAGACGAC GGCTCCCACT GACGGATCTC
CCGCCGACAG GGGCTCACCA GCGCGTTCTT GGACGGCTTG TGCAAGACCG GTTGGTGAGC
GTCGATCGCG GCTCCTGCGA AATTGTGCAC GAGGCGCTGC TGCGGGAATG GCCTAGGCTC
CGCGACTGGA TCAGTCGTGA CCAGGAGCGT CTTCGCGCGC GAGATGAGCT GGTCGGGCAC
GCGGAGGCAT GGGAGCGGTC GGGCCGCGAC GATTCACGGC TTTACCGAGG GGCCCAGCTG
GGACGCATCC GCGAACGCCT CGCGACACCG AGCTCGAGCG AGGAGGATCT GCCCGAGCCT
GGGCCGTCGT TCCTCGCCGC GTCGGTCGCT CACCAGGCCG AGCTCGACGC CGAAGCGGAA
GCACGGGTGC GGGGGGACCG CCGGACCAGC CGACGACTGC GCCAACTCAC GGCAGGCCTA
GCCATTCTGA CGGTCCTCGC GGTCGCCGGC GCGTCTGCAG CCATTGTGCA GCGCGGGAAC
GCCGTCGCCG AGCGCGATGC CGCCAGCCAG GCAAGCATCC TCTCGACCGC GGACGGGTTG
GTGGCCCGCG CCGATGCTGC CCGCGATGTC GATCCGCTGC TGTCTCTGCG GCTGGGTATC
GCAGCACGAG CGGTCGCAGA CAATACACGG ACCCAGTCCG GCCTCCTTGA GACGATTACC
GATTCCCCTT ACATCGCGTC GAGCTTCGCT ATCTCTGACC CGTTCCTGGA AGTGGCCGTC
AGCCCGGACT CCCATCTTGT TGCCGTCGGC GGAGTTAGTG GGAAAATCGA ACTCTGGAAC
ATCGTCGACC CGCGGCGACC GAAGAAGCAG TACGGGCCGT TCGGGCTCGG CGATGGAGTA
GCCATCACTC AGATCACCTT CGACCCGCAG TCGTCCGCGC TGGCTGTGGT GACGATCGAC
AAGGTGACGC TCTGGGACAT CAGCAATCCG ACGGCGCCCC GGCCGTTCGG TCAAGTCCAG
GAAGATGTCC TTAGTTTTGT CAAATTCAGC CCCTCTGGTG AGATTCTGGT CGTCGCATCC
CCGGGGATGA CATCATTGTG GGATGTGACC GTGCGTGACC AGCCGGAGCG GATAGGTGTG
CCGGTTGTTG GCAATGGTGG AGCTGTATGG GCTGTGACCT TCTCGCCAGA CGGACGACTC
CTCGCTACCG GCGATAATAT CGGCTATGTG CTCGTTATGG ACGTCAGCGA TCCGAGATCT
GTGGCGACGC TGGGGCGCTT CCCTTCTCCG CAGGGGACGG TGACCAGTCT CGCTTTCGTC
GGGGACGACG AACACCTGGC CGTGGGAGCA GGAAAATACG ACGGAAATAT CATCATGATG
GACATCGATA ATCCTGGCGC CCCGATCGTT ATAGAAAATA ATCCGTTACG CCATTTCACC
AAACGGATCG TGGGTCTTGA CACCGCCAAG GGATCAGATT TCCTTTTTGC TGCCAGCGAG
AACGGAGAAC TGGTTGCCTG GAATGTCTCC GACTCGGAGG TACCGTACCC GGTCCAGTCG
TCGTTCGCCG GGCACACCCA GAGTATCTCC GGCCTGGACT TGGCCGAGGA TGGGAGGATC
CTCGCGACGG CCGGTGTTGA TGGGACAACG ATCTTGTGGG ACATCGCCCA ACCGACCGAG
CCCCGACAAG TCGCAGGAAT CACGACCGAG CCCGAAGTTG GCTACAGAGC GCTCGCGACC
AACCGCAGCG GTACGCTCCT GGCTGGCGGA CGCAGCGACG GCTCGACCCT CTTGTGGGAT
ATCACCCATT TGGATAAGCC GCGTCAAATC ACGACCATCA ACGCGGGATC GGAAGTCAAT
AGCCTTGCAC TTACCGGTGA TGGCGGCATG CTGGCGGTTG GACATGTCAA CGGCGATGTC
GCGCTATGGA ACATCCGTGA CGCGGAACAC CCTGAACTCA TCACCACCAA ACGCTCCGGG
CTCGGGCTAG TCACGGCGGT AGAGTTCAAC AAACCCGGCG ACCTGCTCGC TATCGGTGCA
GTGTCTGCCA CCGGCAGCCG CGTTGATGAC GCGAGCGTGA CACTGTGGAG GGTCAACGAG
CGGGAGCCAT CCGCTATCGA GCTCTACCGG GGTTCGGACG CCCCGCAGTC ACTCTCCTTC
TCTCCAGATT CAGATGCAAT TCTGGTCGGC TTTGAAGGAG GAACCTCGCG ACTGTGGCAG
CTGCCAGCGA ACTCCACAAC TATGGTCACC GAGCTTCCCG GGCACAGTTT CGAAGTAGCT
GCTACTGACT TCGACGATCG CAGCAAGCTG GCCGCCACCG GGAACGGCAA TTCTAACGGC
AAGGTTCTGC TCTGGGACGC CACAGATCTG CAGGCTGTTC ATAGGGTCGG TGATCCATTG
ATCGACCCAG ACCAGGAGGT AATAACCGAG ATCGAGTTCA GCCCGGGCGA GTCGATGTTG
GCGGTCGGAG GCACCCGCCA GTACGACATG GGCCAAGCTG TCAGCTCGGT CCGGCTCTGG
AACCTCGATG AGGCATCCAG GCCACGCTCT CTCGGAACAA TTCACGTGAG TCCCACCAAC
AACCTCTCTG ATCTCGCTAT CGTGCAGCGC GACGGAGCAA TCGTCACCGC CACGGCCTTT
CCCGAGGGAG GGATCCAAAT CTGGGACGCT AAGGTACTTC GCACTGTGCG AGAAGATGCG
ATCAAGATCA GCTGTCAGCG GGCAGAACGC GGTCTTAACG AAGCGGAATG GGCCCAGTAC
GTACCCGGCC TCCCCTACCG CGCCACCTGC TGA
 
Protein sequence
MERVTSGVRE RSGPIAVVGA SGTGKSSLLH AGLLPALASG EPGVPAAASW PYLVITPGAN 
PIGNLAARIS MSVGVSSHEA EQILRSSPAG LERLVDQLLV DSGGQPPRQA RLLLIVDQFE
ELFLLCDDED DRKLFIRAIT AGSPQACENE EPVELPMSDR DECLFTTILG IRADVFGACS
RYPELAESLR RPLLVGPMSE EDIRRTLEGP AQVAGLRWES GLIEHILAEF LERGRSEAGT
LPLLSHAMRQ TWIHSDRSTL LFADYHESGG VRRAIATTAE SCYAGMGDLE QDVARNIFLR
MVRLQSTDEP TRRRLPLTDL PPTGAHQRVL GRLVQDRLVS VDRGSCEIVH EALLREWPRL
RDWISRDQER LRARDELVGH AEAWERSGRD DSRLYRGAQL GRIRERLATP SSSEEDLPEP
GPSFLAASVA HQAELDAEAE ARVRGDRRTS RRLRQLTAGL AILTVLAVAG ASAAIVQRGN
AVAERDAASQ ASILSTADGL VARADAARDV DPLLSLRLGI AARAVADNTR TQSGLLETIT
DSPYIASSFA ISDPFLEVAV SPDSHLVAVG GVSGKIELWN IVDPRRPKKQ YGPFGLGDGV
AITQITFDPQ SSALAVVTID KVTLWDISNP TAPRPFGQVQ EDVLSFVKFS PSGEILVVAS
PGMTSLWDVT VRDQPERIGV PVVGNGGAVW AVTFSPDGRL LATGDNIGYV LVMDVSDPRS
VATLGRFPSP QGTVTSLAFV GDDEHLAVGA GKYDGNIIMM DIDNPGAPIV IENNPLRHFT
KRIVGLDTAK GSDFLFAASE NGELVAWNVS DSEVPYPVQS SFAGHTQSIS GLDLAEDGRI
LATAGVDGTT ILWDIAQPTE PRQVAGITTE PEVGYRALAT NRSGTLLAGG RSDGSTLLWD
ITHLDKPRQI TTINAGSEVN SLALTGDGGM LAVGHVNGDV ALWNIRDAEH PELITTKRSG
LGLVTAVEFN KPGDLLAIGA VSATGSRVDD ASVTLWRVNE REPSAIELYR GSDAPQSLSF
SPDSDAILVG FEGGTSRLWQ LPANSTTMVT ELPGHSFEVA ATDFDDRSKL AATGNGNSNG
KVLLWDATDL QAVHRVGDPL IDPDQEVITE IEFSPGESML AVGGTRQYDM GQAVSSVRLW
NLDEASRPRS LGTIHVSPTN NLSDLAIVQR DGAIVTATAF PEGGIQIWDA KVLRTVREDA
IKISCQRAER GLNEAEWAQY VPGLPYRATC
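
The sequences above are printed in space-separated blocks of ten characters, so the grouping has to be stripped before any computation. As a hedged illustration, the sketch below shows one way a GC percentage like the one in the GC content field could be computed from the gene sequence; `gc_percent` is a hypothetical helper, not a function of the source database, and its rounding may differ from the one the database uses.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for 10-base grouping.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGGAGCGGG TGACCAGCGG TGTGCGGGAG CGCTCCGGGC CCATCGCCGT GGTAGGCGCC"
print(f"GC content of first line: {gc_percent(first_line):.1f}%")
```

Passing the full gene sequence, with all lines joined, gives a whole-gene value to compare with the GC content field.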