Gene Franean1_7293 details


Gene Information

Locus tag: Franean1_7293
Symbol:
ID: 5675594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8909911
End bp: 8913198
Gene Length: 3288 bp
Protein Length: 1095 aa
Translation table: 11
GC content: 77%
IMG OID: 641246130
Product: hypothetical protein
Protein accession: YP_001511518
Protein GI: 158319010
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3846] Type IV secretory pathway, TrbL components
TIGRFAM ID:
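
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates (not stated in this record) and a CDS that ends in a single stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(8909911, 8913198)
print(length_bp)                  # 3288
print(protein_length(length_bp))  # 1095
```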


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.35344
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGATG TGCCACAGGC ACCGTACGGC GAGGAAGGGG ACGCGCGGCC TGTTTCGTTG 
ATAGGTGCGC GAATCCCCTC ATCTACGCCG TCCGCGGCCG AGCTCCCGGC GCGCGATGGC
GAGAGCGACG ACCGCTCGGC GGCCAGGGCG GCCCGCCGCA GGAACCGCCG GCGCGGCGGT
CCTGACGATG CCTCCGCCAC CGACGATGGT GACGGAACGT GGCAGGACCA AGCGTTCCAT
CTAGAGGATG CGGTGTCCGG CGACCCGCCC CCCGCAGGCC GGTCCGGCAT GTCGGGATCA
GGCCGGACGG GCATGACCCG TGGCACCACC CCGGAGCCGC GCGAGCCCGG GCAGCGCCGG
GGGCGGCGCC TGCCGCGCAC GCCGCCGGCA GCCGGGCCCG TCCCGCCGAG GTCGTCTGTT
CCACCCGGCA CGCCCGTTCC GCCCGGCGAC GCTGCCCGCG GCGGCCGCGC TGTGCCGCCC
CGTTCGCCGC GCCCCGAATC GCCCAGGGCC GCTGACCAGG CCCGGCGGTT CGACGCGGGC
GCCGCGGCGG CGGTCGACTC GCCCGACGGC TCCCTCGACG GCTCCGCCCC GCCGCGCCGG
CCGATGCTGC GCCGGGCCGC CACACCCCGC GCCGTCCCGC CGCCGGCGCC GATGGGCCGC
AAGGATCCGA GCTTCCCGCC CGCGACCTCG GCCGCGGCGG CAGCGACCGC CGCGGCCGCC
GCCGGCACGG GCACCACGGG CACCACGGCC GCTCCGGGCA CCCCCGGCGG CGCCGGAGCC
GCCGGGAGCC CGGGCTCGTC AGCCGGCTCG GGTGGGTACG CGGGCTCGAG CACGCCCGGT
AGTACGGCCC GGCCTGGTTC CGGCGGGTCA GGTGCCGGCG GGCCCGGTTC CGGTGCTTCG
GGTTCCGGTG CTTCGGGTTC CGGTAGCTCG AGTGCCGGTG GTTCGGGCGC GATGCGGTCC
GGTGGCGCGC GGTCCGGTGG AACCGGCCGG GCCGCCGGCT CGGGTGGGTC CGCAGCCTCG
CGTTCTCAGC GTCCCGAGGC GCCGGAGGCG CCGGTCCGGC GGTCCGGCGG ATCGGGCGGC
ACCAGCGCGC CCCCGCCCAG CCGCCGGCCG ACGTCGGCCG GCCCGCGCCG TCCCGGGCGC
CCGTCCGCTC TCGCGCCCAG GACCCCCGCC GCCAGCCCCC CGCCGGCGGG GCCGGAGACG
GGATCGACCG CATGGCCCGC CCCGGCCACG GGATCGCCAC AGGCGGACCA CCCGGCGGCG
GGTTCACCAC CGGCTGATCC CGCCTCCGGC GGCTCGTTCG GCGAGCCCAC GCCGGGCATG
GGGCCAGGGC CGTTCGACGC GCAAGGGCCG TACGACCGGT CCGAGCCGCT CGCGCCCGGC
GAGGCAGCTG GCTCGCCGGC CCCGGCGTGG CCCTCGCAGG GCTACGAGCA GGCCGACGTC
CCGCTGGTGG ATCCGTCGCT CGGTGGTCTC CCCGTGGAGC TTGACGGTCT CCCCGTGGAG
TTCGGTGGTC TCCCGATGGA AAGCCGCTCG CCGGGCGGCG ACCAGCCGGG CAGCGGGTCC
TACTACGACC CCTGGGCCCC CCATCCGGCG CCGGGCCCGG CGCCGGATGC CGGGAACGAC
GAGTCAACCG GTGCTCCCGG GTGGGGCGCC GTCGACGCCT GGCAGGAGGC CGGCGAACCG
GTCAGCCTCG GGGAGCCCGC CCAGCCGGGC GAGGCGGCCC CGCCCGGCGA TCCCGCCGGC
GGCAGCGGCC TGGCCGAGCG CACCCGGCGG GGCGCGGCGG CCGGCGAGAC CGAGACCGAG
ACCGAGACCG ACGATTCGGC CGCGGCCGGA AGCTCGTCAA CAGGCGGAGC GCCGCCCGCC
GACGAGGCCG CGTCCACGGA CGAGGCCTGG GCTGGTCACG GCGTCGCGGA ACCCGAGCGG
ACGTCCGACG TGCTCCCGCG GCCCTCGACC GAGCTGGTTC CCGCCGGTCA CCTCCCGAGG
CCGGGCACCG GCGCCGACCG CGCGCCGGCA CTGACCGACC AGGGCCCAGG CCGGGGCCTG
TGGCCCTGGC AGGAGCCCTG GGCCGACCTC GACGACCTCG AGCCATGGGA GGAGATCCCG
CTACGGATCA CTCTCTCGGA CCGGGCCGCG GTGGCCCTGC CGGGCTCGTG GCGGATCGCC
GTGACCTCCC TGTTCCCCTA CTCCGGCACC ACCACGTTGG CCGGGGTCGT CGGACTTACC
CTGGCCGGTG TGCGTGCCGA GCCGGTGCTC GCGATCGATC TCTACCCCGG GGCCGCCACG
CCCGGTTTCG TGCCGCCCAC CTCGTCGGAG AGCGCCGAGA TCGACGAGTC GCGGGGCGAC
AACCTGGTCG CGCGGGTCGG CAGCCGAGGC ACCACCACGG TTGCGGACAT CGCCCGCCAA
CGCCGCTCCA CCGCCAAGGC GACGCCCGAC GAGCTGCGCT CGCTCGTCGG CGCGCGGCGG
TCCGGCAGCG TCTTCGACCT TGACGTGCTG CCGGTCGACC GCCCGGTCGG CGACGAGGAG
ACCACGAACG CGCTGGTGCC GGTCGACGAG CCGATCACCC CGGCCGTGCT GCGCTCCGCC
CTGGGCGCGC TCGGTCACGC GTACCCGCTG ATCCTGATGG ACGCCCCCGC GACCGCACCG
CTGACCCCGG AAGCGATCCG CGCCGCCGAT GTGATCCTCC TCGTGACCCT CGCGACCGCG
TCCGATCTAG AAGCCACCCT CGCTGACCTG CGCGATCCCC AGGGCTCACT CGCGGAGGTC
GGCGTGAGCG CGCGGGAGCG CATCCCGTCA GGCCAGGGCC CTGGCCACCC GCCGCACGAG
CGGACGGGTC CCGCGGTGAT CGCGGCCGTC GTGTCGCCCC GCCGCGGCCG CCCGTCACCA
CGCACACGCA CCGCCACGGC GCGGCTGGCC CGCCACGTCG ACGGCATCGT GCGGGTCCCG
TACGACCCGC GGCTCGACCC GAGCAGGGGG ACCCCGGTCC GCATCCCGCG GCTGCGGTGG
GCGACCAGGA GGTCCTACCT CCGGCTGGCG GCCGAGACCG TCGACGCGCT CGCCGGCATC
GCCAACGCTG ATGTCGGGAC ACCGGCCGGC GGAGAAGATC CCTCCTTTCA ACCGCAGTTC
GAACGACCGA TACGCACCGC TCTGGTGTTG CCACGCAACG CCGATACTGA TCACGCAGAG
GTGTCAGGCG TATCATTTGG CGACCTGCGG CACGGAAGCA CACCCACGCG GGTGTCCGGG
CCGGATCGTC CGGGCGGCCA ACCGCCCGCA GGAAGGGAAC CACGATGA
 
Protein sequence
MSDVPQAPYG EEGDARPVSL IGARIPSSTP SAAELPARDG ESDDRSAARA ARRRNRRRGG 
PDDASATDDG DGTWQDQAFH LEDAVSGDPP PAGRSGMSGS GRTGMTRGTT PEPREPGQRR
GRRLPRTPPA AGPVPPRSSV PPGTPVPPGD AARGGRAVPP RSPRPESPRA ADQARRFDAG
AAAAVDSPDG SLDGSAPPRR PMLRRAATPR AVPPPAPMGR KDPSFPPATS AAAAATAAAA
AGTGTTGTTA APGTPGGAGA AGSPGSSAGS GGYAGSSTPG STARPGSGGS GAGGPGSGAS
GSGASGSGSS SAGGSGAMRS GGARSGGTGR AAGSGGSAAS RSQRPEAPEA PVRRSGGSGG
TSAPPPSRRP TSAGPRRPGR PSALAPRTPA ASPPPAGPET GSTAWPAPAT GSPQADHPAA
GSPPADPASG GSFGEPTPGM GPGPFDAQGP YDRSEPLAPG EAAGSPAPAW PSQGYEQADV
PLVDPSLGGL PVELDGLPVE FGGLPMESRS PGGDQPGSGS YYDPWAPHPA PGPAPDAGND
ESTGAPGWGA VDAWQEAGEP VSLGEPAQPG EAAPPGDPAG GSGLAERTRR GAAAGETETE
TETDDSAAAG SSSTGGAPPA DEAASTDEAW AGHGVAEPER TSDVLPRPST ELVPAGHLPR
PGTGADRAPA LTDQGPGRGL WPWQEPWADL DDLEPWEEIP LRITLSDRAA VALPGSWRIA
VTSLFPYSGT TTLAGVVGLT LAGVRAEPVL AIDLYPGAAT PGFVPPTSSE SAEIDESRGD
NLVARVGSRG TTTVADIARQ RRSTAKATPD ELRSLVGARR SGSVFDLDVL PVDRPVGDEE
TTNALVPVDE PITPAVLRSA LGALGHAYPL ILMDAPATAP LTPEAIRAAD VILLVTLATA
SDLEATLADL RDPQGSLAEV GVSARERIPS GQGPGHPPHE RTGPAVIAAV VSPRRGRPSP
RTRTATARLA RHVDGIVRVP YDPRLDPSRG TPVRIPRLRW ATRRSYLRLA AETVDALAGI
ANADVGTPAG GEDPSFQPQF ERPIRTALVL PRNADTDHAE VSGVSFGDLR HGSTPTRVSG
PDRPGGQPPA GREPR
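
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, under these assumptions: translation table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits, the sequence is a complete CDS so its first codon (here GTG) is read as methionine, and a trailing stop codon is dropped. The helper names `clean`, `gc_content`, and `translate_cds` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; assumes the first codon is the start codon."""
    dna = clean(dna)
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    if protein:
        protein[0] = "M"  # alternative starts such as GTG initiate with Met
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGAGCGATG TGCCACAGGC ACCGTACGGC GAGGAAGGGG ACGCGCGGCC TGTTTCGTTG"
print(translate_cds(first_line))  # MSDVPQAPYGEEGDARPVSL
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.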