Gene Franean1_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0791 
Symbol 
ID5669207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp918872 
End bp922693 
Gene Length3822 bp 
Protein Length1273 aa 
Translation table11 
GC content77% 
IMG OID641239719 
ProductO-antigen and teichoic acid-like export protein 
Protein accessionYP_001505155 
Protein GI158312647 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.750231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.326449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGGCG TGACCGATCT CGGTATGGAG CCGAGGGCGG CCGGCGCGGA CGACGCGGAC 
CTCCCGCCCG AGCCGCCCGC CGGCGACGCC GACACCACGC AGTACGTGCC GCGCGCCCCG
CGTCAGCCGC GCACCGGGCC GACCGGCCGG AACGACGAGC CCCAGCCCCG GCCTGGCGCG
ACCGATCCCC GGCAGCAGGG GGTCCACGTC GGCACTCCGC TGCCGCGGAT CGTCCTGCCG
GGCACGCCCG ATCCGCGGGG CGCCGCCGAT CCCCGCGGTG GTGCTGACCA GTGGGGTGGT
GCTGACGGGC GGGCTGGAAC TGATCCGCGC GGGGGGCCGG AGCAGCACCG GCCGGCTCCC
GAGCCGTTCG TGGTGGAGAG CCGATCCGGC TACCGGGACG GGCGCACCCC ACTCCCCGGG
CGCACTCCAC CAGCCGGGCG TACTCCGCCC CCGGGCTACC GGCCGGAGGC GGGTCACACT
CCGCCCCCGG GGCGCACGCC AGCCCCGGGG CGCTCCGAGC CGTACGGGCG TACCCCATCC
TCTGGCGCGA CACCGTGGCC AGGCCATGCC GCGCCACCCG CGCACACCCC GCCGCCGGGA
CGCACCCCGC CGCCGGGCGG CGCCGCGCCG TTCGAGCGGG CTCCGGCCCC GGGGAACACC
CCACCGGCCG GGCAGGAGCC TCCCGGTCAC ACACCGGCAG GGCACGTGCC TCCCGGGCGC
ACGCCGTCCG GGCGCACGCC GCCGCCCGGC TACACCCCCC GTCCGGGCGG TACGCCCCTG
CCGGGCGTCC GGCCGCCGGA CGGGACGCCG CCGAGCCAGG GCGCGGCGCG CGCGCTGACG
CCGCCGCCGG GCTGGACCCC CATGCCCCGG CACTCCGCTG GCAACACGCC CGCCCCCGGC
GGCACCCCAA CCGACCCTGC TGCCCGGCAC GACCAGCCGG GACAGGGCGA GCGGGGCGCC
GCGCAACGGG CGGGCGGGGC GCCCCCGTCC GGCGGATGGG TTCCGCCCGA GACCACCGTC
CGCATGGGCA CCACCGGGCC GCAGGCACCG CCTCCGCCGG GAGGCGTGGC CCGTCAGCCC
ACGCCGAGGC CCGCTCCGCC GAGCGGCTTC ACGGTGGGCA CGCCACTGCC GGGGCGCCCG
GCCGGCACCC CGCCGCCCGG CGCCGGGCCG GCGCAGACTC CACCACCGCA GATGTCACCA
CCGCGGACCT CGCCACCCGT CCCGTCGCCC GGCCTGCCGC CGGCGGGCCG GCGGGGCGAT
GACTTCATGC GGGTCGACGT CGCCGGCACG GGTGCCACCG CACCCCCGCC GGCGACCCAG
CTCCCGTCCA CACCGCCCGC AGCCCCCCAG GCCCCGGCCG CCCACCCGCC TGCCGGGCAG
TCGCCGGCGG ACCAGCGCCA GGCCCCGCCG CCGCCCGCCG CACCGCGTTC CGTGGGAGTC
CCCCCCGAGG GGCGCCCTGC TGGCGCGGCC GGCGCGGGGG ACACGCGGCG ATCGGGGCGC
GGATGGTCGG CGACGGGGCG CCGGGAGCCG CGCCCACCGG GCCGGGGCGG CGGCGCGGCC
GGATCGGACC AGACCGCGCT GACCAGCCGG GCCGGCGACG GCCGTGGCCG GCAGGGCCGT
GGCGGGCGCC GCGGGCGGCA GGGCGCGGCC CGCAACCCCG ACATCCACTT CATCAACGCG
TTCACCGACA CCATCGACCT GTCGTCGCTG CGGCGCCGGC TGGAGATGGA GGACGGTGCC
GCCGCCGTCG ACCAGACGGA CTACATCCCG CGTCTCTACG GCCGTAACCG GCCGGAGTCC
GACGCCGCGA GCACGCAGGT CATGCGGCGC TCGGGGCCCG GAACGGGGCC GGGGCAGGCC
GGTGGCCGGT GGGCCGGCCC GGCCGGCTCC GGGAGCGCGG CCGAAGCGGC CGCGGGCGAG
GGCGACCCGG CGGACGGAAC CGTCCGGCCC GGCCGGGCGC GCCACGCACG CGGCGGAGGT
TCCGGCACCG GGGCGCTGGA GGTGGCCGAG CAGCGTCCAG GCACCGGGCC GGCCGCCGCC
GAGGAGGCCG TCGTAGGCGG CGGGTCGACG AAGGCGAAGG CGATCTGGAC CCTCGCCGAC
CAGGGCGTCT CCAGCGCCAC CAACGCCGCC GTCTCGCTCC TGATCGCCCG GCAGGTGAGC
TCGTCCGAGT ACGGCTCGTT CGCGATCGCC TACATCATCT TCTCGGTGAT CATCGGGATC
TCCCGGGCGG GTGGCTGCCT GCCGCTGGGG ATCTCGTACT CGGGCAAGTC GGTGTCCGCG
TTCAGGTACG CGGCCGCGTC CGCCACCGGA GCGTGCCTGG TCTTCGGTGG ATTGCTGGGG
ATCGTGCTCG TCGGCGTGGG GGCCGTCGCC GGTGGGTCGG TCGGCTCCGC CCTCGTCGTG
GTGGGCTTCG TCCTGCCCGG CCTGCTGCTC CAGGACGCCT GGCGCTACGT GTTCTTCGCG
ATGGGCAAAC CCCTCGGCGC GTTCCTGAAC GATGTCGCCT GGGCCCTGGT GCAGGTGGTC
GGGCTGACGG TGCTCATCGA GCGGGGCGTG ACGGCGTCGC CGCCGATGCT GCTCGCCTGG
GGCATCTCCG CGCTGGTCGC GGCGCTGCTG GGGGTCGCGC AGGCCGGTCT GTGGCCGGCG
CCGTCGCGGG CGTTGACCTG GGTCCGGGAG AACCGGGCCA ACGCGGCCAA CCTCGCGGCC
GAGTTCGTCA CCGTCCAGGG CGCGCTGCAG GCGTCGATGC TGCTGATCGG GCTGCTCAGC
TCCAAGGAGA CGATCGGGGC GCTGAACGGC GTGCGGACGC TGCTCGGCCC GACCACCGTC
ATCGGTGTGG GCATCGTCAG CTTCGCGGTG CCGGAGCTCT CCCGCCGCAT CGACATGTCC
GTGCGGGCGC GCGAGCGCGC GGCGGTGCTG CTCACGGTGA TCGTGGTCGG CGTCGGCGGC
CTGTGGAGCC TGCTGTTCAT CGCATTCCCG GCCATCGGGG AAACGCTGCT CGGTGACACC
TGGCCCGGGG CGCACCACAT CCTCGTGTAC TCCGCGTTCC ACTACGCGGG GACGGCGCTG
CCGACCGGGC CGGCCTGCAT CATGTACGCG CTCGGCCGAA CCAAGATCAC GTTTCGGATC
AACCTGTCCA TGGCGCCGAT GCTCTTCGCC TTCCCGATTC TCGGCCTGCT GCTCGCGGAC
GCCACGGGCG CGGTCATCGG GTACAACCTC GTGTTCTGGG GCATCGCCCC GGTCTGGTGG
ATCCTGCTGC GTCGGATCGT CCGCGAACAC GCACGCGGAC GACCGGACGC CCGCGACGCG
GCCGAGATGT CCGTCCAGGT CGGGGCGCCC GAACAGGCCG AGACGTTCGA GGAGATCGCG
ACGTCCGACC GGGCCGAGAG GTTCGATCAC ACCGGGACGT CCGACCGGGC CGGCCTGCCG
GCCGGGAACG TGCGGGTCGT CGAGCCCTCC GGACGGGAGG GCCCCGGCCG GGAACGGGCC
CAGGTGTCGA GGCTGCCCGC GCTCGCGCCG CCACCACCAT CCGGTCGTCC GAGCCGCGGC
GAGGCGGTAC ATGCCAGGCC GGCACACGGC GGCGAGGCGC ACGGAACCTC CGTGCGCGGC
GGCGCGGGTG CTGGGGGCGC GGGCTCCGGT GGGGCGGCGC AGGAGGACGT CGAGCGGACC
ACGGTTCTGC AGGGCCCGCC CCCGCGCGGA AGCTCGGTTG ATGTGTCGCC GCAGGCCGGC
GAGCGGGACC GCCGTGACCC TGAGCCGCTG GACCTGCTCG AACCGGAACA GCGCGGCGGT
GCCGACGGAC CGCGAGGCGG CGGGCGAAAC CCTGGCCGCT AG
 
Protein sequence
MSGVTDLGME PRAAGADDAD LPPEPPAGDA DTTQYVPRAP RQPRTGPTGR NDEPQPRPGA 
TDPRQQGVHV GTPLPRIVLP GTPDPRGAAD PRGGADQWGG ADGRAGTDPR GGPEQHRPAP
EPFVVESRSG YRDGRTPLPG RTPPAGRTPP PGYRPEAGHT PPPGRTPAPG RSEPYGRTPS
SGATPWPGHA APPAHTPPPG RTPPPGGAAP FERAPAPGNT PPAGQEPPGH TPAGHVPPGR
TPSGRTPPPG YTPRPGGTPL PGVRPPDGTP PSQGAARALT PPPGWTPMPR HSAGNTPAPG
GTPTDPAARH DQPGQGERGA AQRAGGAPPS GGWVPPETTV RMGTTGPQAP PPPGGVARQP
TPRPAPPSGF TVGTPLPGRP AGTPPPGAGP AQTPPPQMSP PRTSPPVPSP GLPPAGRRGD
DFMRVDVAGT GATAPPPATQ LPSTPPAAPQ APAAHPPAGQ SPADQRQAPP PPAAPRSVGV
PPEGRPAGAA GAGDTRRSGR GWSATGRREP RPPGRGGGAA GSDQTALTSR AGDGRGRQGR
GGRRGRQGAA RNPDIHFINA FTDTIDLSSL RRRLEMEDGA AAVDQTDYIP RLYGRNRPES
DAASTQVMRR SGPGTGPGQA GGRWAGPAGS GSAAEAAAGE GDPADGTVRP GRARHARGGG
SGTGALEVAE QRPGTGPAAA EEAVVGGGST KAKAIWTLAD QGVSSATNAA VSLLIARQVS
SSEYGSFAIA YIIFSVIIGI SRAGGCLPLG ISYSGKSVSA FRYAAASATG ACLVFGGLLG
IVLVGVGAVA GGSVGSALVV VGFVLPGLLL QDAWRYVFFA MGKPLGAFLN DVAWALVQVV
GLTVLIERGV TASPPMLLAW GISALVAALL GVAQAGLWPA PSRALTWVRE NRANAANLAA
EFVTVQGALQ ASMLLIGLLS SKETIGALNG VRTLLGPTTV IGVGIVSFAV PELSRRIDMS
VRARERAAVL LTVIVVGVGG LWSLLFIAFP AIGETLLGDT WPGAHHILVY SAFHYAGTAL
PTGPACIMYA LGRTKITFRI NLSMAPMLFA FPILGLLLAD ATGAVIGYNL VFWGIAPVWW
ILLRRIVREH ARGRPDARDA AEMSVQVGAP EQAETFEEIA TSDRAERFDH TGTSDRAGLP
AGNVRVVEPS GREGPGRERA QVSRLPALAP PPPSGRPSRG EAVHARPAHG GEAHGTSVRG
GAGAGGAGSG GAAQEDVERT TVLQGPPPRG SSVDVSPQAG ERDRRDPEPL DLLEPEQRGG
ADGPRGGGRN PGR