Gene Franean1_1350 details


Gene Information

Locus tag: Franean1_1350
Symbol:
ID: 5669759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1622148
End bp: 1628288
Gene Length: 6141 bp
Protein Length: 2046 aa
Translation table: 11
GC content: 65%
IMG OID: 641240277
Product: YD repeat-containing protein
Protein accession: YP_001505704
Protein GI: 158313196
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
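
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1622148
END_BP = 1628288
GENE_LENGTH_BP = 6141
PROTEIN_LENGTH_AA = 2046


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one for the stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 6141
print(expected_protein_length(GENE_LENGTH_BP))  # 2046
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.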


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCAT CCCCCGTACG TCGTCGTAGG TCATTCATCC GTTCTCTCTG CGTTCTGTTG 
GCCGCGTTGC TGGCGGCGAC CGTCACCCAG TTCCTGCCAC CCGTGAGCGG TGCCGCGTCC
GCCGCACCCA CGGACGCCGG CACGGAGGTA CCACCTCCGA CCGGCCCAGC CGAACGTCCT
GACCTGATCA CGGCTCAGCT CACCGCGCGG GCGGAGCAAC GCCGGATCGA GGTGACGTCC
CTGACCACGG AGTCGACGCG GACCCTGGTC AACCCCAACG GCTCCATCAC GGTGGAATCC
ACCTCCGGCA TCGCCCGCGT CCACCGTGAC GGACGCTGGC TACCCGTCGA CACCACCCTC
GTCCTCACCG ACGGCAAGAT CACCCCCAAG GTGGCCAAGG CCGCCATCGC CCTGTCCGCA
GGCGGCAAAG ACGCCGGCGA CATCGCGACG CTGCGCGACG GCGACCGGGA GATCGCCTTC
GGCTGGCCCA CGGCGCTGCC CACGCCCGAG CTGAAGGACA ACGTCGCCAC CTACAAGGCG
GTCGCGCAGG ACACCGACCT TCGCGTGAAG GTGACCCCGA CCGGCTACGA CGTCCAGATC
GTCGCGCACA CCCCGGCCGC GGCCCAGGCG GCACTGAGCC TGCCGATGCG GTTGAAAGGC
GTCACCGCCG AACGCACCGC CGGCGGCGAG CTGCGCCTGT CTGCCGCCGG GAAGATCGCC
GCCCGGTCCC CGACCCCGCT GATGTGGGAC GCGCACGTCG ACGCCAAGAC TGGGCTCCCG
GACCGGACCC ACACCGTCGA GGCGGCCCTC GAGCGGGCGG GCACCGCCAC CCCTGCCCTC
GCCTTGCGGC CAGACAAGGC CTGGCTGACC GCCGATGACC GCCAGTACCC GGTCACGATC
GACCCGGCCG CCGTCCTACC GGACAACCTC GACACCGACG TGGTCAACAC GTCAGCGACC
ACGAACTACG ACACCTACGA GTACCTGCGG GTCGGGAACG TCTTCGGGTC CGTGCACCGG
TCGTTCCTGC GGTTCGACAC CTCCGCCATC GAGGGCAAGC ACGTCACCGC AGCAGCACTG
AAACTCACTC AGGCCGGCTC CTACACCTGC ACGCCGCAGC GGATGGTCGT GCAGGGCTCG
GCCGGCCTGG CGTCGGGGGC GACGTGGAAC ACCCAGCCCA CCGCGGACGG GGTGAACTGG
TCGGACACCA CCTTCAACGC CGGCGGGTTC TGCGGCTCCG GCGATGTCTC CCTCGACATC
ACCGGACTGG CACAGGCATG GTCATCCAAC GGCCAGCCCT CACCGGAAAC CCTCACCCTG
CGGGCGCCCG ACGAGGTGCT GTGGGAGCCC TACAAGTACT TCACCTCCGG CGACACGGCC
ACCCCGCCGC GGATCGAGAC AACCTACAAC TCCTACCCGG CCACCGTCGG CGCCCGCACC
ACCTCCCCCT GCTCAGCCCA GTGCGGGGGC ACGCCGCCGA CGGTGCTGAC AAACTCGACG
ACGCCACGAC TGGCCGGATC GGCGACCGAC GCGGACGGTG GCACCCTTCG CCTCGACTTC
GAGGTATGGA ACTCCGCCGG CACCACAAAG ATCACCCAGG GGTCCAGCGG ATTCGTCGCG
CAGAACAGTG AGGCGGCTTG GACCGTCCCC TCCGGGCTGC TCGCCAACGG CACCAGCTAC
CAGTGGCGGG TCCGCGCCTA CGACGGCACC GACTACTCCC AGAACTGGTC CACCTGGATC
CCGTTCACCA TCGACACCAC CGCCCCCGCC ACCCCCACCG CCCTCGCCTC CACCGCATGG
CCATCGGGTG GCTGGGCCAG CGCCACCTCG GGCACATTCA CCTGGACCTC ACCCGGCGGT
GACACCGCCG GATTTCTCTA TGGCCTTGAC GAGCCATCCC CAACGACCTC ACCGACCCCA
CCGACCGGCA CGACCAGTGC CTCGCTGACC GCCGACGAGG GACTGCACAC CTTCTACCTG
CGCACCATCG ACACCGCCGG AAATCTCTCC CCCGTCATCT CCTATAACTT CGGCGTCGGC
AACGCCGCGC TCGCCTCACC AGCCGAGCAG TCCCGCACCC AGCGATTCGC CACCCTGCGC
GGCGAGGCAC CCTCATCGCA GGTATCGGTG ACCTACCGCT ACCGGGTCGG GACCAACCCG
TCTACCGCGT GGACGGACGT CCCCACCGCC GAAACCACCA CCCAGGGCAC CACCAACCAC
CCCACCTGGC CCGCCCCCCG CAACGGCTCA GGCGCCTTCG ACAACCAGAT CTGGAACATC
CCCGGCACCC TCGGTGGCGG CTCCGACGGC CCCATCCAGA TCCAGGCCTG CTTCCGCACC
GCAGCCGCAG TCGTCACCTG CACGGATGCG ACCACCATCC AGCTCACCCG CAATGCCTTC
AGCGACACCG ACGCGGTCGG CGAGCTCGGC CCGGGATCAC TAGCCCTACT CACGGGAGAT
TATTCCCTGT CGGCGACCGA CGTGTCGATG CCGACCTACA CCGGCACTCT CGCCGTCGGC
CGCACGCTGA CCACACTGAC CCCGCCCGCA GCCACCACCG CCCCTAACGG GATCTTTGGC
CCCGGGTGGA CATCGAGCGT TCCCGGTCCC GAAGCCGGAT CGGGAGACCG CACCCTGACC
GATAGCACGG CCACCGAGGG GTACGTTACT CTCACCTCAC CCGAAGGCGC CCCCGCCGTC
TACACCCGCG CTGGCACAGG TAGCTACCCT TACGAGTACA CCGGTGTTGC CGACACCGCC
GCCGACGGAT CCAAGCTCAT CAAAGACTCG GCGACCAAGT TCACCCTCAC CGAGATCGAC
GGCGCGAAGA CGATTTGGTA TTCCAAGACC ATCGGCAGCG CCACCATCTG GGTCGTCGAC
CGCGTCGAGG AACCCGGCTC CAACACCACC TCTACCTTCA CCACCGACAG CCTCGGCCGC
ATCACCCGCA TCCTCGCTCC CGTCCCCAGC GGCGTCGACT GCTCCGGCAC ACTCGGCGCC
GGCTGCCGTG CTCTCACTCT TACTTACGCC ACCAGCACCA CCGCGACGGG ATCCGGTGGT
AACCCTGCCG AATGGGGTAA CTACACCGGA CAGCTATCCT CGATCTCCAT GGCTCTGAAC
GGTGCCACCT CCATCGAGGT CGCCCGCTAC AGCTACGACA CCACTGGCCA TCTACGCGCC
GAATGGGATC CCCGCCTCGA CACCGCCGGC GGCAACCACC TCGCTACCCT CTACTGGTAC
AACGCCGAAG GCCGCGTCCA GGGCTTCATC CCCACTGGCC AAGAGGCATG GGGCTTCGGC
TACGACGGCT TCGGACGGCT CACCAATATC AGCCGACCTC GCCCGGCAGG TGCTGGCACC
GCCACCAACA CGATCGTCTA CGGCGTACTG CTCTCCGGCG CCTCAGCGCC GGTCGATGTG
TCCCCTGACC GGGTCGATGA CTGGGCCCAA CAGGACCTGC CCGTCTGGGC AACCGCCGTC
TTCCCAGCCA GTCACGTCCC CGCCAGCCCA CCCACCGCCG CCGACTGGCC CTACGCCGCG
ATCAACTACC TGAACGCTGA CGGCCGGCAG GTCAACACCG CCTCCTACGG GGCAGGTGCC
TGGCAGGTTA CTACCACGGA ATACGACACG TTCGGCAACG TGACCCGTGA GTTGACTGCA
GAGAACCGCA ATCAGGCGCT GACCCCTACC ACAGACACCG ACGCCACCGT CGCCGCCCTC
ACCGACTCCG CCGCCCGCGC CCAACTCCTC GATACCAAAA TCACCTACAG CGTCGACGGG
GTCGTACCCC TCGACGTATA TGGACCAAAA CATCGATACA TCAACAATGA CGGCGTCCGG
TCGTCGGTAC GACAGCACGT CCACACCGAT TACGATCAAG GCGCCCCGCC GGCAACAGGA
CCATACTGGC TGCCCACTAC AGTCACGACC ACGGCGTACA CCGGCAGTTC CGACATCGAC
CCCCGTACAA CAATCAATGG CTACGCTGCG AAGACAGGCG CGGACGCTTC CACCTCCGGC
TGGACCCTAC TTCAGCCGAC AACAATCACG ACGTGGATGG GTGGCGGGGC CACTCCCAAC
ATTGTCCGCA CCACCTACTA CAATGCCGCT GGCCAACCCC TCGAAGTACG CCAACCCAAG
GCCAACAGCG ACGGCACCGA CGCCTTCACC ACTATTTTCT CCTACTTCAC TCCAACTGGA
TCCGGTGGAT GCGTTAATGC CTCCTGGGCA GGTCTAACCT GCTCCACTGG ACCCGCATCG
CAGCCTACTT CTGGTAACAC TCTTCCAGTC ACCACCCTCA CCTACAACAA TCTCAATCAG
CCGCTCACCA AGACCGAGAC CGTCACTTCC AGCGGCACGA CCACCCGCAC CACGACCTAT
ACCTACGACA CCGCCGGCCG GACGCTCTCC GAAGGATTGG CCGTTAACCC CGCGGCCAAC
GGCGGCACCG CCGTACCCAC CGCCACCTAC GGGTACAGCA CGATAAGCGG TCTCCCGACT
ACCACGACCG CGAACAGCGT CACCCTGACC ACCGAATACG ACGCCTGGGG GCAAATCACC
TCCCAGACCG ACTCCGACGG CAACACCAGT AACACAACCT ACGACATCGA CGGCCGAGTC
TCGTCGGTCA ATGATGGTAA GGGTGCCTAC ACTTACATCT ATGATACCGC CACCGAACAC
CGCGGCCTCG TCACCAGCCT CGGCATCGAC GCCGGCTCCG CGCCGTCCAC CTTCACCGCC
ACCTATGACG GCGACGGGAA ACTGACATCC CAGAACTACC CAAACGGTCT GGTCGCCACC
AGCCACTATA ACAACACTGG CAGACCCACC ACCCTTACCT ACACCAAGGG CACGTCCACC
TGGCTGACCT TCACCCAAAT CGACAATATC AACGGCCAAG GGCGTGTCGT CGAATCCCCC
GGCGGGATCC GCGAACACCT CTACGACCTG GTCGGCAGGC TCATCACCGT CCGGGACATC
CGCAACGATA GCGGCAGCGT TATGTGCACC TCCCGCCGGT ACGTCTACGA CCCCGATTAC
AACCGCACCC AGCAGATCTC CTACCCCGAC GCCGGCACCA ACCCCACCGG CCCCGCCTGC
AGCACCTCCA CCACTCCCAC CTACTCGCTG AGCTCCAGCT ACGATCAGGC CGACCGCATC
ACCGACACCG GCTACACCTA CGATCTGTTC GGTCGCACCA CCGCCGTCCC GGCAGCGGAA
AACGATCTCA CGGTCGGCTA CCACGCCAAC GACATGGTCG CCTCCGAAAC CCAGGGCGCC
GCCACCCGCA CCTACACCCT CGATCCCGCA CGTCGCATCC GAACCTGGGC TCAAGGTGCC
ACTACCTGGA CCAACCACTA CACCTCGGGC TCCGACGACA GCGCCGCGTG GATCGGCGAC
AGCGGCGGTG GATGGACCCG CAACATCCTC GGAATTAGCG GCACACTAGT TGCTGTTCAG
GACCAGACTG GCACGGTCAC CCTACAGCTA TCAAACTTGC ACGGAGATAT CGTTGCCACC
GTCGACGACA ACAGCTCTGC GACGGCACCC ACCACCTTCC AGGAAACCTG GGAATTTGGA
CAGCCCTACA ACATTGCTAC CGCTTACCCT CGATATGGCT GGCTCGGTGG GCCACAACGG
TCTCGCGATA CCATTTCAGG TATCACTCTT ATGGGGGCGC GACTATACAA CCCGGGAACA
GGTCGCTTTC TTCAGGTAGA CCCAGTGCGC AATGGCGCCG CCAATCCGTA CGAATATGCC
TACCAAGACC CATGGAATAT GACCGATCTC GATGGTCGCA TCCCGACTCC GAGTCTTGTT
TACTCTTGTC CGCGAGGGTT CAACTGCCAG GTCCTCCTTC GCAGCACGAA ATACAGCGAG
TGGAGGGCAT GGCAGCTGAC CGGTGTCGCC GATGGCCACC TAAGCTCGTT TTACTACGAA
ACACGAAGAA GATCCGCTAA ATTTTATGTC TCGCGATACT TCTACTATAA CCGGCAAACT
CAATTTATTA TGTACATCTA CACACAGGTA CAGGACAGGA TTAGAACCTT TACGTGGCCA
GCAATCAGAA CGGAGAAGAT TCAAAGCGAC AGATTCGAGT ATATAACATA CGTAATGTGT
CGCTCGGCCT ATATTTGCTA A
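
The gene sequence is displayed in space-separated blocks of ten bases, so the blocks need to be joined before any computation. A minimal sketch of that cleanup plus a GC-content calculation follows. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first displayed line of the sequence is used as sample input. Running the same functions on the full sequence would give a value to compare with the reported GC content of 65%.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGCGGGCAT CCCCCGTACG TCGTCGTAGG TCATTCATCC GTTCTCTCTG CGTTCTGTTG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this one line only
```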
 
Protein sequence
MRASPVRRRR SFIRSLCVLL AALLAATVTQ FLPPVSGAAS AAPTDAGTEV PPPTGPAERP 
DLITAQLTAR AEQRRIEVTS LTTESTRTLV NPNGSITVES TSGIARVHRD GRWLPVDTTL
VLTDGKITPK VAKAAIALSA GGKDAGDIAT LRDGDREIAF GWPTALPTPE LKDNVATYKA
VAQDTDLRVK VTPTGYDVQI VAHTPAAAQA ALSLPMRLKG VTAERTAGGE LRLSAAGKIA
ARSPTPLMWD AHVDAKTGLP DRTHTVEAAL ERAGTATPAL ALRPDKAWLT ADDRQYPVTI
DPAAVLPDNL DTDVVNTSAT TNYDTYEYLR VGNVFGSVHR SFLRFDTSAI EGKHVTAAAL
KLTQAGSYTC TPQRMVVQGS AGLASGATWN TQPTADGVNW SDTTFNAGGF CGSGDVSLDI
TGLAQAWSSN GQPSPETLTL RAPDEVLWEP YKYFTSGDTA TPPRIETTYN SYPATVGART
TSPCSAQCGG TPPTVLTNST TPRLAGSATD ADGGTLRLDF EVWNSAGTTK ITQGSSGFVA
QNSEAAWTVP SGLLANGTSY QWRVRAYDGT DYSQNWSTWI PFTIDTTAPA TPTALASTAW
PSGGWASATS GTFTWTSPGG DTAGFLYGLD EPSPTTSPTP PTGTTSASLT ADEGLHTFYL
RTIDTAGNLS PVISYNFGVG NAALASPAEQ SRTQRFATLR GEAPSSQVSV TYRYRVGTNP
STAWTDVPTA ETTTQGTTNH PTWPAPRNGS GAFDNQIWNI PGTLGGGSDG PIQIQACFRT
AAAVVTCTDA TTIQLTRNAF SDTDAVGELG PGSLALLTGD YSLSATDVSM PTYTGTLAVG
RTLTTLTPPA ATTAPNGIFG PGWTSSVPGP EAGSGDRTLT DSTATEGYVT LTSPEGAPAV
YTRAGTGSYP YEYTGVADTA ADGSKLIKDS ATKFTLTEID GAKTIWYSKT IGSATIWVVD
RVEEPGSNTT STFTTDSLGR ITRILAPVPS GVDCSGTLGA GCRALTLTYA TSTTATGSGG
NPAEWGNYTG QLSSISMALN GATSIEVARY SYDTTGHLRA EWDPRLDTAG GNHLATLYWY
NAEGRVQGFI PTGQEAWGFG YDGFGRLTNI SRPRPAGAGT ATNTIVYGVL LSGASAPVDV
SPDRVDDWAQ QDLPVWATAV FPASHVPASP PTAADWPYAA INYLNADGRQ VNTASYGAGA
WQVTTTEYDT FGNVTRELTA ENRNQALTPT TDTDATVAAL TDSAARAQLL DTKITYSVDG
VVPLDVYGPK HRYINNDGVR SSVRQHVHTD YDQGAPPATG PYWLPTTVTT TAYTGSSDID
PRTTINGYAA KTGADASTSG WTLLQPTTIT TWMGGGATPN IVRTTYYNAA GQPLEVRQPK
ANSDGTDAFT TIFSYFTPTG SGGCVNASWA GLTCSTGPAS QPTSGNTLPV TTLTYNNLNQ
PLTKTETVTS SGTTTRTTTY TYDTAGRTLS EGLAVNPAAN GGTAVPTATY GYSTISGLPT
TTTANSVTLT TEYDAWGQIT SQTDSDGNTS NTTYDIDGRV SSVNDGKGAY TYIYDTATEH
RGLVTSLGID AGSAPSTFTA TYDGDGKLTS QNYPNGLVAT SHYNNTGRPT TLTYTKGTST
WLTFTQIDNI NGQGRVVESP GGIREHLYDL VGRLITVRDI RNDSGSVMCT SRRYVYDPDY
NRTQQISYPD AGTNPTGPAC STSTTPTYSL SSSYDQADRI TDTGYTYDLF GRTTAVPAAE
NDLTVGYHAN DMVASETQGA ATRTYTLDPA RRIRTWAQGA TTWTNHYTSG SDDSAAWIGD
SGGGWTRNIL GISGTLVAVQ DQTGTVTLQL SNLHGDIVAT VDDNSSATAP TTFQETWEFG
QPYNIATAYP RYGWLGGPQR SRDTISGITL MGARLYNPGT GRFLQVDPVR NGAANPYEYA
YQDPWNMTDL DGRIPTPSLV YSCPRGFNCQ VLLRSTKYSE WRAWQLTGVA DGHLSSFYYE
TRRRSAKFYV SRYFYYNRQT QFIMYIYTQV QDRIRTFTWP AIRTEKIQSD RFEYITYVMC
RSAYIC
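
The record lists translation table 11. In the NCBI genetic-code tables, table 11 assigns amino acids to codons exactly as the standard code does; the two differ in which codons may initiate translation. The sketch below builds that codon mapping from the 64-letter amino-acid string and translates the first and last stretches of the gene sequence shown above. `translate` is an illustrative helper, not a library function, and it assumes the sequence is displayed in coding orientation and starts in frame.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCGGGCAT CCCCCGTACG TCGTCGTAGG"))  # MRASPVRRRR
# Last displayed line (21 bases); the 6120 bases before it keep it in frame.
print(translate("CGCTCGGCCT ATATTTGCTA A"))           # RSAYIC
```

Both results match the first ten and last six residues of the protein sequence listed in the record.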