Gene Franean1_6784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6784 
Symbol 
ID5675097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8261621 
End bp8266201 
Gene Length4581 bp 
Protein Length1526 aa 
Translation table11 
GC content67% 
IMG OID641245633 
ProductTPR repeat-containing protein 
Protein accessionYP_001511024 
Protein GI158318516 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACA GCTTCGACGT GTTCGTTTCC TACGGGCATG ACGATCAGGT GTGGGTGCGG 
GCGCTGGCGG AGAACCTGGA ACGCGCCGGC CTGCACGTTT TCTACGACGA GTGGGAGATC
CTCGCCGGGG ATGTGCTGGC CCACCGGCTC GACGCCGGCG TTCTGGGCTC AACGTCAGGG
ATCCTGGTGG TCAGTCCGCA TGCGCTGTCG CGGCCGTGGG TGCGCGAGGA GTATGCGGCG
ATGCTGACCC GGGCGGTCGC TGGCCAGCAG CGGCTGATCC CGGTGATCCT TGAGGATGCG
GAGTTGCCGG CGTTTCTGGC GTCGCGGGTG TGGGTCGATT TCCGTGACAC CCACGACCCG
GCGACCTACA CAGTCCGGGT CGGGGAGCTG GTGGCGTCGT TGCGTGGCCA GCGGCGGTCC
CGGCCGGCTG CGGACGGCAC CGTACGCCCT CCACCAGGGC AGGCGTATGT GGCCGAAGGG
CCCCGCGTTG CAACGTTGCG GATCACCCCG GACCGGGTGG AACTGCATAC CACCGGCGGG
ACCGTGGAAG GCCGCCCAGT CGGGATGGAC TGGTCAGCAG GTCAGGTACT GGGCGAGCTG
GAACGAGTCC GGACCCGCAG AACGGCCAGC GGGCTGGCCC TGAAAGCGCC AGCCGCTGGC
GGTGTGGCCG CCGATCCCCT CGACGGGGCG CTGCGGGCTG TCGGGCGGAT GCTCGGCGAA
CGGTTCCTGC CCGCCGACGT CGCGGCCGGC CTCGCGGGTG AGGTCGACGC GGCGGTACGC
GGCGGGGTAT CCCTGCAGCT GGCGGTGCAG ACAGCTGATG GGGTGGCGGA TCTGCCGTGG
GAGACCCTCA CCCTACCCGC GCACACCCGC CCGCTGGTGC TGCACCCGGG CGTGCAGCTG
TACCGGGCGG TCGTCGGGTT GGGAGCGACA CCGGCGATGG GGATCCGGGG GCCGCTACGG
ATCCTCGCGG TGATCGCCAG CCCCGACCAC GGTGGCGGGG CGCTGCTGGA CTACGAAGCT
GAGCTCGCGC GGATCCTTGA CGCCGTCGAC CCGGCCCGCC GCAAAGGCGA GGCCTATGTG
CGGATCTTGA ACGGGGGTAG CCGGGAGGCG ATCCGGGCAG CCCTGTTGCA GGAGCGGTTC
CACGTCCTGC ACCTGTCCTG CCACGCCGAG CCCGGCCGCC TCGTGTTGGA AGCCGAGGAC
GGCGCCGCCG ACCTTGTCGA CACCCAGAGG TTCGTCGAGG AGGTCCTCGT CGTCGGGCGG
GGTGTGCCCC TGGTGGTGCT CGCTGGCTGC TCCACCGCAC TGCATAGTGG CAGCGCCGTG
GACCCGAACG AGGAACAAGC GGGGGGTGGC GCGGGGTTGC TGCCGGGGTT GGCCCGGGGG
CTGCTCGCGC ACGGGGTGCC CGCGGTGTTG GCGATGACCG CCCCGGTCAG CGATCTGTAT
GCCACGGATC TGGCCTCGCG GCTGTATGGC GAACTGGCCG GACGCGCGGT GGTCGCTCCC
CTCGCCGCGC TGACGGACGC CCGCCAGGCG GTAGAAACCA GCCGGGTGGG CCTACCCGCC
TCCGATCGGC GGGCGGGGCT TGCCGAATGG GCCACCCCCG CGCTGTTCCT CCGCGGCCCG
GTACTGCCGT TGTATGACCC GGCTGCGGGT TTCGACACGA TCACCGAACC GGTTACGCCG
GTGCTGGCGG AAGGGATCGT GGTCCGCCGG GTCGGGGAGT TCGTCGGCCG CCGCACCGAA
CTCCGCACCC TGCACCGCGC CCTCCACGGC GAAGGCGCCG GGGTTGTCCT GCACGGCATC
GGCGGCGTCG GGAAATCAAC CCTCGCCGCA CAACTCCTCA CCGACCTCGG CGACGACGCA
GGCCTGGTCG TATCGCTGAC CGGGCCGGTC GCGGTCGATC AGATCCTCGC CGCGATCGCC
GCCCGGCTGG CCTCCTGGTG CTTAAAGCAC AACATCAACA TCGACGATGT ACGCCGGCGC
CTGATCGATG CCCTCCGCTC CGGGGCCCCG TGGCAGGAAC GCGTCGCGCT GCTCGCAGAG
CATCTGCTGC CGACAGTGCC GGTCACGCTG TTCCTCGACA ACGCCGAGGA CACCCTCACC
CCCCACGGTG TCGGTCACCG GTTCACCGAC CCGCAGCTGA ACGCGTTCCT GGCCGCGTGG
GTCGGGCTGC CGGGGCGGGC CCGGCTGCTG GTCACCAGCC GCTACCCGCT GCCCGTCGAC
ACCGCTACCG CCCACCGGCT GACCGTGCAC CACCTCGGCC CGCTCTCGGT CGCGGAAGCC
CGCAAACTGT TGTGGCGGCT CCCCGCCCTC GACGCACTCA CCGCCAACGA ACAGGCCCGC
GCGATCGCGG ACGTAGGCGG GCACCCCCGC ACCCTGGAAT ACCTCGACGC GCTCCTCGCC
GGCGGCCACG CCCACTTCCC CGAGATCGCC ACCCGCCTCG AGAACACCCT GAAAACCCGC
GGTGTCACCG ACCCCGCCGC CTGGTACACC ACCCTCGCCG GGAACATCGA TCGGGCGCTG
GCCGAAACCA TCACCCTCGC CGTCGACGAC ATCGTCCTCG ACACCCTCCT CACCCGCCTC
GACACCATCC CCCTCGCCCG GCGGCTGCTG ACCGGCGCGG CGGTCTACCG CCTACCCGTC
GACCGCAACG GCCTCGCCTG GCAGATCTCC ACAACCGTCG AACCGACACC CGACTCCGCC
CAGGACCCGG ACATCGCCAG GATTCTCAAG TTTCTCGCGG AGGCTCGTAC CAGCGGGGTG
ACCGCCCTGG AAGACGCCGG CCTCACCGAT ACCCAGATCC AGAGGTATCA GCGGTGGGTG
GAACAGCAGC GCCGGCCACC ACTGGCCATC CCCCAAGAAT TCGACACCGC CCTGACCAGC
CTGCTCGAGC TCGGACTCCT CTCCCCCACC CGCGATACCG GCGGGCAGCT GGCGTTCACC
GTGCACCGCT GGACCGCCAC CGCCCTCCTG TCCCGCGCCA CCCCCCACGA CCACACCGAC
GCCCACCACC GGGCCGCCGC CTACTGGGAA TGGCGGGTCG CAGTCTGGCC TCAGGACCAG
TACACCGACG TCACCCAGCT GTTGGAGGCC GGCCATCACC ATCTCACCAC AGGCGCTCAC
GACCGCATCG ACCAAACCAC CGGCGCCGCC TGTCAGCAGC TACACACCTG GGGCCTGTGG
GCTTGGGAAG AACAGATCTG CCGCCACTCC CTGAACCGCC TTCCCACCAC CGACACCAGC
CGGCTTACGG CCGCGTTCAC CCAGCAACTC GGCATGATAG CTCAGCTATG GGGGGACTAC
ACTACCGCCG AAAATCGCTA CCACAAATCC CTCGCCATCT TCCAAGAGCT CGGAGACCAA
GCTGGCATCG CCAACTGTTA CCACCAGCTC GGCAGGATCA CCCAAGAGCG GGCAGACTAC
ACCACCGCCG AAAACCGCTA CCAAAAATCC CTCACGATCC GCGAGGAGCT CGGAGACCGA
GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACCAC
ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTTACCATCT TTCAAGAGCT CGGGGACCGA
GCTGGCATCG CCGGCAGCTA TGGTCAACTC GGCATGATCG CCGAGCTTCG AGGGGACTAC
ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTCACGATCC GCGAGGAGCT CGGAGACCGA
GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACCAC
ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTCACGATCC GCGAGGAGCT CGGAGACCGA
GCCGGCATCG CCACTAGTCA TCACCAGCTT GGCATTATTG CCCAAGACCG AGGGGACTAC
ACCACCGCCG AAAACCGCTA CCAGAAAGCC CTTACCATCT TTCAAGAGCT CGGGGACCGA
GCTGGCATCG CCGGCAGCTA TGGTCAACTC GGCATGATCG CCGAGCTTCG AGGGGACTAC
ACCACCGCCG AAAACCTCTA CCAACGGACC CTCACTATAT TTAGAGAGCT CGGGAACCGG
GTCGGAATTG CCACCAGCTA TCAGAGGCTC GGCATCGTGG CCGAACTGGT AGAGGACTAC
GCGACCGCGG AAGAAAACCA CCAGAAATCC CTAGCCATCT TCAAAGAGCT CGGACACCGG
ATGGGCATCT CCACTAGCCA TCTCCAGATC GGCGTCATCG CCCAGAATCA GGGAGATTAC
ACCACCGCCG AAAAACGCTA CCAGAAAGCA CTCACCATCT TCCAAGAGCT TGGGAAGCGA
GCCGACATCG CAGCCACCGC CTCCCAAATC GGCGTTCTGT ACACCACTCG GGACCGCATC
GCCGATGCCA TCCCGTACAA CCTAACGGCA CTATCCATTC GCCTCAGCAT CAAATCACCC
GAATCGGCCA TCGACATCCA TTGGCTGAAA AAGCAACGCG AAGCACTCAC CGACGCCGGT
TTTCGGGCAG CCCTTGCCAA CCATGTATCC CACAATGATA TCAACACCAC CATCGATTTC
CTCGACTCGA TCCCCGAATA G
 
Protein sequence
MTDSFDVFVS YGHDDQVWVR ALAENLERAG LHVFYDEWEI LAGDVLAHRL DAGVLGSTSG 
ILVVSPHALS RPWVREEYAA MLTRAVAGQQ RLIPVILEDA ELPAFLASRV WVDFRDTHDP
ATYTVRVGEL VASLRGQRRS RPAADGTVRP PPGQAYVAEG PRVATLRITP DRVELHTTGG
TVEGRPVGMD WSAGQVLGEL ERVRTRRTAS GLALKAPAAG GVAADPLDGA LRAVGRMLGE
RFLPADVAAG LAGEVDAAVR GGVSLQLAVQ TADGVADLPW ETLTLPAHTR PLVLHPGVQL
YRAVVGLGAT PAMGIRGPLR ILAVIASPDH GGGALLDYEA ELARILDAVD PARRKGEAYV
RILNGGSREA IRAALLQERF HVLHLSCHAE PGRLVLEAED GAADLVDTQR FVEEVLVVGR
GVPLVVLAGC STALHSGSAV DPNEEQAGGG AGLLPGLARG LLAHGVPAVL AMTAPVSDLY
ATDLASRLYG ELAGRAVVAP LAALTDARQA VETSRVGLPA SDRRAGLAEW ATPALFLRGP
VLPLYDPAAG FDTITEPVTP VLAEGIVVRR VGEFVGRRTE LRTLHRALHG EGAGVVLHGI
GGVGKSTLAA QLLTDLGDDA GLVVSLTGPV AVDQILAAIA ARLASWCLKH NINIDDVRRR
LIDALRSGAP WQERVALLAE HLLPTVPVTL FLDNAEDTLT PHGVGHRFTD PQLNAFLAAW
VGLPGRARLL VTSRYPLPVD TATAHRLTVH HLGPLSVAEA RKLLWRLPAL DALTANEQAR
AIADVGGHPR TLEYLDALLA GGHAHFPEIA TRLENTLKTR GVTDPAAWYT TLAGNIDRAL
AETITLAVDD IVLDTLLTRL DTIPLARRLL TGAAVYRLPV DRNGLAWQIS TTVEPTPDSA
QDPDIARILK FLAEARTSGV TALEDAGLTD TQIQRYQRWV EQQRRPPLAI PQEFDTALTS
LLELGLLSPT RDTGGQLAFT VHRWTATALL SRATPHDHTD AHHRAAAYWE WRVAVWPQDQ
YTDVTQLLEA GHHHLTTGAH DRIDQTTGAA CQQLHTWGLW AWEEQICRHS LNRLPTTDTS
RLTAAFTQQL GMIAQLWGDY TTAENRYHKS LAIFQELGDQ AGIANCYHQL GRITQERADY
TTAENRYQKS LTIREELGDR AGIATSHHQL GIIAQDRGDH TTAENRYQKA LTIFQELGDR
AGIAGSYGQL GMIAELRGDY TTAENRYQKA LTIREELGDR AGIATSHHQL GIIAQDRGDH
TTAENRYQKA LTIREELGDR AGIATSHHQL GIIAQDRGDY TTAENRYQKA LTIFQELGDR
AGIAGSYGQL GMIAELRGDY TTAENLYQRT LTIFRELGNR VGIATSYQRL GIVAELVEDY
ATAEENHQKS LAIFKELGHR MGISTSHLQI GVIAQNQGDY TTAEKRYQKA LTIFQELGKR
ADIAATASQI GVLYTTRDRI ADAIPYNLTA LSIRLSIKSP ESAIDIHWLK KQREALTDAG
FRAALANHVS HNDINTTIDF LDSIPE