Gene Franean1_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0045 
Symbol 
ID5668471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp56774 
End bp60883 
Gene Length4110 bp 
Protein Length1369 aa 
Translation table11 
GC content71% 
IMG OID641238974 
Producthypothetical protein 
Protein accessionYP_001504419 
Protein GI158311911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCG GCAAGCTCAA CCCCGACCCG GTGCGGCCGC TCGACGGCTA CCGCGTCACC 
GCCAACGTCG ACGTCCCGAT GCTCGGCGAC GACGGTCAGA CCACCACCAC CGTCGCCGTC
GGCGCCGACG TCACGTCGGG CGGGGATGTC GCCATCGAGC TGCCGGAGCG GCGCACGGGC
GACGTCCGTC TCGTGGCTAC GGCACCCGAC GGTAGTCGCG TCGGGGCGAC GACCATTGCC
GCCGCCAACG TCGACCAGCC GTTCACGCTG GACGTCACCA CCGTCGATCC GGTGACCGTG
GTGCCGTCGG ATGTGCCCGG CCTCGGCCGA CGAGAACGCC TCGTCGGCCG GGTGCTCGAC
CCGCAGGGCG CCGGCGCTCC GGCCAAGCTG CTCGTCGTCG TCTGGGGTGT GCCCCACGGC
GCCCAGCCGA CCGACGCGGT GCCGTTGCTG GTGGCCGAGA CCGCCGCCGG CGGGTACTTC
GGCGACGACC TGCCCCCTGA CGAGCTCGAC GACGCGTGGG CCAAGGTGGC TGGCGGCGCG
CCCGTACCGA TCCTGCTGGA CGGCGGCCGC CTGCCGCGCA CCGTTGTCAT CGTCACCCCG
GTGAAGGCGT CGCACGATAA CGACGACGAC GGCTGCGGAT GCCACGCCGC GCCGCCGCGC
AGCCCGGACC AGGCCGACCT GGCGGCCAAC CCGGAGGCGT TCGCTGCCGA CCCCGGCCGT
TGCGTCGACC TCACGGTGCC CAACCGGGCG CTGGAGGAGG TGACGTTCCA CGCCGTGGTC
CGCACCACCC AGCCGGAGAT CCGCGGCCTG CAGATGCCCA ACGCCGACGA CGTGCCCTCG
CCGGTCGTCG ACCGGCTGGT CCAGCTCGTG GCCAGCCGGC CGATCGAGGC GATCGCGCGG
CTCGAGTCGC CGAGCACGTA CACGACGAGC GTCGACGCGG CGATCAAGCC GGCGTCGAGC
CCGCAGCTCG CCGCGCTCGC CGTGCGCGCC GCGGTGCCGG AGGTGGACGA ACCGACCGTG
CGCCGATCGT TGAACCTGCG CGCCGACGTC GTCGGCGAGC TGATGCGTGA CGGCGCCCAC
CTGCGCGTCG ACGACCTGCT CCACGCCGAG CGGGTGAGCA CGATCCGCAC CGTGCGCGAC
ATCGTCCAGG CCGTCGGCGG CGCACCGGCC GGGCGGGTGG AGCTCGGCGG CGAGACGCAG
GTGGACTGGG ACGACACCCC GACGCTGTAC CAGGCCACCA CCATCGCCCA TGGTCACCTG
TTGACCTTCA AGCAGGTGTG GCGGGCCGAC GGCTACTCGC TCGGTGACCT GCTCTACTCG
CTGCCGCTCG CGCCCGGCCA GAAGAAGCTC ATCGCCGTGG TCGACTGGGA CCGCCGTGAG
GTGGAGGTTC GCCAGGCGCA GCGGTCGGAG ACCGAGGTGC TCGATGCCTC GCTGGCTCAT
GACCGAGACA TCTCCGAGGT CATCAGCAGC TCGCTGCGCC AGTCGATGCG CGGCTCGTCC
AGCGCCGACA CCGAATCGAT CGCCGGCGGG ATCGCCGGGT TCATCGGCCC CGTCGTGTTC
GGCTTCGGTG GCGGCTCGTC CAGTGCCGGC TCGACCGCCA GCCAGCGCTC GTCGCGCGAC
GTCGCCGGGA CGATGCTCAA CCAGGCACGC GACCGGACGT TGCAGGCGGC CAGCGCAGTG
CGCAGCCAGC GGGCCACGGT CGTGCAGACG GCGCGCCAAG GAGAGTCGCT GCGCGTCCAG
ACCGACGTGG TGGCCAACCA CAACCACTGC CACGCGATGA CCGTCGAGTA CTTCGAAGTG
CTCCGTCACC TGCAGGTCAC CCAGGAGATC GCCCACGTCC AGGAGTGCCT GTTCGTCCCG
TTCGAGATCA CCCCGTTCAA CGGCGCCAAG GCGCTGCGCC ACCGCGACGT GCTGGTCCGC
CACGTGCCCG GCGCGCTGCG TCGCGGCTTC GACGCGCTGG AGCGGATGCG CACGAACTGG
TCCGATGCCG ACTTCCCGCT GGCGCGCTAC GCCGACGAGC CGCTGACCTA CCTGGACGGC
GAGCTGCAGA TGAGCTTCAG CATCCCGCGG CCAGCCGACG GTGGCACTGA CAAGGATCAG
TTCGACTCGT CGAAGTGGTC GGTCTACGAG GCCAGCGGGC TGCTGGCCCA ACCGGCGCAG
ACGCTGTGGA ACGCCTACCT CGGCACTGCC GTGGTGGCCA ACCGTGACGC GATCTTCGCT
GCGTCGATCG CCCCGAACAT CGCCCAGCGG ATCGTCGACC GGCTGGCGCT GCGGCTCGTG
CTCGACAACG GCTCGCCGGT CGGCGTCGGC ATCGACGCCA CGCTCGTGTC CCGCTATCGC
CCGGACGCGC CGCTGCTCGT CAGCCTGCGC CCGACCTCGA CCACCCCGGC GGTGGTGCGC
GCTCAGATCG AGCGCGCCGT CATCTCGCTG GGCGCGTTGT TGCCGCCGGA GGCCAAGCTC
GTCGTCCAGC AGGCCGGCCT GCGCTACCGC ACCGCCCACC TCGAGCACGA CCTCGTCCCC
ACGCAGCGCA CGTACAACGA TCTCGGCCCG GGCGACGAGG TGGAGCTCGT CGTCGGCCTG
GACCGTCAGG AGAAGCGCAA CCCGCGGGCC GAGGACCCGC GCCTGGCCGA CGCCCTCGTG
GATCACCTCA ACGAGCACAT CGAGCACTAC CACCAGGCGA TCTGGCGGGA GATGGACCCG
AACCGGCGGT ACCTGCTACT CGACGGCTTC GTCGCCCCGA ACGCCGGCGG CCGCAGCGTG
GCCAGTGTCG TGGAGAACCG GCTGATCGGC ATCGTCGGCA ACTGCCTGGT GATGCCGGTC
GTGCCGGGCG TCAAGCTCGA CCGGTCCTAC GAGCACTCAA AGCGCACCCA GACCGACCTG
CTCGACCTGT ATGCCACCGA CCCCGCGCCG CCGATGCGCA TCTCTCTGCC GACCAAGGGT
GTGTTCGCTG AGGCGGTGCT CGGAGCCTGC AACAGCTGCG AGCTGAAGGA CGACACCCGC
TTCTGGCGGT GGGAGGAGTC GCCGATCCCG GACGAGCCGA CCCCGATCGA GCAGGTGTCC
ACGGCAACAC GGCGCAGCAC GGTGCCGAAC CTCACTCCCG ACGCGTTCCC CCAGGCGCTG
GTGCGCTTTC AGGACGTGCC CACGGCTCCG GCACCGTCCG CGCTCGCCGC CGCGCTGGAG
TTGATCGGTA CGCCGAACCT GTTCTCCGAC ATCACTGGCC TGGCGCTGAA CCAGGCGAAC
TCGGCCGCGG CGTTGGAGGA GGCGTCGAAG ACGGCGACGT TCTTTGCCCA GCAGGGCGCG
GCGCTCGCCC AGCAGCGCTT CCTCGCCTCG GACACCCAGC GCCAGCTGAA GCGGATCAAG
GATGCTCGCG ACCAGAAGCT GATCACGCCG GAGCAGGCCA ATGAGCGGGC GATGAGCCTG
CTGCAGGGCG CGACGGGTGA GCGCCGGCCG GAGGGCCAGG TGACCACCGC CAACCCCGCC
GTGCAGAAGG TGCTCGACCG GGTCAGCCAA TCGTCCAACG CCTCCATGAC GCTGCGCCGC
CCCGACGGCA CCGTCTCGGT GAAGAAGGGC GACGACGCCG GCGGTCGCAT CGACGTCACC
GCCGATCCGC CCGTCGGGCT GATCAAGCAA CCGACCAACC TGGTGTGCTG GGCGGTTGCC
GGGGCGATGA TGGCGAACTG GCGCGACCGC CGCTCGGCGA CGATTCAGAC GGTGCTCGAC
GAGCTCGGCG GCACGTGGCG GGCGAAGTTC GACGCCAACC AGGCCCTAGC GGTGGCCGAA
ACCGTCGGGT TCACCAAAGC GCTCGGTCTC GTCGCCGAAG GACCGGCGTC GTACCTGCCG
ATCGGCATCG CTCGGCTGGT CGAGACCTAC GGCCCGCTGT GGGTGATCTC CGACGACACC
GTGGAGAACA ACCAGCTCGT CCACGTGCGC ATCGTCACGG GGATAAGCGG CGGCGGCACG
GCCGAGACGA CGATGGTCAA CATCGTCGAC CCGGACCTCG ACCAGCCGAC GACGGAGACG
TTCGCCGAGT TCGCCCGCCG GTTGGAAGCC AACGACGCCA TCGACGTGGG CCTCGGGATC
ATGCGCTTCC CGACCGTGCA GACCACTTGA
 
Protein sequence
MMRGKLNPDP VRPLDGYRVT ANVDVPMLGD DGQTTTTVAV GADVTSGGDV AIELPERRTG 
DVRLVATAPD GSRVGATTIA AANVDQPFTL DVTTVDPVTV VPSDVPGLGR RERLVGRVLD
PQGAGAPAKL LVVVWGVPHG AQPTDAVPLL VAETAAGGYF GDDLPPDELD DAWAKVAGGA
PVPILLDGGR LPRTVVIVTP VKASHDNDDD GCGCHAAPPR SPDQADLAAN PEAFAADPGR
CVDLTVPNRA LEEVTFHAVV RTTQPEIRGL QMPNADDVPS PVVDRLVQLV ASRPIEAIAR
LESPSTYTTS VDAAIKPASS PQLAALAVRA AVPEVDEPTV RRSLNLRADV VGELMRDGAH
LRVDDLLHAE RVSTIRTVRD IVQAVGGAPA GRVELGGETQ VDWDDTPTLY QATTIAHGHL
LTFKQVWRAD GYSLGDLLYS LPLAPGQKKL IAVVDWDRRE VEVRQAQRSE TEVLDASLAH
DRDISEVISS SLRQSMRGSS SADTESIAGG IAGFIGPVVF GFGGGSSSAG STASQRSSRD
VAGTMLNQAR DRTLQAASAV RSQRATVVQT ARQGESLRVQ TDVVANHNHC HAMTVEYFEV
LRHLQVTQEI AHVQECLFVP FEITPFNGAK ALRHRDVLVR HVPGALRRGF DALERMRTNW
SDADFPLARY ADEPLTYLDG ELQMSFSIPR PADGGTDKDQ FDSSKWSVYE ASGLLAQPAQ
TLWNAYLGTA VVANRDAIFA ASIAPNIAQR IVDRLALRLV LDNGSPVGVG IDATLVSRYR
PDAPLLVSLR PTSTTPAVVR AQIERAVISL GALLPPEAKL VVQQAGLRYR TAHLEHDLVP
TQRTYNDLGP GDEVELVVGL DRQEKRNPRA EDPRLADALV DHLNEHIEHY HQAIWREMDP
NRRYLLLDGF VAPNAGGRSV ASVVENRLIG IVGNCLVMPV VPGVKLDRSY EHSKRTQTDL
LDLYATDPAP PMRISLPTKG VFAEAVLGAC NSCELKDDTR FWRWEESPIP DEPTPIEQVS
TATRRSTVPN LTPDAFPQAL VRFQDVPTAP APSALAAALE LIGTPNLFSD ITGLALNQAN
SAAALEEASK TATFFAQQGA ALAQQRFLAS DTQRQLKRIK DARDQKLITP EQANERAMSL
LQGATGERRP EGQVTTANPA VQKVLDRVSQ SSNASMTLRR PDGTVSVKKG DDAGGRIDVT
ADPPVGLIKQ PTNLVCWAVA GAMMANWRDR RSATIQTVLD ELGGTWRAKF DANQALAVAE
TVGFTKALGL VAEGPASYLP IGIARLVETY GPLWVISDDT VENNQLVHVR IVTGISGGGT
AETTMVNIVD PDLDQPTTET FAEFARRLEA NDAIDVGLGI MRFPTVQTT