Gene Franean1_1834 details


Gene Information

Locus tag: Franean1_1834
Symbol:
ID: 5670236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2200565
End bp: 2202724
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 77%
IMG OID: 641240755
Product: hypothetical protein
Protein accession: YP_001506178
Protein GI: 158313670
COG category:
COG ID:
TIGRFAM ID:
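
The coordinate and length fields above are mutually consistent, and the short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record above.
gene_length, protein_length = cds_lengths(2200565, 2202724)
print(gene_length, protein_length)
```

With the values from this record, the result agrees with the listed Gene Length (2160 bp) and Protein Length (719 aa).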


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.610374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.1985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGCAG GGCGCGGACC GCGCTGGCGC GGCGGGCGGG CCGGCCGCGC CGCCGAGGTG 
CTCGACGGCC GGCGTGAGCC CCTCGACGAG ACCGAGCTGC AACTGGCGCG CGCGGTCGCG
ACCCTGCGGG TGCTGCCCGG GGCACTGCCC GCGCCGGCGA TGTCCCCCCG GGTCCGCGCC
GAGATCCGGG CGATGCTCCT CGCCGAGCCG ATCGGCGCCC GCGGCACCCA CGAGGTTGGT
GAGGGCAACG GCAGGGAAAC CGTCGGCGCG GCTGAGAGCA CCCGGCGGGC ACAGCCGACC
CGACCGGCCC GGCCCGCCGC GCTCGTCACC CGGCCAGGCG GGATGGGCGT CCGGCAGGCG
TTGCGTGCCG CCCGGCCGGC CCTGATCGGG GCGCTGGCCG TCTCGGTCGC GACGGTGGGC
GTCGCCGTGA GCGCCGAGCA GGCCCTGCCC GGTGAGCTGC TCTACGGAGT GAAGCGGCAG
GTCGAGCAGA TTCAGGTCAG CCTGGCCGGC AACCGCGTCG ATCGCGCCAA GATCCAGCTC
TCCGTCGCCC GGAACCGGAT GGACGAGCTC GCCGCGGTCG TCCATCCGGC AGCCCGGCCA
CCGGCCAGCA CGCGCACCGA CTCCCCCACC ACCGGCGGAA CCACGGCTGG CCTCGGCCCA
ACTGACAGCG ACCCAACTGA CAGCGGCCCG ACTGGCGGCG GTGCCGCCGG ATCGCCCTCG
GGCCAGCCAC TTCCAGCAAC GACCCCGGGC CCCGGATCTG CCGGGGCGCC GCCGACAGAG
GCGGCGCCAG AGCAGACGGG CGACCCGGTC GCAGGCGGCC TGGTGCGAAC GGGCGGATCG
GTGCCGGCGG GCGCCGCGAC CCCCGATCCG GCCGCTTCCT CCCCGAGCCA CGCGGCGCCC
GGGGGGCGTC CGGCCACGGA CGGGGACATC GGCACGGTGA CCAGGCTGCT GCGCGACTGG
TGCGACGAGG CCGGGGCGGG GAGCGCGGTC CTGATCGAGG AGGCGCTCGC GGGCAGCCGC
GACGCCCGGG CGACCCTGAA CGAGTTCGCC GCGGACCAGT CGACGCGCCT CGAGGCGCTG
TTCGACGCGC TACCGACCGG CTCGGTGCCC GGCGCGCACA AGGCTCGGCG GATCATCCAC
GACGTCGACT CGGCGCTGGC CGTCACCGCC CCGGAGGCCG ACGGGCCGTC CGGGGCCGGC
ACCGCGGCGG GGCGCGGCGG CACGACGGCC GGGACGGCGA GCGGCGGCAC CTCACCTGAC
CCGACCGGCC GGGCACGTTC GGCCGACGGG ATGACCAGCC CGACGGTCAC TGAGCGCGGC
GACTTCCAGC GACAGTCCAC CGGGCCGCGG GCTACGGCGC CCGTGCCGAC CGTCTCCGCG
GCGATCGGCG GCATGGCCGG CGGGTTCCTC CCCGGCCTGC CCTTCCTCAC ACCGTCACAG
TCACTGACAC CTGACCAGCC ATCGGCGTCC GGCAAGCCGT CAGCGCCTGG CCAGCCAGCG
GCATCTGGGC AGGCGTCGAC GCCCACGCCC GCATCGCCGC CGCCCGGCAC GGTTCCCGGA
GATCCGGGCG CCCCGCCGGC GGACCGGTCG GCACCCGATC TCCCGCTTGT TCTGGAACCA
GAGCCGTCGG CACCGCCGCC GACGGGCGGG CATCCCGGCG GCGGCACGTC GGCACAGTCC
TCCCCCACGA CGCCGCCGGA CGTCCTCGAG ACGTTCGACG CGCCCGCGAA GGCGGTCGGC
CCCGCCGATG ACCCCGCCGA TGACCCCGCG GTCGACGATC CCGCGGTCGA CGATCCCGCC
GGTGGCGGGG CCGAGCAGAC TCCGGCGGAC GGTTCGTGGT CGGACGGTCC GCGGCCCGAG
GTGTCCCCCG TCGCTCAGCC GTCACCGGCG GCGTCGGACG ACCCGCTGGC GGAGAAGGCC
GTCTTCGGGG AGTGGGCGGC CGAAACCGCC CCGGACAGCG CGGAACCGCC CAGCCCCGAG
ACAGCCCCGA GCGGAAAGAC GGCCCCGACC GCCGGGACGA CGCCGACCAG CGCGACGACA
CCGACCGGCG AGGCAGAGCC GACCGGCGAA GCGGAGCCGA CCGGCGGAAC CGCGGTCACC
CCGACGACCG AGGCCACCGA GGACGTCACC GGAGGGAACG CGCCGGCCGG GCAGCCCTGA
 
Protein sequence
MAAGRGPRWR GGRAGRAAEV LDGRREPLDE TELQLARAVA TLRVLPGALP APAMSPRVRA 
EIRAMLLAEP IGARGTHEVG EGNGRETVGA AESTRRAQPT RPARPAALVT RPGGMGVRQA
LRAARPALIG ALAVSVATVG VAVSAEQALP GELLYGVKRQ VEQIQVSLAG NRVDRAKIQL
SVARNRMDEL AAVVHPAARP PASTRTDSPT TGGTTAGLGP TDSDPTDSGP TGGGAAGSPS
GQPLPATTPG PGSAGAPPTE AAPEQTGDPV AGGLVRTGGS VPAGAATPDP AASSPSHAAP
GGRPATDGDI GTVTRLLRDW CDEAGAGSAV LIEEALAGSR DARATLNEFA ADQSTRLEAL
FDALPTGSVP GAHKARRIIH DVDSALAVTA PEADGPSGAG TAAGRGGTTA GTASGGTSPD
PTGRARSADG MTSPTVTERG DFQRQSTGPR ATAPVPTVSA AIGGMAGGFL PGLPFLTPSQ
SLTPDQPSAS GKPSAPGQPA ASGQASTPTP ASPPPGTVPG DPGAPPADRS APDLPLVLEP
EPSAPPPTGG HPGGGTSAQS SPTTPPDVLE TFDAPAKAVG PADDPADDPA VDDPAVDDPA
GGGAEQTPAD GSWSDGPRPE VSPVAQPSPA ASDDPLAEKA VFGEWAAETA PDSAEPPSPE
TAPSGKTAPT AGTTPTSATT PTGEAEPTGE AEPTGGTAVT PTTEATEDVT GGNAPAGQP
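
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial code named in the record), GTG is an accepted start codon and is read as methionine at the first position. The following is a minimal sketch of that translation step, not the database's own pipeline. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the function assumes the input begins at a valid start codon.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same sense codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the sequence starts at a valid start codon, so the first
    codon is always reported as M (e.g. GTG at the start of this gene).
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "GTGGCGGCAG GGCGCGGACC GCGCTGGCGC GGCGGGCGGG CCGGCCGCGC CGCCGAGGTG"
print(translate_cds(first_60))  # MAAGRGPRWRGGRAGRAAEV
```

The printed peptide matches the first 20 residues of the protein sequence listed above.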