Gene Franean1_5623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5623 
Symbol 
ID5673950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6836922 
End bp6839258 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content78% 
IMG OID641244476 
Producthypothetical protein 
Protein accessionYP_001509880 
Protein GI158317372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00516948 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCTT CGATGGAGGA TCCGCGCGTA GTCGACGTCG AGCGGTACCA CGGTCAGTTG 
CGCTCGGAAC TCGAGATGAT CGCGAACGGA CCAGTACTGG CCGGCCCACG GATCGTCGAG
CACATCAATT CCCTGATCAC ACTGATTGAC CTGCACCAAC CGAACGAGTT CGACCTCTGC
CTCAGCTGTG ACCGGCTCTG GCCCTGCGCG ACCGTCGTCG CCATCACCGG TGAGCTAGCG
GCCACCGAGG ACGAGGTGAC GCCCCGGCCG CTGCCCGAAG CCCCGGCCAC GCGGTCCGGC
GACGGCCGCC AGGCGGACAG GCGCCCGGCC GACCCCCGCC ACGCGGCGGC GGGAGGCCCC
GCCGACCAGC GCCGGCCGGA CGTTCGCGCG GCCGCGCCAC CACCATTCGA CACGGGCTCC
TACTCGACGA CGACGCAGAC CAGCACCGGC CAGTCGACGA TGCCGCACGA ACTGCCACGG
GGCACGGACC CGGTCCGGCA GCGCGGCGCC CCCGGCCCGG CGGACGAGCC GATGGCCGCG
CCCCGGCCGA CCGGGCCGCA CCGCGTGCCG CCGCCGCCCT CCATGCCACC ACCCTCCATG
CCACCGCCGT CGCATCCGTC CGGCATGCCG CCGGGCACGC GCCCGCCGGC CGCCGTTCCG
CCGGCCGCGG CCGCGGCCAC CGGCGGGATG CCCACCGGCG GCTCGCGGAC GGGTGGGATG
CCCCTGGGTG GCGCGGGCAA CAGCGCGTCC GCCCGCACCC CCGTATTCGG CGACCCGTTG
GCCGGGATCC GCCGTCCGGG CGCCGGCATG CCGCCGCGGG TCGGCGGGGG CCCGCCCGGC
GAGCAGGCCC CACCCGGCCG GCCGGGGCCG CCCCCCGCCG ACACCGGTCA CCGCTCCGCA
CCGCCCGGCC TCCAACCGCC CGGCCTCCAA CCACCTGGCC TCCAGCCGCC GGGGCTCCAA
CCACCCGGCT TCCAGCCGCC CGGACTTCAA CCACCTGGGC TCCAAGCAGC CGGGCCGCCG
CCGGGCACCG CCGGGCAGCC GGGACGTCCG ATGTCGGCCC CCGGCTTCGA GCGAGCCCGC
GAACAGCAGC TTCCCGGTGG CCCGGGCAGC CGGCCCGCGC CCGGGTTCAA CCGCCCGGGC
GGGCCGCCCG CGCACCGCCC CGCGGCCGGC CCGGCCGGCG GCCCGCCGAC CGGGCCGATG
GAGCGCCCCG GCTACAACGG CCAGCCCGGC TACAACGGGC AACCGGCGTA CAACGGGCAA
CCGGCGTACA ACGGGCCGCC CAGCCACAAC GGGCAACCCG GCCACAGCAA CGGGCAGCCT
GGCTACGCCG GGCAGCCCGG CCAGGGCCGG CAGCCAGGCC ATCCCGGTTA TCAGGGCCAG
GTGGAGCAGT CCGGCCCGAT CGCGCGGCCC GGCCTGTTCG GCCAGCCCGC GGCAACGGGC
CCGGGACGGT CCACGGGCCC GGTCTCCACC GGGCTCGGTG ACCGCATGCC CGGACGGGCC
CAGCATCCGG GCGATCCGTC CCGCCCCCTG CCCCACCCCG CGCAGTCCCA ACCGGCGCAG
CACCAACACG CGGCCGGCGT GTCCCGGTCC GAGCCGTCCC GGGCTCCGGC CCGTCCTGAC
GGTGTCGAGG CCCCGGCGGC TCAGGCCGAC GGTTTCTCGG TGCCGACGTC CCGGCAGGCG
CCGCCGATCT CCGGCCCGGT GGAGATGCTG CCCGCCAGCA TCGCCGCCGC CCGGGCCGCG
GAGCTGGCGC GCGGGCAGGC GGCCGCCGCG GCCGGACCAC CGCCGCCGCG GCATCCCGAC
ATCGTCCGCG GCCCCGAACG TGGAACCCGG CCGCCCGTCG GACGGCTCAC CTCCGACCCG
GCGCGCACGC GCGGCCGGCA CGCGGGCCCG GACCAGCCGC GTCCCCAGGG CTCCCCGTGG
AGCAGCCAGG CCGCCGAGCA GCGCCGTCCC ATCTCCGTCG ACGACTCCTG GGCCGGTGTC
GGGCGCAGCC CGCGTGGCCA GAACGGCGAC GGCGGCCAGG GCGGTCAAGG CGGCCAGCCC
GGGCACGGTG GCCCGAACGG CCAGAACGGG CACGGCGGTC AGAACGGGTA TGGCGGCCAG
AACGGCCAGC ATCAGCCGAA CGGCCAGCGG GGGCCCGGCG GCCCGGGACG TCCCGACACC
GACGTCGGCG CGGTCGGCGC GAACCGTTCC AGCGGCCCGG CCGGCCCGGG CCCGGCGGAC
CGTGGCCCCG GCCGCGACCA GGCCTCGGAC CCGTCGCTGA GCCCTGAGGT GGAGGCCGTC
ACGAGGGCCT GGCTCGCGCG CAAGGATTCG GTGCTCGACG GCATCGACGT CATCTGA
 
Protein sequence
MLPSMEDPRV VDVERYHGQL RSELEMIANG PVLAGPRIVE HINSLITLID LHQPNEFDLC 
LSCDRLWPCA TVVAITGELA ATEDEVTPRP LPEAPATRSG DGRQADRRPA DPRHAAAGGP
ADQRRPDVRA AAPPPFDTGS YSTTTQTSTG QSTMPHELPR GTDPVRQRGA PGPADEPMAA
PRPTGPHRVP PPPSMPPPSM PPPSHPSGMP PGTRPPAAVP PAAAAATGGM PTGGSRTGGM
PLGGAGNSAS ARTPVFGDPL AGIRRPGAGM PPRVGGGPPG EQAPPGRPGP PPADTGHRSA
PPGLQPPGLQ PPGLQPPGLQ PPGFQPPGLQ PPGLQAAGPP PGTAGQPGRP MSAPGFERAR
EQQLPGGPGS RPAPGFNRPG GPPAHRPAAG PAGGPPTGPM ERPGYNGQPG YNGQPAYNGQ
PAYNGPPSHN GQPGHSNGQP GYAGQPGQGR QPGHPGYQGQ VEQSGPIARP GLFGQPAATG
PGRSTGPVST GLGDRMPGRA QHPGDPSRPL PHPAQSQPAQ HQHAAGVSRS EPSRAPARPD
GVEAPAAQAD GFSVPTSRQA PPISGPVEML PASIAAARAA ELARGQAAAA AGPPPPRHPD
IVRGPERGTR PPVGRLTSDP ARTRGRHAGP DQPRPQGSPW SSQAAEQRRP ISVDDSWAGV
GRSPRGQNGD GGQGGQGGQP GHGGPNGQNG HGGQNGYGGQ NGQHQPNGQR GPGGPGRPDT
DVGAVGANRS SGPAGPGPAD RGPGRDQASD PSLSPEVEAV TRAWLARKDS VLDGIDVI