Gene Franean1_5862 details


Gene Information

Locus tag: Franean1_5862
Symbol:
ID: 5674185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7111210
End bp: 7112787
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 80%
IMG OID: 641244712
Product: hypothetical protein
Protein accession: YP_001510114
Protein GI: 158317606
COG category:
COG ID:
TIGRFAM ID:
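
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(7111210, 7112787)
print(length, protein_length_aa(length))  # prints "1578 525"
```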


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000662704
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.168546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCGGA TCGTGGTGCC GGCCGGGGTG GCGTGCCTGG TCGGGGTGGT CGCGGCCCTG 
CTCACCACGG CGGGGCCTCG TCCGCCCGCA GCCGTCCCCA GACCGGTCAC CGCGGCCACC
CTGGACTGCC CGGATCTCGG CCTGAGGGGG CAGGTGCCCC AGCTGCTGGA CGTCGTGCGC
GGCTCGGGGC CCGACGGGGT GGTCCGGCCA TCCGGTGGCG GCGCGCTGCT CGGCGGCGAC
CGCCAGCACG ACGAGCTGCT CTACCTGCCG CGTCCAGACC CGGGTGGCCC AACTCCGGGC
GGCCCGGCTC CGGGCCGCAC GGACCTGGAC GGGCCGGACA GCGGCCAGTC GGTGGCACGG
GGGCCTGCCG GCGGTCCGTT GCGGCTTGTC GCGACCGGGT CCGCCGCGGC CGGCCTCACC
GCGACCGTGA CCTCCCCCGG GTCGGGAGCG GGACCGTTGC GAGCCCGCTG CGAGCAGTCC
CGCGCGCGGA CGTGGTTCGC CGGCCCGGCC ACCGTCGCCG GCCGCGATCC CGTCCTGTAC
TGGACGAACA CCGGTCCGCG GCCGGCCCGG GTCAGTGTGG GCGCCGTGTC GTCGGGCCAG
ACGGCGCCCC GGGTGGAGGT GACCGTTCCG GTCGGGCGCA CGGTCAGCCG GCGCCTGGCC
GAGCTCGCCC CCGAGGCGAC CGTGACCACC GTCGACGTCG ACGTGCACAC CGGCCGGGTC
CTGAGCTGGA TGGTCGACCG CGCGAGCGGC TCCGGGCCGG CGGCGGCCAC GCCCGTGCCG
CCGACCGCCG GTCCGGCCAC CCGGGTCCTG CTCGGCGGGT TCCTCACTCC TGCCGGGTCC
GGCGGTACGG GGGCACCGGC CGCCGGCCCG CCAACGGCCG ATCTCGTGCT CTCCGCTCCC
GGAGCGGCGG CGACCGTACG CGTCAGTGTG ATCACGGCCT CGGGTCGCCA CACCCCGGTC
GGCCTGGAGG CCGTGCGGAT CCCGGCCGGC GCGGCGCTGC GCCGCCTGGT CACGCTGACT
CCGGCGAGTC CGTCCGCCCT GCTGGTCGAG TCGACGGACG GGGGCGGGAT CGTCGCGGCG
CTCGGCCTGC CCACCGGCGT GGCCGCGGCG GCCCCCGCGG CGCCGGGCAG CGGTCCGCCG
AACGGGCGTA CCTGGGTCGC CGGGGTCGTT CCGGAGCGGC CCCGCTGGGC GGGCGTCGTC
GTCGGCGACC CGGGCGCGGC CGGGCCGACA CCGCCGGGTC TGGTCGTCGC GGCGGCGCCG
GTACCGGCCT GGACGGCCGG CGCGCTCGTC CTGGTGGCGC CCCGGCGGGC CGCCACCGTC
TGGGTCGACG GCCGCCGGTT CGAGGTCGGC GCGGGTCGCG CGGTGCTGGC GCCGCTGCCG
GCGGGCCGCG TCGGCGCGCG TGTCGTCGGC ACCGGCGGCC CGCTCGTCGC GAGCCAGGTC
CTCGGCACCG CGCCGCCGTC CGCGGGAGTG GTGACCGCGC TGGTGCCGCG GACCGTCTCG
GCCGTCGTGC CGCTTACTGG CGCGTGGCGC CTCCGGTACG GTCCGGCGTC GCTGGCCGAT
CCGCGCGTCG CCTGGTGA
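
A GC content figure like the one reported for this gene can be recomputed from the sequence as the fraction of G and C bases. The sketch below is one minimal way to do that for sequence text in the 10-base blocked layout shown above. The helper name `gc_fraction` is hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_fraction(sequence):
    """Fraction of G and C among the A/C/G/T characters in `sequence`.

    Spaces, line breaks and any other characters (including IUPAC
    ambiguity codes) are ignored, so blocked text can be pasted directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("GTGAAGCGGA TCGTGGTGCC"))  # prints 0.65
```

Passing the full gene sequence as a single string gives the value for the whole CDS.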
 
Protein sequence
MKRIVVPAGV ACLVGVVAAL LTTAGPRPPA AVPRPVTAAT LDCPDLGLRG QVPQLLDVVR 
GSGPDGVVRP SGGGALLGGD RQHDELLYLP RPDPGGPTPG GPAPGRTDLD GPDSGQSVAR
GPAGGPLRLV ATGSAAAGLT ATVTSPGSGA GPLRARCEQS RARTWFAGPA TVAGRDPVLY
WTNTGPRPAR VSVGAVSSGQ TAPRVEVTVP VGRTVSRRLA ELAPEATVTT VDVDVHTGRV
LSWMVDRASG SGPAAATPVP PTAGPATRVL LGGFLTPAGS GGTGAPAAGP PTADLVLSAP
GAAATVRVSV ITASGRHTPV GLEAVRIPAG AALRRLVTLT PASPSALLVE STDGGGIVAA
LGLPTGVAAA APAAPGSGPP NGRTWVAGVV PERPRWAGVV VGDPGAAGPT PPGLVVAAAP
VPAWTAGALV LVAPRRAATV WVDGRRFEVG AGRAVLAPLP AGRVGARVVG TGGPLVASQV
LGTAPPSAGV VTALVPRTVS AVVPLTGAWR LRYGPASLAD PRVAW