Gene Franean1_5336 details


Gene Information

Locus tag: Franean1_5336
Symbol:
ID: 5673670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6431591
End bp: 6432943
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 65%
IMG OID: 641244194
Product: hypothetical protein
Protein accession: YP_001509600
Protein GI: 158317092
COG category:
COG ID:
TIGRFAM ID:
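
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 6431591
end_bp = 6432943
gene_length_bp = 1353
protein_length_aa = 450


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions, 6432943 - 6431591 + 1 = 1353 bp, and 1353 / 3 - 1 = 450 aa, which matches the listed values.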


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.975729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.140148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCCA CCGGCGTCCG TCCCTGTCCG CTGGCGTCCG TGGCTGTTGC TACAGAGGTA 
GCTACACCTG GCGGGGCCTC CGACGCCCGG TGCCTGGACG AACCTTCCTC CGGCCTGGGT
AGCCTGCGTG CCGTGGACGA TAAGTCACAG GTCGTTCCGT TTGTCGACCT CCCCACCGCC
GCCCTTGTAG TCGATCAGCT GTATGAGGGG GGCACGGCGG GCACCCTTGC CGATGACCCT
CTGGCGCGGC TGCTGCCTGT CGGCAACCAA GGAGGGTTTC GGTACGCGGG CTCCCCCCGC
AAAGGCACCG TCCGTCTCTC GGTGCTTTAC ACCACCGGGG CAGTAGCAGA CTGGCCAGAC
ACTCTCGATC CCTCGACCGG GGTCTTCACC TACTACGGCG ACAACCGCAA ACCAGGTCGG
GACCTGCACG ATACCCAACG TTCTGGCAAC CTCCTCCTGC GTGACGTGTT TGAACACGCC
CACGGCAGCG TGGAGGAACG CCGTACAGTC CCGCCGTTCC TGCTGTTCGA AACAGCGCCA
CCGGGACGGC GCATCATGTT CCGTGGCCTA CTTGCCCCCG GCGCGGCCAC CCTCACCAGC
GACGACGATC TCGTCGCGAT CTGGCGTAAC ACCCGCGGAC ACCGCTTCCA AAACTACCGC
GCCCACTTCA CCGTGCTCGA CGTCGCGACC GTCACCCGCA CCTGGCTAAC CGACATCCTC
GCCGGACACG CTACCGACAG CGAGCACTGC CCACCTGCGT GGACAGCCTG GGTCGACGGT
CGCGCCTACA GCCCGTTGAT CGCACCTTCG ACCACCATCA TCCGGACCAA AGCAGAACAG
CAACCCCCCG ACCCTACCGG GGTAGCGATA CTCGCCGCCA TCCGCGAGCA CTACCGGGGA
CACGAACACG ACTTCGAGTT CTGCGCGGTC GAGCTGTGGC GACTCATCGC GCCAGCCACT
GGCAGATGTG ATGTCACCCC GCCGAGTCGG GACGGGGGCC GCGACGCCAT CGGCGACTAC
ATCCTCGGCC CACTCTCTGA CCCGATCGCC ATCGACTTCG CTTTGGAAGC CAAGTGCTAC
ACCGACACCA ACTCCGTCGG CGTCCGAGAT GTCGCCCGGC TGATCTCCCG GCTACGCCAC
CGCCACTTCG GCGTCTTCAT CACCACCTCC CACTTCAACC AGCAGGTCTA CACCGAAGTA
CGCACCGACC GGCACCCCAT CGCCCTGGTC AGCGGACGCG ACATCGTCAA TGCCCTCCGC
GCCCACGGCT ACGCGGACGT CAACGCCGTC AACGCATGGT TAGGCAAGAT CCCGAATGTC
CATGTCTCCG CGAAGGGAGC ACCTAATCCG TAG
 
Protein sequence
MQSTGVRPCP LASVAVATEV ATPGGASDAR CLDEPSSGLG SLRAVDDKSQ VVPFVDLPTA 
ALVVDQLYEG GTAGTLADDP LARLLPVGNQ GGFRYAGSPR KGTVRLSVLY TTGAVADWPD
TLDPSTGVFT YYGDNRKPGR DLHDTQRSGN LLLRDVFEHA HGSVEERRTV PPFLLFETAP
PGRRIMFRGL LAPGAATLTS DDDLVAIWRN TRGHRFQNYR AHFTVLDVAT VTRTWLTDIL
AGHATDSEHC PPAWTAWVDG RAYSPLIAPS TTIIRTKAEQ QPPDPTGVAI LAAIREHYRG
HEHDFEFCAV ELWRLIAPAT GRCDVTPPSR DGGRDAIGDY ILGPLSDPIA IDFALEAKCY
TDTNSVGVRD VARLISRLRH RHFGVFITTS HFNQQVYTEV RTDRHPIALV SGRDIVNALR
AHGYADVNAV NAWLGKIPNV HVSAKGAPNP
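
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a listing might be joined into one string and how a GC fraction could be computed from it is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and how the record's 65% GC figure was rounded is not stated in the source, so this is only one way to compare against it.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example using only the first block of the gene sequence above.
first_block = join_blocks("ATGCAGTCCA ")
first_block_gc = gc_fraction(first_block)
```

Pasting the full gene sequence into `join_blocks` and passing the result to `gc_fraction` would give a value to compare with the GC content field, and `len()` of the joined string can be compared with the gene length field.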