Gene Franean1_5364 details

Gene Information

Locus tag: Franean1_5364
Symbol:
ID: 5673698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6469229
End bp: 6470287
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 69%
IMG OID: 641244222
Product: hypothetical protein
Protein accession: YP_001509628
Protein GI: 158317120
COG category: [V] Defense mechanisms
COG ID: [COG0842] ABC-type multidrug transport system, permease component
TIGRFAM ID:
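
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6469229
end_bp = 6470287

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Split the CDS into codons; the last codon is assumed to be the stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1059 bp and protein length of 352 aa.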


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCCAGC GAGTCCCCAC CCGACAGCCC CACGACCACA TCCTGGACGT GGCCCATCCG 
CCTCCCCGCC CTGGAACCTC GCGGGCCGTC TTGCTTCCGG TAGCGATCGT CCTGCTGATC
GGCACCGTGT TCGTCAGTGT CTATCTCGCT GCCTTCCACG CGCCGCGCCC CCATCAGCTC
CGCGTGGGCA CAACGATGAT CGGCACGCAT CAGGTGGACC TGCGCCGGGA TCTCGCGCGC
GCCATCCCCG GCGGCTTCAC GCTCGAGACC TACCCCGACG AATCCACCGC CCGGCAGGCC
GTCCAGCACA GGTTCGTGTA CGCGGCCTAC CTCGGCGGGG GGAGGTTGCT CTACGGCAGC
GCCAATGGAG CCGCCGTCAC CGCAACTATG ACCACTGCTT TCGGTTCCGT GGCCCGCGCG
GAGCACGATC ACCTTTCTGT CGAGGACGTT GCCCCGGCAG CAGCGGGGGA CACTCGGGGG
CTGTCCGTCT TCTACACAGC CTTCGGCCTG GTGCTCGCCG GCTACCTTTT CGGCATGACG
ACCTACCAGG TCGCACCCCG GCTCCAGTAC CGCTGGCGTA TGGCCAGCCT GGCCTTGTTC
GGAGTTGTCG GCGGCGTCCT CGTCGCCGCC ATCGCCGGGA GCGCGGGCTT CGGTGCGCTA
CCCGGCCCCT TCCTGCCCCT CGCCATCATC GTCGCGCTGA TGGGCGCCGC GGTCGGCGCC
ACGACCATGG TGCTGCTGCG GCTGTTCGGC TCCGCAGGCG TCAGTCTCGC CTCGATACTC
CTGCTGATCC TTGGCAACGC CAGCAGCGGC GGGATCATGC CACCGGCTTA CCTTCCTGCT
TGGCTACGCC CGCTGTCTGA GATCCTGCCC GTCGGGGTGG GCGTCCGCGC AATGCAGGGG
CTGTCCCGGT TCCAGAACGA CGGTTTGTCC CGCGCCCTGG TGATCCTCCC GCTGTGGGTG
CTCGGCGCCG CCGTGGTGCT CCACCTGAAG GACGTGTTCC GGCGCGATGA TCCGAGTGGC
GCGCGAGACA AGGACGAATC TCAGCCTGCG GTGGGCTGA
 
Protein sequence
MSQRVPTRQP HDHILDVAHP PPRPGTSRAV LLPVAIVLLI GTVFVSVYLA AFHAPRPHQL 
RVGTTMIGTH QVDLRRDLAR AIPGGFTLET YPDESTARQA VQHRFVYAAY LGGGRLLYGS
ANGAAVTATM TTAFGSVARA EHDHLSVEDV APAAAGDTRG LSVFYTAFGL VLAGYLFGMT
TYQVAPRLQY RWRMASLALF GVVGGVLVAA IAGSAGFGAL PGPFLPLAII VALMGAAVGA
TTMVLLRLFG SAGVSLASIL LLILGNASSG GIMPPAYLPA WLRPLSEILP VGVGVRAMQG
LSRFQNDGLS RALVILPLWV LGAAVVLHLK DVFRRDDPSG ARDKDESQPA VG
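
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative example, not the method used to produce the record: it builds a codon lookup from the amino acid string that NCBI lists for translation table 11, and the `translate` helper is a hypothetical name. It ignores the alternative start codons that table 11 allows, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGTCCCAGC GAGTCCCCAC CCGACAGCCC CACGACCACA TCCTGGACGT GGCCCATCCG"
print(translate(first_row))  # MSQRVPTRQPHDHILDVAHP
```

The 20 residues printed match the first two groups of the protein sequence above. Passing the full gene sequence to the same function should reproduce the complete protein.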