Gene Franean1_4243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4243 
Symbol 
ID5672598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5050990 
End bp5052504 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content72% 
IMG OID641243116 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001508533 
Protein GI158316025 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.891003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTTT CCACCGACGT CGTCGGCGAC GGGCACCCGC CCGCCGCGGT CGGGTCACCA 
GGGGCGAGTA GCCAGCTCGA CCCGCGACGC TGGCTGGCGC TCAGCATCAT CGCGGTCGCC
CAGCTCATGG TCGTGCTCGA CGCGTCGATC GTGACGATCG CCCTCCCGCA CGCCCAGAGC
GACCTGGGCA TCTCCACCGC CAACCGGCAG TGGGTCATGA CCGCCTACAC CCTGGCGTTC
GGCGGCCTGC TCCTGCTCGG CGGGCGCATC GCCGACTTCC TCGGACGCAA GAGGATCTTC
ATCTGGGGGC TGGTCGGCTT CGCGGCCGCC TCGGCGCTGG GCGGCGCCGC CCCCAACGCC
GAGTTCCTTT TCGCGGCCCG GGCCCTGCAG GGTGCCTGCG CGGCGCTCCT CGCCCCGGCG
GCACTCTCCC TGATCACCGT CACGTTCACC GAGGGCAAGG AGCGCGCCCG CGCGTTCGGC
GTCTACGGCG GGATCTCCGG TGGCGGCGCG GCGATCGGGC TCATCGTCGG CGGGCTGCTC
ACCGAGTACG CGTCCTGGCG CTGGTGCCTG TTGGTCAACG TCCCGATCGC CCTCGCCACC
GCCGCGGCCG CGCTGCCCAT CGTGCGGGAG AGCAAGGCCG AGGGCACGCC GAGCTACGAC
ATCCCCGGCG CGGTGACCGT CACCACCGGC CTGCTCGCGC TCGTGTACGG GTTCACCGTC
GCGGCCGACG ACGGCTGGGG CTCGGCGACC ACGATCGGCC TGCTCGCCGG CGCGGTGGCG
CTGCTCGCGG TGTTCGTCGT GATCGAGATG CGCACGGCCG CCCCGCTGCT GCCGATGCGC
GTGCCATTGG AGCGCAACCG AGGCGGCTCC TTCCTGGCGT CGCTACTCAT CGGCGGCGGC
CTGTTCGCGA TGTTCCTGTT CCTCACCTTC TACTTCCAGT CGACGCTCGG GTACAGCGCG
CTGCGCAGCG GCTTCGCCTT CCTCCCGTTC AGCGCGGGCA TCATCCTCTC GGCCGGTCTG
GCCAGCCAGT TCCTCCCCCG GGTGGGGCCG ACGATACTAA TGATCATCGG CACCGCGCTG
GCTGCCGGCG GGCTGGTCCT GCTTAGCCAG ATCGGGGCGG ACTCCAGCTA CGCGGGCCAC
GTCCTGCCCG CCGAGGTCCT GATCAGCCTC GGGATGGGCC TCGCGTTCGT CCCGATGTCC
AGCGTCTCCC TGCTGGGGGT CGCCGACCAC GACGCCGGTG TGGCGAGCGC GCTGGTCAAC
ACCACCCAGC AGGTCGGCGG GTCACTCGGC GTCGCGCTGC TGAACACCGT GTACGCGACG
GCGGTCTCGG ACTACCTGGG CTCGCACGGC ACCGGCGCGG CCGCGCAGCG GCAGGCGGCC
ATCGAGGGCT ACACCACGTC GTTCGTATGG AGCGCCGTGC TCGTGGGGAT CGCCCTGGTC
GCGGTGATCC TGCTGGTCCG TGCGGGCCGG GACGACGTCC CCGCGGTCGA CGGAGTGCCC
GTCCACGCCG GATGA
 
Protein sequence
MTVSTDVVGD GHPPAAVGSP GASSQLDPRR WLALSIIAVA QLMVVLDASI VTIALPHAQS 
DLGISTANRQ WVMTAYTLAF GGLLLLGGRI ADFLGRKRIF IWGLVGFAAA SALGGAAPNA
EFLFAARALQ GACAALLAPA ALSLITVTFT EGKERARAFG VYGGISGGGA AIGLIVGGLL
TEYASWRWCL LVNVPIALAT AAAALPIVRE SKAEGTPSYD IPGAVTVTTG LLALVYGFTV
AADDGWGSAT TIGLLAGAVA LLAVFVVIEM RTAAPLLPMR VPLERNRGGS FLASLLIGGG
LFAMFLFLTF YFQSTLGYSA LRSGFAFLPF SAGIILSAGL ASQFLPRVGP TILMIIGTAL
AAGGLVLLSQ IGADSSYAGH VLPAEVLISL GMGLAFVPMS SVSLLGVADH DAGVASALVN
TTQQVGGSLG VALLNTVYAT AVSDYLGSHG TGAAAQRQAA IEGYTTSFVW SAVLVGIALV
AVILLVRAGR DDVPAVDGVP VHAG