Gene Francci3_1244 details

Gene Information

Locus tag: Francci3_1244
Symbol:
ID: 3903543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1488854
End bp: 1490533
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 71%
IMG OID: 637878578
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_480351
Protein GI: 86739951
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
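
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1488854, 1490533)
print(length)                  # 1680, matching "Gene Length"
print(protein_length(length))  # 559, matching "Protein Length"
```

Under these assumptions both computed values agree with the reported Gene Length (1680 bp) and Protein Length (559 aa).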


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.198291
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGC AGACCCAGGC GCCCGCCGCG GCGGGCGGGG GCATGACGGG CGGTAGCGGA 
GATCGGCTCG ATCCGGCGCT GATCCGTCTC GCCGGGATCG TCCTCGTCGG TGCGGTCGTG
GTGCAGCTCG ATGCGACGAT CACCTCCGTG GCGATCAACA CCCTCGCCCG ATCCTTCAAC
GTCGGAATCT CGACAATCCA GTGGGTGAGT ACCGGTTACC TGCTCGCGCT CGCCATGGTG
ATCCCGGTGA CCGGCTGGTC AGCCGAGCGG TTCGGCGCCA AGCGGATGTG GCTGCTGTCG
CTGGTCCTGT TCCTCGTCGG CTCGGCGCTG TGTGGGGCGG CATGGTCGGC CGGCAGTCTC
ATCGCCTTCC GGATCGTGCA GGGCCTCGGC GGCGGCCTGC TTCTGCCTCT GATGCAGACG
ATCATCGCGC AGGCGGCCGG GCCGGAACGG CTGGGCCGCC TGATGGCGGC GGTGGGGGTG
CCCGCGCTGG TCACTCCCGT GCTCGGGCCG GTCATCGGCG GGCTGATCGT CGACGATCTC
GACTGGCGTT GGATCTTTTT CATCAACGTG CCGGTCTGCC TGATCGGGCT GGTCCTGGCC
TGGCTGGGGA TGCCGGATGT GCGGACTCCC GGGCGGCATC GCTTCGACGC TCTCGGGTTC
GCGTTGCTGT CCCCGGGGCT GGCCGCGATC GTCTACGGCT TCTCCGTGGC CGGCCGGCAG
GGTGACTTCA CGGGTGTGCG GGTGATCGTG CCGCTGGCCC TCGGCGCGGC CCTGCTCGTT
CTGTTCACGG TGCATGCCCT GCGGACCGCC GTCGAACCGA TCATCGACCT GCGGCTGTTC
CGGTCCCGGG CCTTCGCCGG TTCATCGGGG ATGATGTTCC TGTTCGGGAT CTCGCTGTTC
GGGGCGATGT TCCTGCTGCC CCTGTACGAG CAGCAGGCCC GTGGCCGCAG CGCCGCCGCC
GCCGGCCTGC TCCTCGCCCC GCAGGGGTTG GGGATGATGA TCGCCCTGAT TGTGCTGGGC
CGGGTGGCGG ACCGGCGCAG CCCGCGGTTG TTCGTTCTGG TCGGCCTTCT GCTCAGCGCG
CTGGGATCGG TTGCGTATAC ACAGGTCGCC GCCGACACCA GCGAGGTGCT GCTCGGGGTC
TCCCTGACGG TCCGCGGCAT CGGGCTGGCC ATGGCGCTCA TCCCGGTGAT GTCCTCTGCC
TACCACGGGC TGCGCCGGGA GGAGATTCCG CGCGCCACCT CCGCGGGGCG GATCTTCCAG
CAGATCGGCG GCTCACTCGG GACGGCCATC CTCGCCGTGG TGCTGTCCCA CCAGATCACC
GGCCGACCCG CCGGGACCGG ACCCGCCGAT CCGGTGGCGC TGGCCGGCGC CTTCGGTACG
GCGTTCTGGT GGACCCTCGG GTCCACGGTC CTCGCCGCAC CGTTCGCCTT TCTGCTGCCG
GGACGGCCAG CCGGCGCGGA GCAGCCCGCC CCGTCAGGCC CGCCGGTCGA GCTCCCGCGG
CCGGCCGGGG CCGTCCCCAC CCGTCGCCGG TCGGGCGACC ACGGGGAGCT GAAGAGCCAT
GAAACTTCTC CCGGCCGCCG CGCCGCCGTC TCAGGGGCCG AGGGGAACCG CCACGATGGT
CGGGAGTACC GGGTCCCCGC TCCCCGTGGC GGCCGAGGCG GTGGCCAGTG GCGCGGATGA
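
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction that could be compared with the reported GC content of 71%. The helper names are hypothetical, and only the first three blocks of the sequence are used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGGCAGCGC AGACCCAGGC GCCCGCCGCG"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full sequence into `clean_sequence` in place of `first_blocks` gives the value for the whole gene.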
 
Protein sequence
MAAQTQAPAA AGGGMTGGSG DRLDPALIRL AGIVLVGAVV VQLDATITSV AINTLARSFN 
VGISTIQWVS TGYLLALAMV IPVTGWSAER FGAKRMWLLS LVLFLVGSAL CGAAWSAGSL
IAFRIVQGLG GGLLLPLMQT IIAQAAGPER LGRLMAAVGV PALVTPVLGP VIGGLIVDDL
DWRWIFFINV PVCLIGLVLA WLGMPDVRTP GRHRFDALGF ALLSPGLAAI VYGFSVAGRQ
GDFTGVRVIV PLALGAALLV LFTVHALRTA VEPIIDLRLF RSRAFAGSSG MMFLFGISLF
GAMFLLPLYE QQARGRSAAA AGLLLAPQGL GMMIALIVLG RVADRRSPRL FVLVGLLLSA
LGSVAYTQVA ADTSEVLLGV SLTVRGIGLA MALIPVMSSA YHGLRREEIP RATSAGRIFQ
QIGGSLGTAI LAVVLSHQIT GRPAGTGPAD PVALAGAFGT AFWWTLGSTV LAAPFAFLLP
GRPAGAEQPA PSGPPVELPR PAGAVPTRRR SGDHGELKSH ETSPGRRAAV SGAEGNRHDG
REYRVPAPRG GRGGGQWRG