Gene Francci3_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2014 
Symbol 
ID3906730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2365197 
End bp2366768 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content72% 
IMG OID637879350 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_481117 
Protein GI86740717 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.646267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGACC AACCCAACGC GGCCTCGCCG CCCGCCACCG CCGCGCCCGG GCCCGCCTCG 
GCCGCCAAGC TGGACCCGGC GCTGATCCGA CTGTCCGCCA CAGTGATGCT CGGCGCGATC
ATGGTGATCC TCGACACGAC GATCGTCTCG GTCGCCATCC ACGCGCTCGG TAACGAGTTC
CACACGTCGC TGTCCACGAT CTCCTGGGTC ACCACCGGTT ATCTGCTGGC GCTGGCCGTC
GTGATCCCGC TGGCCGGCTG GTCGGTCGAG CGGTTCGGCG CGACGCGGAT GTGGAACATC
TCGCTGGCGC TGTTCGTCGC CGGCAGTGCC CTGTGCGGCC TCGCCTGGTC GGCAGGGACG
CTGATCGCCT TCCGGGTCCT GCAGGGCCTC GGCGGCGGCA TGATCATGCC GATCTGCATG
ACCCTGCTCG CCCAGGCCGC GGGCCCGCAG CGGATCGGCC GGGTCATGAG CATCATCGGC
GTCCCGATGC TCATCGCGCC GGTGCTCGGC CCGGTCATCG GCGGACTGAT CGTCGACCAC
CTCAGCTGGC GCTGGATCTT CTACGTGAAC GTGCCGATCG GCGTGCTCGC CCTGTTCCTG
TCCTGGCGGG TGCTGCCACG CGATGACCGC GGCTCGGCGC ACCACCGGCT CGACCTGCCC
GGCCTGGCGC TGATCTCGCC GGGGCTGGCG GCGCTCGTCT ACGGCCTGTC CGAGGCGGGC
AACGGCGGCG GCTTCGGGGC TGCGAAGGTG CAGGTCAGCA TCGCCGTCGG CGTAGTGCTG
CTGGCCGGCT TCGTCGTGCA CGCCCTGCGC CGCGACGGCG CCCTGCTCGA CATGCGGCTG
TTCGGCGACC GGGTGTTCAG CATCGCGAGC GTCACGACCT TCGTGATCGG CGCCATGCTG
TTCGGTGCGA TGTTCCTGCT TCCGCTCTAC TACCAGGTCG CGCGCGGGCA GAGCGCCAGT
GACGCCGGCC TGCTCATGGC GCCACAGGGA CTGGGCGCAA TGATCTCCAT GGCGATCGCG
GGACGGGTCT CCGACCGCCG CGGCGCCCGG TCGGTCGTCC CGCTCGGCAT GGTCCTCGCC
CTGCTCGGCA CGCTGCTGTT CACCCAGGTC GACGCGCACA CCAATGAGGT GCTGCTGGCG
TTCTCGCTGT TCGTGCGCGG TCTGGGCTTC GGGGCGACGA TGATGCCCGC GATGGGCGCC
GCCTATGCGA CGCTCGGGCA GGCGGCGGTG CCGCGCGCGA CCACGACGCT GAACATCCTG
CAGCGGGTCG GCGGGTCGCT GGCCACCGCG CTGGTCGCCG TCGAACTGCA GCACGCCATC
GCCAGCCGGC TGCCGGGCGT CGGTGGCGAG GGTGCGCTCG CCGAGTCGGC GGGGGTGAAG
CTTCCCGGCC CTGTCGCCGA CAAGGTCGCC GAGGCGTTCG GCGTGACGTT CTGGTGGGTC
GTCGCCCTGA CCGCCGTCGG CTTCGTCGCC AGCCTCTTCC TCCCCGGCCG CCCCGCTGCC
ACCGTGGCGC ACGGCCCCGC GAACGCCCAG CCGGCGGCCC CGCAGCGCGA ACCGGTAGCG
GTGGTGGAGT AG
 
Protein sequence
MPDQPNAASP PATAAPGPAS AAKLDPALIR LSATVMLGAI MVILDTTIVS VAIHALGNEF 
HTSLSTISWV TTGYLLALAV VIPLAGWSVE RFGATRMWNI SLALFVAGSA LCGLAWSAGT
LIAFRVLQGL GGGMIMPICM TLLAQAAGPQ RIGRVMSIIG VPMLIAPVLG PVIGGLIVDH
LSWRWIFYVN VPIGVLALFL SWRVLPRDDR GSAHHRLDLP GLALISPGLA ALVYGLSEAG
NGGGFGAAKV QVSIAVGVVL LAGFVVHALR RDGALLDMRL FGDRVFSIAS VTTFVIGAML
FGAMFLLPLY YQVARGQSAS DAGLLMAPQG LGAMISMAIA GRVSDRRGAR SVVPLGMVLA
LLGTLLFTQV DAHTNEVLLA FSLFVRGLGF GATMMPAMGA AYATLGQAAV PRATTTLNIL
QRVGGSLATA LVAVELQHAI ASRLPGVGGE GALAESAGVK LPGPVADKVA EAFGVTFWWV
VALTAVGFVA SLFLPGRPAA TVAHGPANAQ PAAPQREPVA VVE