Gene Francci3_4154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4154 
Symbol 
ID3907119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4954306 
End bp4955757 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID637881482 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_483231 
Protein GI86742831 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGC CCGAGGCCCA CTCCGGCTCC GCGGATGGAC CGAACCCGTC GAACGATCTG 
GAGCAGGCCG GTCCGAGTCT CGTGCCGCTG GCCACGACCC TGATCGTCGG GGCGTTCGCG
GCCCTGCTCG ACACCACTGT CGTGGCGGTC GCGATCGACA CGCTGGGGCG CGACCTGCAA
GCGGACATCA CGGTGATCCA GTGGGTCACG ACGTCCTACC TGCTGGCGAT GGCTGCCGTC
ATCCCGGTGG TCGGCTGGCT GGTCGACCGG TTCGGCGCCC GCGCGATCTG GTCGGGTGCC
CTCGGGCTGT TCCTGGCCGG ATCGGTGCTG TCAGGCCTGG CCTGGTCCGC TGGCGCGCTA
ATCGCCTTCC GGGTGCTGCA GGGCCTGGGT GGCGGCATGA TCCTTCCGCT GACCCAGCTG
GTACTCGCCC GGGCCGCCGG CCCGCAGCAC TTCGGGCGGG TCATGGGCGT CGTCGGCCTG
GTCGGCCAAC TGGCGCCGAT CTCCGGCCCG GTGCTGGGCG GCCTGCTGAT CGACACCTGG
GGCTGGCGGT GGATCTTCTT CGTCAACGTG CCGATCGTCG TGGTCTCGCT GCTCATGACA
ACACGGTGGT TCCCCCGCGA AGACCCGCGC ACGGAGCGTT CCCTGGACGT GGTGGGCCTG
GTCCTGCTGC CCACCGGCAT CGTGGCGATG ATCTACGCGC TGTCGAACGT CGAGTCCGGA
AGCACGGTGG TGTCCGCACA GGTGCTCGTC GCCGCGCTGG TCGGCGTCGC GCTGCTGGCC
GCCTTCGTCC TGCGGCCGAC GACACCAGGC CGGCCGTCGC TCATCGATCT ACGCCTGTTC
GGCGACCGTT CCTTCCGCGG CGGCTCGGTG ATGCTGTTCG TCTTCGGCGT GACGAGCTGG
GGCCCGATGT TCGTGCTCCC CCTCTACTAC CAGCAACTGC GGGGGCTGTC CGCACTTGAC
GCCGGGTTCG CCCTCGCACC GCAGAGCGTC GGCCTGGGGC TTGCGTACCT CGCAACCGGC
CGGTACGCCG ACCGGCTCGC GCCACGCCCG CTCGTGGCAG GGGGTCTGGT GGTCGCGAGC
GCGGGCACTC TGCCGTTCGT CTTCGCCACC GCCGACAGCA ACCTGACCCT GCTTGGCATC
TCGCTGTTTG TCCGCGGGAT CGGGTTCGGG GCCGCGAGCC TGCCTGCCAG CGCCGCCGTG
TACCGGACGC TACGAACGGC TGACATTCCG GGCGCGACCA GCGCGAGCAA CGTCATCCAG
CGCGTCGGCG CGGCGACCGG CACGGCCGTG ATGGCTCTCA TTCTCCAGGC GGACGGGTTC
ACCCCTGCAC TCACCTGGAT GTTCGTCCTC ACCTCGGGCG CGCTCGCCGG GACCGTGTTT
CTGCCAGGGC AGAAGCCGGC ACCGGCGCCG CGGGACACGG TGCCCACGAC GACGGCCACC
GGCGCCCAGT AG
 
Protein sequence
MTRPEAHSGS ADGPNPSNDL EQAGPSLVPL ATTLIVGAFA ALLDTTVVAV AIDTLGRDLQ 
ADITVIQWVT TSYLLAMAAV IPVVGWLVDR FGARAIWSGA LGLFLAGSVL SGLAWSAGAL
IAFRVLQGLG GGMILPLTQL VLARAAGPQH FGRVMGVVGL VGQLAPISGP VLGGLLIDTW
GWRWIFFVNV PIVVVSLLMT TRWFPREDPR TERSLDVVGL VLLPTGIVAM IYALSNVESG
STVVSAQVLV AALVGVALLA AFVLRPTTPG RPSLIDLRLF GDRSFRGGSV MLFVFGVTSW
GPMFVLPLYY QQLRGLSALD AGFALAPQSV GLGLAYLATG RYADRLAPRP LVAGGLVVAS
AGTLPFVFAT ADSNLTLLGI SLFVRGIGFG AASLPASAAV YRTLRTADIP GATSASNVIQ
RVGAATGTAV MALILQADGF TPALTWMFVL TSGALAGTVF LPGQKPAPAP RDTVPTTTAT
GAQ