Gene Francci3_2086 details


Gene Information

Locus tag: Francci3_2086
Symbol:
ID: 3905613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2452317
End bp: 2453792
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 70%
IMG OID: 637879421
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_481187
Protein GI: 86740787
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
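
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2452317, 2453792)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```

Both values agree with the Gene Length and Protein Length fields of this record.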


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.377299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTACCG TGAGTACGCC CGCAGCGGTA GCGGACGAAT CCGACCGGCT GGATCCGGCG 
CTGCGCAGGC TGATCGCCGT GCTGATCATC GGTGCGATGG CGCCGCTGCT GGACACCACG
ATCGTCAACG TCGCGCTGAA CACCATCGTG CACGACCTGC ACACCACGGT CTCCGCCGTC
CAGTGGGTCA GCACGAGCTA CCTGCTCGCC ATGGGGATGT CGATCCCGCT GGCGGGATGG
GCCGTCGCCC GCTTCGGCGG CAGGCAGACC TGGATGATGT CGCTCGTCCT GTTCCTGGTC
GGCTCGGCGC TGGCCGGGGT GGCGTGGAAC ATCGGCAGCC TGATCGCCTT CCGGGTGGTC
CAGGGCACCG CGAGCGGCAT GGTGCTGCCG GTGCTGCAGA CGCTGCTTCT GCAGGCGGCT
GGCGGGCGCC GCACCGGGCG GCTGATGTCC GCGGTGAGCA TGCCCGCCGT GATCGCGCCC
ATACTTGGGC CGGTGCTCGG CGGTCTGATC GTCGGCAACA CCACCTGGCG GTGGATCTTC
CTGATCAACA TTCCCGTCTG CCTGGTCGGT ATGTTGCTCG CCTGGCGGGG TCTCGAACCC
ACCAGCGCGC GGCCCGGTGA CCCTCTGGAC GCGGTGGGCC TGGCCCTGCT CTCGCCCGGC
ATCGCGGGCA TCCTGCTGGC CCTGTCCCAG TTCAGCAGCC GCCGCGAGTT CGGCCTTCCG
GTGATCATCC CGCTCGTTCT CGGGCTGGCG TTGCTCGTCG CGTTCGTCGT GCACGTGCTG
CGCGGGCGCG TCGCCAACCC GATCATCGAC GTCCGGCTCT TCCAGGTGCG TGCCTTCAGC
GCCGCCTGCG CCCTGCTGTT CCTGTCGGGC CTGTCGCTGT TCGGTGGAAC GCTGCTGGTT
CCCCTCTTCC TCCAGCAGGC CCGCGGCTAC TCCGCGCTGT CGGCGGGCCT GCTCCTAGTG
GCTCAGGGGG TGGGGGCGAT GCTCGCCCGG AGCACGGTGG GCAAACTGGC CGACCGCACG
GGCTCGAGAC CTCTTGTGCT GGTCGGCATC GTCCTGATCG CGCTGCCGAT CATCGGCTTC
ACTCAGGTCG GCAGCCACAC CCAGGTGCTG TTCATGGCCG CCTTCCTGCT GGTCTTCGGC
TGCGGTATCA GCACGGTGAG CATCGCGGTG ATGACCTCGG CCTTCCAGGG GCTGGACCGC
GCGCAGGTCC CGCACGCGAG CGGGGCCACC CGAATCGTGC TCCAGGTCGG CGGCTCGTTC
GGCGCGTCGA TCGTCTCGCT CATCCTCGCC CGACAGATCG CCACGCACGG CGGGGGAGGC
CAGACCGGCC TGATCACCGC GTTCAGTCAC ACGTTCTGGT GGGCGGCCGG GTTCGCCCTG
ATAGCGCTGA TCCCCGCGGC GTTCCTCCCT GGGCGCGGGC ATCAGCTCGC GCCGGCCGTC
CCGAAACAGC CGCGGGGGGA GGATGCGCGC CACTGA
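
The GC content field in the Gene Information section can be recomputed from a sequence like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; the record does not state how the value was derived, and the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; paste the full sequence to check the record.
first_line = "GTGAGTACCG TGAGTACGCC CGCAGCGGTA GCGGACGAAT CCGACCGGCT GGATCCGGCG"
print(round(gc_content(first_line), 1))
```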
 
Protein sequence
MSTVSTPAAV ADESDRLDPA LRRLIAVLII GAMAPLLDTT IVNVALNTIV HDLHTTVSAV 
QWVSTSYLLA MGMSIPLAGW AVARFGGRQT WMMSLVLFLV GSALAGVAWN IGSLIAFRVV
QGTASGMVLP VLQTLLLQAA GGRRTGRLMS AVSMPAVIAP ILGPVLGGLI VGNTTWRWIF
LINIPVCLVG MLLAWRGLEP TSARPGDPLD AVGLALLSPG IAGILLALSQ FSSRREFGLP
VIIPLVLGLA LLVAFVVHVL RGRVANPIID VRLFQVRAFS AACALLFLSG LSLFGGTLLV
PLFLQQARGY SALSAGLLLV AQGVGAMLAR STVGKLADRT GSRPLVLVGI VLIALPIIGF
TQVGSHTQVL FMAAFLLVFG CGISTVSIAV MTSAFQGLDR AQVPHASGAT RIVLQVGGSF
GASIVSLILA RQIATHGGGG QTGLITAFSH TFWWAAGFAL IALIPAAFLP GRGHQLAPAV
PKQPRGEDAR H
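
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the value in the Translation table field), whose codon-to-amino-acid assignments match the standard code and which accepts GTG, among others, as a start codon read as methionine. The function and constant names are hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest); '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGTACCG TGAGTACGCC CGCAGCGGTA"))  # MSTVSTPAAV
```

The initial GTG is why the protein begins with M rather than V, even though the same codon encodes valine at the fourth position.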