Gene Francci3_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2004 
Symbol 
ID3906720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2353276 
End bp2354796 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content73% 
IMG OID637879340 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_481107 
Protein GI86740707 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.331322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGACT CGACTCCCCC GGCCCCGCCC CCGGCGCTCC CGTCCCGCTG GCTCGCCCTC 
GGGGTGCTCT GCACCAGCAA CCTGATGGCC ATCCTGGACA GCTCGATCGT TACGGTCGCG
CTGCCGACCA TCCAGCAGGA CCTCGGCTTC TCCCAGGCGA ACCTGGCCTG GGTTGTGAAC
GGATACCTGA TCGCCTTTGC TGGCCTGCTG CTGCTGTCCG GGCGGCTGGG TGACCTGCTG
GGCGGCCGGC CCGTCTTCAT CGGGGGGCTC GGCCTGTTCA CGGCAGCCTC GCTGGCCTGC
GGCCTGGCCA ACACCGGCGG CCTCCTGATC ACGTTCCGCT TCCTGCAGGG CGCCGGGGGG
GCGCTGGCCT CCGCGGTGGC ACTCGGCATG ATAGTCACGA TCTTCCCTGA CAGCCGGGAG
CGGGCCAAGG CACTGGCCTG CTACGCCTTC GTCGCCGCGG CCGGCTCATC GATCGGGCTC
ATCCTCGGCG GCGTGCTGAC CAAGAGTCTC AGCTGGCCGT GGGTCTTCTA CGTCAACGTG
CCGCTGGGTG TCCTCGCGGT GGTGCTCGCC CTGTGGGTGC TGCCCCCGAT CGCCGGGCTC
GGGCTGCGCG AGAGCGCGGA CGTTCTCGGC GCCGGGCTGG TGACCGCCGG GCTGATGCTC
GGCGTCTACA CCGTCGTCCG GGGCGGGGCG GACGGCTGGA GCGACGGCGG GACGCTGGGG
TCGCTCGCCG CCTCGCTGCT CCTGCTCGCC GCCTTCGTGG GCCGCCAGGC CAGAGCGGCC
AAGCCACTGC TGCCGCTGCG CATCTTCCGC TCCCGCGCGG TCGGGGGAGC GAACATCATC
CTGACCCTGA TGGTGGCCGG CCTGTTCGGC TACCAGTTCT GCACCGCCCT GTACCTGCAG
GACGTGCTCG GCTACAACGC GCTGCGCACC GGGCTGGCGT TCCTGCCGGC CCCGCTAACC
ATCGCGGCCG TCTCGCTGGG CCTCGCGTCG CCGCTGAACA CCCGGTTCGG CCCCAGGCCG
GTGATCGTCG TCGGGCTGCT GCTGGTCGCC GCGGCGCTCC TACTGTTGGC CCGGCTACCG
GCTGACGGCA GCTACGCGAG CGACATCGCC CCGGTGTTCG TCCTGCTGGG TATCGGTTTC
GGCGCGGCGA TGCCCGCACT GATCGGCCAG GCGATGGCGG TCTCCGACCC CGCCGAGGCC
GGCGTCGCAT CCGGGGTGGC CAACACCACC CAGCAGATCG GCGCCGCGCT CGGCACCGCG
ATCCTGGCCA CCCTTGCCGC CTCCCGCACC GACTCGCTGC TCGACCACGG CAGGAGCACC
ACCACCGCCC TCACCGGCGG GTTCCACCTC GCCTACGCGG TGAGCGCGGC CCTGGTCGCC
GCGGCCGTCC TCGTCGCCCT GTTCGTCCTG CGCCCGCCCG GCCAGTCGCG GGCCGCCGAC
AGCGCACGGA CCGCCGACGC CCACCCAGCC GAGACCGACG ACCCGACGCA GCCGGCGGGG
GTGCCGGCCG CGGGACAGTA A
 
Protein sequence
MTDSTPPAPP PALPSRWLAL GVLCTSNLMA ILDSSIVTVA LPTIQQDLGF SQANLAWVVN 
GYLIAFAGLL LLSGRLGDLL GGRPVFIGGL GLFTAASLAC GLANTGGLLI TFRFLQGAGG
ALASAVALGM IVTIFPDSRE RAKALACYAF VAAAGSSIGL ILGGVLTKSL SWPWVFYVNV
PLGVLAVVLA LWVLPPIAGL GLRESADVLG AGLVTAGLML GVYTVVRGGA DGWSDGGTLG
SLAASLLLLA AFVGRQARAA KPLLPLRIFR SRAVGGANII LTLMVAGLFG YQFCTALYLQ
DVLGYNALRT GLAFLPAPLT IAAVSLGLAS PLNTRFGPRP VIVVGLLLVA AALLLLARLP
ADGSYASDIA PVFVLLGIGF GAAMPALIGQ AMAVSDPAEA GVASGVANTT QQIGAALGTA
ILATLAASRT DSLLDHGRST TTALTGGFHL AYAVSAALVA AAVLVALFVL RPPGQSRAAD
SARTADAHPA ETDDPTQPAG VPAAGQ