Gene Francci3_2045 details


Gene Information

Locus tag: Francci3_2045
Symbol:
ID: 3904618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2405925
End bp: 2407445
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 69%
IMG OID: 637879382
Product: major facilitator transporter
Protein accession: YP_481148
Protein GI: 86740748
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
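
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(2405925, 2407445)
print(gene_length)                           # 1521
print(expected_protein_length(gene_length))  # 506
```

Under those assumptions both results match the Gene Length and Protein Length fields listed in the record.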


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.706094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCTACG AAAGTTCGAG AACGGTGTCC GCTGACGTGG CGCGCACTCG CCCGGCCATG 
TCTGGCGCCG GGTCGGGAGC GGTCGCCGCG CCTGGCCAAA CCGCCGCGGG CCCCGCGGCT
GAGCCGGCCT ACCCGCGAAT CCTCCTAGCC GCGGGCATGG TCCTCGCGGA CACGTCGATC
CTGAATGTCG TCTCCCCCGT CATGCGGGAT CAGTTCAACG CGAGCATCGG GAGCCTGCAG
CTCGCCATCG CCGGTTATCA GATCGGCTAC GCGTCGGTCC TCGTAGCCGC CGGCGCCGTC
GGCGACCGGC GAGGCCGCCC GGAGACCTTT CGGATCGGAC TGGCGGCCTT CGCGCTGACC
TCGATCGCCT GCGCCGCGGC ACCAAACATC GGCTGCCTCA TCGCCTTCCG GGTAGTCCAG
GGCCTAGCCG CGGGTGTCCT TTTCCCACAG ATTCTGGGAA TCATCCGCGG CGCGGTGGCG
GATCGACAGG TCGGTCTGGT CGCGGCGATG AGCATGATTA TGAGTCTGGC CACCGTCGTC
GGGCCGATTG TGGCCGGAGT GATCGTCTAC AGCGCTCCGA GCTCGTTCAG TTGGCGGCTG
GTCTTCCTGA TCAACGTGCC GTTCTGCCTG TGGGCGTGGA GCGGCACCCC ACGAGTGGCC
AGCGGGGGCC GGTCCTCGCT CAACGGGCAG CTGGACGTCG TCGGCGCGCT CGGAATCGCC
GCACTGGTGA CCGCGATAGC GCTCCCGCTG ACCCTGGGAC GCTCGCTCGG GTGGCCGCTG
TGGGCGCTGC TGCTGCTGGT ATGTGCCGCA CCGGCCGCGG TGTTGTATGC CTGGCACCAG
CGTCTTCGCC ACGATCGGCA GCTGCCCTGC ACCTTCCCGG TCAGCGCGTT CAGGGAAAGG
CAGCTCCTCC AGGCCGCGAT CGCATATTTT CTGTTCTTCG CGGCCAGCAC CTGTTTCTTC
CTGTACTTCT CGATCTTCCT CGAGGAAGGC GCCGGTGCGA GCCCGCTCGC GGCTGGTCTG
AGCCTTGCCC CCTACGGGAT CGGTGCCGCG ATCACAGCCA AGGCGTCGAG CCGGCTCGTG
GCGCGCACCA GCATCCGCAC CGTCGTCGTC AGCGGCGCTC TGCTATGCGC GTTGGGCTCG
CTGGGCACCT GCCTGCTGGT GGCGCACCTC AGCCGCGGCT GGCTGGTGGC CGGGGCCGCC
CCCGCGCTGA TCGTCACCGG AGCGGGCCTG GGGCTCGTGG TCTCCACCGT GCTTCGGCTG
GTATTGGCGC TTGCACCTCC CCAGGAAGCG GGCTCCGTCG GCGGCGCGCT CTCCACTGGC
CAGCAGATCG GCGGTGCGAT CGGCATTCTG CTGTTCGGGC TCTTCTTCCC CATCCACCTG
AGCCCCTCGG TCGATCTGGG GTCGCTCAGA GTCGGCATCG AACATGGCCT GATCTACGAG
GCATCGGCTT TCGCCCTCGT CGCGGCCCTG TTCACCCTCA CCAGCCAGCG CCGTGGCGCC
GGCCCGCGGC AGGCGCGGTG A
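
A GC percentage like the one in the GC content field can be recomputed from a sequence listed in this blocked layout. The following is a minimal sketch, not the database's own method. The helper names are hypothetical, and it assumes the sequence contains only A, C, G and T once whitespace is removed.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to lay out the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same blocked layout (not part of the gene):
print(gc_content("GGCC AATT"))  # 50.0
```

To check the full gene, pass the complete gene sequence text to `gc_content` and round the result to the nearest integer.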
 
Protein sequence
MRYESSRTVS ADVARTRPAM SGAGSGAVAA PGQTAAGPAA EPAYPRILLA AGMVLADTSI 
LNVVSPVMRD QFNASIGSLQ LAIAGYQIGY ASVLVAAGAV GDRRGRPETF RIGLAAFALT
SIACAAAPNI GCLIAFRVVQ GLAAGVLFPQ ILGIIRGAVA DRQVGLVAAM SMIMSLATVV
GPIVAGVIVY SAPSSFSWRL VFLINVPFCL WAWSGTPRVA SGGRSSLNGQ LDVVGALGIA
ALVTAIALPL TLGRSLGWPL WALLLLVCAA PAAVLYAWHQ RLRHDRQLPC TFPVSAFRER
QLLQAAIAYF LFFAASTCFF LYFSIFLEEG AGASPLAAGL SLAPYGIGAA ITAKASSRLV
ARTSIRTVVV SGALLCALGS LGTCLLVAHL SRGWLVAGAA PALIVTGAGL GLVVSTVLRL
VLALAPPQEA GSVGGALSTG QQIGGAIGIL LFGLFFPIHL SPSVDLGSLR VGIEHGLIYE
ASAFALVAAL FTLTSQRRGA GPRQAR
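
The protein sequence can be spot-checked against the gene sequence by translating a few codons. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record. It builds the standard codon assignments, which translation table 11 shares for amino acids, and treats the first codon as an initiator read as M, since this gene begins with GTG. The function name and the `initiator` parameter are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text, initiator=True):
    """Translate an in-frame fragment, stopping at the first stop codon.

    If initiator is True, the first codon is reported as M (a simplification
    for bacterial alternative start codons such as GTG).
    """
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if initiator and residues:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases of the gene sequence:
print(translate("GTGCGCTACG AAAGTTCGAG AACGGTGTCC"))                # MRYESSRTVS
# Last 21 bases, which are in frame because the preceding 1500 bases divide by 3:
print(translate("GGCCCGCGGC AGGCGCGGTG A", initiator=False))        # GPRQAR
```

Both fragments agree with the start and end of the protein sequence listed above.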