Gene Francci3_1772 details

Gene Information

Locus tag: Francci3_1772
Symbol:
ID: 3904002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2107934
End bp: 2109397
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 66%
IMG OID: 637879110
Product: major facilitator transporter
Protein accession: YP_480877
Protein GI: 86740477
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
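
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2107934, 2109397)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

Both results match the Gene Length and Protein Length fields of this record.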


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.819492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.422548
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAATA AGAGGGGGAG TCGGGCGATT CGCGGCGGGC GCGACCTTGA CGCTGATAAA 
TCCAGCCCGA CTGGTGCGCT CCTTCTGCTG AACATCCAAC ACCTGCTGAT CGCGATGGAC
TTCACCGTCG TCTTCGTGGC GCTGCCAACA ATGGGGGACG ACCTCGGTCT CACTGATCGT
GGGCTGACCT ACGTCACCGC GACTTACGGG CTGGCCTTCG CTGCGTTCCT GCTGTTGGGT
GGCCGAGCCG CGGACGTGTT CGGGCGGCGC CGGGTGCTCT TGGTGGGACT CGCGCTATTC
GTCGTCGCCT CGGTAGTCGC GGTGTCAGGC TCGGCCGCCG CGTTGCTGGT TGGCCGCAGC
ATGCAGGGTG CCGCCGCGGG CCTCATCACC CCGGCCGCCC AAGCCTTGGT CGTCACGCGA
TACCCCGAGG GGAAACAGCG TACGAGGGCG CTGAGTTGGT GGGGTGTCAC CGGTGCGGGC
GGCCTGGCCC TGGGGGGACC GGCCGCAGGG GTACTGACCT GGGCACTGGA CTGGCGGGCT
GTCCTGCTGG TCAATGTCCC GATTGGGTTG CTGTGCTTGG TCTTGGCGCC ACGGCTGATC
GCCGTCGACG CGACCAGGGA GACCGAGGCT GGGCCCCGGT TCCGTCTTCC CGCGGTCCTA
ACGGCAGGCG CGGCGATGGG TCTTCTGGTA TGGACGGTCG CCGAGGGTCC GGTCTCTGCC
GCTGCGGATA CCCTGGTCCG GGCAACGGCC GTAGTGGTGC TACTGGGAGC ATTCGTGCTT
ATGGAGCGGC GGAGCACAGA TCGGCTGATG CACCGCGATA TCTTGCGCGT CCGTCCGGTG
GCTGTCGCCG ACCTGATGTC GGTGTTTTTC GGTGCCGCCC TCGGAGGGCA GTTCTTCGCT
ATCACCCTGT ATCTGCAGGC AGTGAGCGGA ATGTCCGCGC TCGTCGCCGG ACTGATGTTT
ATTCCGGTTA CGCTTTTCAT GGTCGTAGGC AACAAGGTCG GAGTCCTGCT GATCGCTAGA
ATCGGCCCCA TCCGCAGCCT GCCTGTCGGG CTGGCAATTG CTGCGGTGGC CGAGGTGGCG
ATGGCGTTCT TGCCCACCGG TGGTGGCGTG CCGTTGCTGG TTCCCGCGAT GATCCTGCTC
GGTCTGGGCC AGGGCATCGC GTTCGTCGCG ATCACCGTTG CTGCCACGGC CACCGTCGCA
GCTGAGCGGC AGGGTGTTGC GTCCGGACTA CTCAACGTTG GCATGAACAT TGGGCAGTCG
ATCGGTCCCG CCGTGCTCGC CGCGATTGTC ACCTGGCGGT CCACTGCCGC CCTCGACGGC
GGGGCGGGCC AGGCCGAGGC GAACAACCAG GGTTTGCACG GCGCTTTTCT GGCCATCGCC
GCGATCGTGG TCGTGGGACT GCTGGTGTGC GGGGTGCTGC TGCGGTCGGG ACCGGGCCGA
GTCGAATCGG TCCACGCAGG CTGA
 
Protein sequence
MVNKRGSRAI RGGRDLDADK SSPTGALLLL NIQHLLIAMD FTVVFVALPT MGDDLGLTDR 
GLTYVTATYG LAFAAFLLLG GRAADVFGRR RVLLVGLALF VVASVVAVSG SAAALLVGRS
MQGAAAGLIT PAAQALVVTR YPEGKQRTRA LSWWGVTGAG GLALGGPAAG VLTWALDWRA
VLLVNVPIGL LCLVLAPRLI AVDATRETEA GPRFRLPAVL TAGAAMGLLV WTVAEGPVSA
AADTLVRATA VVVLLGAFVL MERRSTDRLM HRDILRVRPV AVADLMSVFF GAALGGQFFA
ITLYLQAVSG MSALVAGLMF IPVTLFMVVG NKVGVLLIAR IGPIRSLPVG LAIAAVAEVA
MAFLPTGGGV PLLVPAMILL GLGQGIAFVA ITVAATATVA AERQGVASGL LNVGMNIGQS
IGPAVLAAIV TWRSTAALDG GAGQAEANNQ GLHGAFLAIA AIVVVGLLVC GVLLRSGPGR
VESVHAG
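
The sequences above are printed in blocks of 10 characters, so the blocks need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of stripping that layout and computing a GC fraction. The helper names are hypothetical, and the string in the usage lines is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, as laid out in the record.
seq = clean_sequence("ATGGTTAATA AGAGGGGGAG")
print(len(seq))  # 20
print(gc_fraction(seq))
```

Applied to the full gene sequence, `clean_sequence` should return a string whose length matches the Gene Length field, and `gc_fraction` should give a value comparable to the reported GC content.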