Gene Francci3_2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2871 
Symbol 
ID3906002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3383822 
End bp3385513 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content71% 
IMG OID637880192 
Productmajor facilitator transporter 
Protein accessionYP_481958 
Protein GI86741558 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.175724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACC TGGTGATGCA ACCGATGCCC GGGATGCGGC CGACCCCCGG CACACGACGG 
GCCGGCCGCC GGGAATGGGC CGGTCTCGCC GTACTCGCCC TGCCCTGCCT GCTGCTGTCG
ATCGACGTGA CCGTGCTGTA CTTCGCCCTG CCGTTCATCA CCGCACGCCT CGCCCCGAGC
GGCGCCCAAC TGCTCTGGAT CGTCGACATC TACTCCTTCG TGCTCGCCGG CCTGCTGATC
ACGATGGGGA CGGTCGGCGA CCGGATCGGG CGTCGCCGGC TCCTGCTCTA CGGCGCGGCC
GTGTTCGGAA CCGCGTCGGT ACTCGCCGCC TACGCCACCG GCCCGGTCAT GCTGATCCTC
GCCCGGGCGG CGATGGGCGC CGCCGGTGCC ACCCTGATGC CGTCGACCCT CGGGCTCATC
CGGGTCCTGT TCGTCGATCC GGACCAGAGG CGCGTCGCCG TCGCCATCTG GACGGCGAGC
TTCTCCGGCG GTGTCGCGCT GGGACCGCTG CTCGGCGGCT TCCTGCTGGA GCACTTCTGG
TGGGGTGCCG TCTTCCTGAT CAATGTGCCG GTGATGCTCC TGCTGCTGGT GCTGGGTCTC
GCGTTCCTGC CCGAGTTCCG GGATCCGCGT CCCGGCCGCT TCGACCTGAC CAGCGCCGGG
CTCTCCCTCG CCGCCGTCCT CAGCGCGGTC TACGGCATGA AGCGCATCGC CGAACACGGC
TGGACCGCCG TTCCGATGAC CGCCTTCGTC GTCGGTGTGA TCCTGGGTGG CCTCCTTGTG
GTCCGGCAGC GCCGGGTCGA CCAACCGCTG ATCGATCCGC GGTTGTTCGC GTCCCGTGCC
GTGCGCACGG CGTTGGCCGT CAACGCCCTC GCCCTGTTCG CCCTCGTCGG GTTCAGCTTC
TTCGCCACTC AGTACCTGCA GCTCGTGGCG GGCCTGACCC CGCTGGTCGC CGGACTGTGG
ACGATGTTGC CGGCGACCGC AATGATTATC TCTGTGCTGG CCGCCCCCCG GCTGCTGCGG
CTGCTGCGGC TGGGCGGCGC GCTTGGCGCG GGGCTGGTCG TCACCGCGCT GGGATTCGGC
ATCGCCAGTC GGCTCGGCGT CTCCGGGGGT CTGCCGGTGC TGCTCGGCGC CTATGCCGTG
CTGATCAGCG GGGTCGGTGT CGTCCTCACG ATCTGCACCG ACGTCGTGCT CGGCAACGCC
CCACCCGAAC GGGCGGGTTC GGCCTCCGCC CTGTCCGAGT CGGCCATCGA GTTCGGGGGC
GCGCTCGGGG TCGCCGTTCT CGGCAGCATC GCCACCGCCG TCTACCGGAA CGAGGTGCCG
ACGCGGGCAC CGTCCGGCCT GCCCACGATC GCGGTCGACG CGGCGCGGGA GACCCTCGGC
GGTGCGGTGG AGGTCGCCCA CCGGCTCCCC GGCGGCCTCC ACCCGGGTGG TCTTGGCGAC
ATGCTGCTGC GGGTGGCACA GGAATGCTAC GTCGACGCGA TGTCGGTGAC CGCCCTGATT
GCCATGACGG TCATGGCGGT CACGGCCATC GCCGTCGTGG TGGTGCTGCG CCGGGACGCT
GCGCCGTCGA TGCCTGAATC GTCAACGGTC GGATCGTCAA CGGCCGGGGA CGCGGTCGCC
GGGCGGGCCG AGAGCGTCGG CACCGAAGGG TCCGACCGTA CGGTGGCGGA CTCCGCCGCT
CCCGGGGCAT AG
 
Protein sequence
MPDLVMQPMP GMRPTPGTRR AGRREWAGLA VLALPCLLLS IDVTVLYFAL PFITARLAPS 
GAQLLWIVDI YSFVLAGLLI TMGTVGDRIG RRRLLLYGAA VFGTASVLAA YATGPVMLIL
ARAAMGAAGA TLMPSTLGLI RVLFVDPDQR RVAVAIWTAS FSGGVALGPL LGGFLLEHFW
WGAVFLINVP VMLLLLVLGL AFLPEFRDPR PGRFDLTSAG LSLAAVLSAV YGMKRIAEHG
WTAVPMTAFV VGVILGGLLV VRQRRVDQPL IDPRLFASRA VRTALAVNAL ALFALVGFSF
FATQYLQLVA GLTPLVAGLW TMLPATAMII SVLAAPRLLR LLRLGGALGA GLVVTALGFG
IASRLGVSGG LPVLLGAYAV LISGVGVVLT ICTDVVLGNA PPERAGSASA LSESAIEFGG
ALGVAVLGSI ATAVYRNEVP TRAPSGLPTI AVDAARETLG GAVEVAHRLP GGLHPGGLGD
MLLRVAQECY VDAMSVTALI AMTVMAVTAI AVVVVLRRDA APSMPESSTV GSSTAGDAVA
GRAESVGTEG SDRTVADSAA PGA