Gene Francci3_1768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1768 
Symbol 
ID3903998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2102801 
End bp2104114 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content67% 
IMG OID637879106 
Productmajor facilitator transporter 
Protein accessionYP_480873 
Protein GI86740473 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.160661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGACCC CCCATAACCG GGACCGGTCG GCGCTACGGA TCCTGCTGAC GAGCATGATC 
GGCAGCGCCA TCGAATGGTA CGACTTCTAC CTGTACTCCA CCGCGTCGGC GCTGGTACTG
GGGCCGCTGT TCTTCCCTAA GAGCTCGCCT CAGGCCCAGA TCCTGGCTGT GTTCGCGACC
TACGCAGCCG GCTTTCTGGC CCGGCCAATC GGTGGCCTGC TTGCCGGACA CCTCGGCGAC
CGGGTCGGGC GCAAGTCGAT CCTGGTGCTG ACTCTCGGCG TGATGGGGAT GGCCACGTTT
CTCGTCGGCC TGCTGCCTAC CAGCAACCAG GTCGGGGTGC TGGCCCCCGC GCTGCTGATC
GTGCTGCGCG TGTTGCAGGG GATCGGCATC GGCGGCGAGT GGGGCGGCGG GGTGCTGCTG
GCGGTGGAGA ACGCCCCACC GGGCCGGGGC GGATGGTACA GCAGCTGGCC GCTGCTGGGG
TTCCCGGTCG GCCTCGCGTT GAGCACCCTC ACCTGGACGG CGCTGGCCCA GCTGCCCAGG
CAGGAGCTAC TGTCCTGGGG CTGGCGTCTC CCGTTCCTGG CCTCTGTGGT GCTGGTGGGC
ATCGGACTGT ACGTGCGACT GGGCATCGCC GAGACCCCTG AGTTCAGCCA GGCCAGGGCC
GCGGGCGAGG TGGTGCGGCT ACCGGTGGCA CAGGTCCTGC GCGAGCAGCC GCGCCACGTG
CTGTGCGGGC TACTGGCGGC GCTCGGAGTG GGCAGCACGG TCTCGCTCTA CAGCGTCTTC
CTGCTGTCCA CCGTGGCCAC AGGAGGTGGT CGCCACGATG TCGCGCTGAC TGCACTGGTC
ATCAGCGCCG CGTTGCAGTG TCTCTCGATA CCGCTGTTCG CCACACTGTC GGATCGGATC
GGGCGCAAAC CATTGATGGT GTTCGGTTAC GCGGTCGCCG CAGCGACCAC CGTCCCGGCG
CTGCTGTGGT TCGACAGTGG AAACCTACTC GCGGTGAGCG CAATCTACGT CATGGCCATA
TCGATCGGGC ACGGCGGCTG CTATGGTAAT CTCGCGGCAT TCCTCTCCGA GCTGTTCCCG
CCTACCCGGC GATTCTCCGC GCTTGCGGTG ACGTACCAAG TTGGTGTCAC CGTCGCCAGC
TTCCTCCCGT TGGCCGCCAC AGCGATTGCC TCCGGCACGC GCATGACCGT CGATGTCGCA
CTGCTGTTCT GCGGTGTCGC CACCGTCGCC GCGATCGCGA CTTCCCTGGC ACCCCAACCT
TTCATGCCAT CCACCACCAC CCCCGTAGGC GATCATGTAG CTACCGTGAG CTAA
 
Protein sequence
MPTPHNRDRS ALRILLTSMI GSAIEWYDFY LYSTASALVL GPLFFPKSSP QAQILAVFAT 
YAAGFLARPI GGLLAGHLGD RVGRKSILVL TLGVMGMATF LVGLLPTSNQ VGVLAPALLI
VLRVLQGIGI GGEWGGGVLL AVENAPPGRG GWYSSWPLLG FPVGLALSTL TWTALAQLPR
QELLSWGWRL PFLASVVLVG IGLYVRLGIA ETPEFSQARA AGEVVRLPVA QVLREQPRHV
LCGLLAALGV GSTVSLYSVF LLSTVATGGG RHDVALTALV ISAALQCLSI PLFATLSDRI
GRKPLMVFGY AVAAATTVPA LLWFDSGNLL AVSAIYVMAI SIGHGGCYGN LAAFLSELFP
PTRRFSALAV TYQVGVTVAS FLPLAATAIA SGTRMTVDVA LLFCGVATVA AIATSLAPQP
FMPSTTTPVG DHVATVS