Gene Francci3_3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3232 
Symbol 
ID3904403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3826192 
End bp3827379 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content71% 
IMG OID637880557 
Productmajor facilitator transporter 
Protein accessionYP_482318 
Protein GI86741918 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.904986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA GCCAGGACGC GGTCGAGACG AACTCCACCG CGGTTCGCCA CGACGTATTC 
AAGGTGACCG CCGCGACCTG TGGCGCGCTG TTCGTGGACA GCCTGCTCTA CAGCATCGTG
GTACCTGTCC TGCCGAACTA CGCGGACCAG TTCGATGTGG GGTCGGCCGG GGTGAGCCTG
CTGTACGCCG CGTACGCGGT CGCCCTGCTG GCGGGGACTC CCCTGATGGG CCGGGTCGGC
GACCGGTTCG GGCATGAACG GCCATTCCAG GTCGGTGCCG CCGGTCTGCT GATCTCGACG
GTCGGTTTCG CGCTGGCTCG TAGTTATCCC GAACTTCTGG CTGCCCGCAC GCTCCAGGGA
GTCGCCGCGG CGGCGCTGTG GACGAACGGA ATCGCTCTGC TCGCCCAGCG GGTGCGGCCC
CCCAGGGCCG GCGGCGCGAT GGGAGCGGCG ATGTCCAGTA TGTCCGTCGG TATGGTCGCT
GGCCCAGTGA TCGGCGGGCT GCTGGCCGAA CGCTTCGGCG ACGCGGCGCC GTTCGTGGTG
TGCACGGTGC TCACCGCGGT ACTCGCCGCC GTCCTGCCGT GGCTGGTCCG CGGCGCCGCG
CAGCCGGTAC GTGAGCAGCA GCCATCCGGC TGGCGTTCGC TCCTGCCGAC CCTGCTCGCT
GTCGCCTTCG GAGCGGCGAC GCTGAGCATG CTGGAACCGC TGCTCCCGCT GCACCTGGCC
GACCGGTTCG GCAGCGGCCC GGCCACGCTG GGTCTGATCT TCGGGGCGGC GACGCTGGCG
CACGGGCTCG CCGGGGTGCC GGTCGGCCTG CTTGGGGACC GGCGGCCGGA TCTGCCGTTG
ATCCCGGGCG GGTTGCTGGG GATGAGCGCC GTCCTGCCGC TGCTCCCCAG GTTCGACGTT
GGCTGGACGA CGGTGCTGCT GGTGACGTTC GCCGTTTGCT TCTCCTTTGT CCTCATCCCT
GCGCTGGGGA TTCTCACCGC AGCGGCGGAA CGGCGGGGGG TCGGGCATGG GGCCATCTTC
GCGATGTTCA ACATCGCCTA TGCCGTGGGG ATGATGAGCG GCCCTCTGCT CGGCGCGCTG
GGTACCGGCT TCAGCAGCGT CACCACCGCG CTTACCGGCA TGGCGGCGGT TCTCGTCCTG
GGCGCCGTCC TGATCCTGAC CGCGGCGCAG CGCCGCTCCG TCACCTGA
 
Protein sequence
MAASQDAVET NSTAVRHDVF KVTAATCGAL FVDSLLYSIV VPVLPNYADQ FDVGSAGVSL 
LYAAYAVALL AGTPLMGRVG DRFGHERPFQ VGAAGLLIST VGFALARSYP ELLAARTLQG
VAAAALWTNG IALLAQRVRP PRAGGAMGAA MSSMSVGMVA GPVIGGLLAE RFGDAAPFVV
CTVLTAVLAA VLPWLVRGAA QPVREQQPSG WRSLLPTLLA VAFGAATLSM LEPLLPLHLA
DRFGSGPATL GLIFGAATLA HGLAGVPVGL LGDRRPDLPL IPGGLLGMSA VLPLLPRFDV
GWTTVLLVTF AVCFSFVLIP ALGILTAAAE RRGVGHGAIF AMFNIAYAVG MMSGPLLGAL
GTGFSSVTTA LTGMAAVLVL GAVLILTAAQ RRSVT