Gene Francci3_2752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2752 
Symbol 
ID3906463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3242445 
End bp3243941 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content69% 
IMG OID637880075 
Productmajor facilitator transporter 
Protein accessionYP_481841 
Protein GI86741441 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGGT CCGCTCAGCC GAAGGAGTCT GCGGCATCAC CACTGCTGCT GCCGGTCGTG 
TGCCTCGGCC AGTTCCTGGT CGTGTTCAGC CTGTCCAGCG TGAATGTCGC GCTGCCCAGC
CTGGCAGCCG ATCTTCAGAT GAGCCGCGAG ACCCAGCCCT ACACGGTCAC GGCCTACGGG
ATAGCCCTGG GCAGCCTGCT CCTGCTGGGT GGCAGGGCTG GCGATCTACT CGGCCGACGG
CGACTGTTCC GGCTAGGCAC GGGGGTCTTC GCGGTCGGCA CCGTGGTGTG CACGCTCGCC
AACGTGACCG GCGTGCTGCT CGCTGGGCGG GCCCTGGAGG GGCTCGGTGC GGCCATGCTC
GCCCCGGCCG CGCTGGGGAC GATCAACGCG ATCTACCCGC AAGGGCCGAG CAGAAACAAG
GCGCTCGGGG TGTTCGGTGC GGTGTCCAGC GCGGCGGCCG GGGTGGGCGT CTTGGCGGGC
GGGGTCCTCA CCGACGGGCC GGGATGGCGG TGGGCGTTCG CGATCAAAAT TCCCTTCTGT
CTACTCCTGC TCGCCGTGGT CGGAAGGATC ATGCCGGAAA CCCGTGACAC CCGTTCTCAC
CACGGCTTCG ACATCGCCGG CGCGCTCACC GCGACGGGGT CGATGTTCTC GCTGGTCGTG
GCCATCACAC AGGCGGAGCA GCGCGGTTTC ACCAGCCCGG CGATCCTTGC TCTGTTCGGA
TCAGCCGCCG TTCTTCTGAT CGCCTTCATC GTGATCGAGA GCCGCAGCGA CGACCCGCTC
ATGCCGTTGG CCCTCTTCCG CCGGCGTGGT CTCAGCGTCG CCAACGGCGC GACCCTGCTC
GTGTGGGCCA GCTTCGGGTC GGCTTTCCTC CTGGTGGCCC TGTATGTGCA GCAGGTACTC
GGCTACAGCC CGATGAAGGC CGGGCTGACA TTCGTGCCGA TGGCAACCGC GGCCGCGCTC
GCGGCCAACG TCGCCCAGGC GACAGCGAGT CGTTCCGCGC GCGGTCCCAA GCCCGCGCTG
CTCGTCGGCC TTGTCCTCCT CACTGTCGGC GTGGCCCTCC TGTCCCGGGG CGACGCCGAC
ACCAGGTACA CGGCGGTGCT GCTGCCGGCG CTCGTGCTGT GCGGCGCCGG TCTCGCGACC
TCCCTCACCG CGCTCAACGT GGCCGCCTTC AGCGGCTCCA ACAGCGACGA CTCCGGGGTT
GAGGCAGGAC TCCTGAGCAC CTCCCAGGAG ATCGGCGCGG CCCTCGGCGT CTCGATTCTC
TCGACGGTGT CGGCCCGCGT CATCTCCGAT CATCTCGCAG CTCATCCCGG AGATCCGACC
GCGATGCTCA GCGGCACGGT CGATGCCTTT CAGGTGTCCT TCGTCGTCGC GGCGGGCATG
GCTGCCGGCG CCTTCCTGCT TTCCCTGATC GCCCTCTCGA CGCGGGTTGA TATCCCCCCG
GCCAGCGTGT CGGAAAACCC CCTCGCGGAA CCAGCCTCGA TAATGCCCTC AAGGTGA
 
Protein sequence
MSRSAQPKES AASPLLLPVV CLGQFLVVFS LSSVNVALPS LAADLQMSRE TQPYTVTAYG 
IALGSLLLLG GRAGDLLGRR RLFRLGTGVF AVGTVVCTLA NVTGVLLAGR ALEGLGAAML
APAALGTINA IYPQGPSRNK ALGVFGAVSS AAAGVGVLAG GVLTDGPGWR WAFAIKIPFC
LLLLAVVGRI MPETRDTRSH HGFDIAGALT ATGSMFSLVV AITQAEQRGF TSPAILALFG
SAAVLLIAFI VIESRSDDPL MPLALFRRRG LSVANGATLL VWASFGSAFL LVALYVQQVL
GYSPMKAGLT FVPMATAAAL AANVAQATAS RSARGPKPAL LVGLVLLTVG VALLSRGDAD
TRYTAVLLPA LVLCGAGLAT SLTALNVAAF SGSNSDDSGV EAGLLSTSQE IGAALGVSIL
STVSARVISD HLAAHPGDPT AMLSGTVDAF QVSFVVAAGM AAGAFLLSLI ALSTRVDIPP
ASVSENPLAE PASIMPSR