Gene Francci3_2666 details


Gene Information

Locus tag: Francci3_2666
Symbol:
ID: 3904890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3146254
End bp: 3148590
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 70%
IMG OID: 637879991
Product: MMPL
Protein accession: YP_481757
Protein GI: 86741357
COG category: [R] General function prediction only
COG ID: [COG2409] Predicted drug exporters of the RND superfamily
TIGRFAM ID: [TIGR00833] Transport protein
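
The start, end, gene length, and protein length values listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates (the page does not state its convention) and assuming the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3146254, 3148590)
print(length_bp, protein_length(length_bp))  # 2337 778
```

Under these assumptions the result matches the listed 2337 bp and 778 aa: 779 codons, of which the final one (TGA in the gene sequence) is not translated.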


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.1973
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCG GGGACGAGTC CGAGGTCAGG GACAGCGGCG GGTCGGTGGC ACGGGCGCAG 
ACCCCGCCAG CGCCAGCAGC ACCGCCCCCG CCGGGGCCGG CTGTCGCCGG GGACGGCAAC
GGCACCCTGG CGGATTCCGC TTCCGGCCCC GCCGGTTCCG GTGGCCGTCC GGCGCGCGCC
CGCCGGCCAC GCATCGTCTA CGTCTTCGCG GTCTTGGCCG TCGGCTGGCT CCTGCTCGGC
GGCGTCGGCG GGTCCTACCA GGGCAAGCTC GGTGAGGTCC AGAAGAACGA CAATGCCGCC
TACCTCCCGA ACTCGGCCGA GTCGACCAAG GTCGACACCG CGTCGCGGCA GTTTCGCTCC
GTGCGGACGA TTCCCGGCTT CGTCGTCTAC GACCGCCAGG GCGGGCTGAC CGCGCAGGAC
AAAGCCAAGA TCACCGCCGA CGTGAGGTCC TTCGCCGGGA TTCACGGGGT GGACGCCGGC
CAGCTCGGGC CGCCCCAGTT CGCCCGCAAC GGCGCTGTGG CCGCCGTGGC CGTGCCGCTC
GTCGCCCAGG ACGGCGGCCG CGAGGTCAAG GGTGATGAGC TCGTCGACGT CGAGAAGGCC
GTCGTCGCGG CCGCCCGCGA CGGGGCACCG GCCGGTCTCG CCGTGCACCC GGCCGGGCCG
GGGGGTCTGC TCGTCGCCTT CATCGAGGCA TTCTCCGGCC TCGACGGGCT GTTGCTGCTC
GCCGCGGGCC TCGTCGTCGT GGGCATCCTG CTGCTCGTCT ATCGCAGCCC GGTGTTGTGG
TTCTTCCCGC TGTTCAGCGC GGTGCTCGCG CTCGGCGTCT CGGCGCTGAT CATCTATCCG
TTGGCCAAGA ACAACGTCCT CACCCTCAAC GGACAGGGCC AGGGCATCCT GTCGGTGCTG
GTCATCGGCG CCGGCACCGA CTACGCGCTG CTGCTGGTGA GCCGCTATCG GGAGGAGCTG
CACGCCTACC CCAGCCGGAT CCAGGCGATG ATCGTGGCCT GGCGGGGCGC CGCTCCGGCG
ATTACCGCCT CGGCGGTAAC GGTGATCCTC GGCCTGCTCT GCCTGAGCCT CGGGGAACTG
AACTCCACCC GCAGCCTCGG GCCGGTGGCG GCGATCGGCA TCGCCTGCAC AGCCCTGATC
ATGCTGATCT TCCTGCCGGT GTTCCTGGTG ATCGCCGGGC GCTGGATCTT CTGGCCTCGC
ATTCCCCGCG TCGATCACCA GGCCGACCTC GCCGGTCACG GTCTCTGGGC ACGCTTCGCC
GGAGGCCTCG TCCGCCGGGC CCGGTGGGCC TGGATCGTCA CGACGGTCGT GTTGCTGGCC
TGCTCGTCCC TGATCGTCAC CCTGAAGGTC GACGGCCTGT CGACGACGGA CAGCCTGACG
GGGCGGCCGG AGGCGATCGT CGGCCAGGAG ATCTTCGACG CCAACTTCAG CCAGGGGCAG
GGCGCCCCGG CCGTCATCAT CACCAACACC GTTGCCGCCG CGGACGTCAT CGCCGCCGTC
CGGAAGGTCG ACGGCGTCGC CACCGCTCCC GGCTCGGTCT GCGTGGAGGT CGACTACGCC
AAACTCGCCG CCCGGTTGGC GTCCGGTGGG CGTCCGGCGG CGAGGGGTTC CGACGGATGC
GCCCCGAAGT CGGTCCAGGT CGCCCCGGTT GACGGTCGAA TCGCCATCGA CGCCGCCATC
GTCCACCGTT ATGACACCGC CGAGGCTTAC AACACGATCA CTGCGATCCG ACGGGTCGTC
CGGGAGGTGC CCGGGGCGAA CGCCCTGGTC GGTGGCCAGT CGGCGATCAA TCTCGACACC
CAGAACGCCT CGCGCCACGA TCGCAACCTG ATCATCCCGA TCGTGCTGGT CGTCATCCTC
CTCGTGCTCG GCGTCCTGTT GCGGGCGCTG CTCGCCCCGG TCCTCCTGAT CGCGACGGTG
GTGCTCTCGT TTGCCGCGAC GCTGGGCGTG AGCGCGGTGG TGTTCAACCA CGTCTTCGGG
TTCGCCAACG CGGATCCCGG CTTCCCGCTG TTTGCGTTCA TCTTCCTCGT CGCCCTGGGG
ATCGACTACA ACATCTTCCT GATGACCAGG GTCCGGGAGG AGACCCTCAT CCACGGCACC
CGTAGCGGCA TCGTGCGCGG CCTCGCGGTC ACGGGCGGGG TCATCACCTC GGCCGGGATC
GTCCTCGCCG GGACGTTCGC CGTGCTCGGC GTCCTCCCGC TGGTCTTCCT GGCCCAGGTG
GGTTTCTCGG TGGCCTTCGG GGTGCTGCTG GACACGGTTC TGGTCCGCTC GGTCCTCGTG
CCGGCGCTGA GCCATGACCT CGGTCCGAAG ATCTGGTGGC CCTCGAAGCT CTCCTGA
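
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used for anything such as recomputing the GC content. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence, then uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGACCAGCG GGGACGAGTC CGAGGTCAGG GACAGCGGCG GGTCGGTGGC ACGGGCGCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block through the same two helpers should give a value comparable to the GC content listed in the gene information, allowing for rounding on the page.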
 
Protein sequence
MTSGDESEVR DSGGSVARAQ TPPAPAAPPP PGPAVAGDGN GTLADSASGP AGSGGRPARA 
RRPRIVYVFA VLAVGWLLLG GVGGSYQGKL GEVQKNDNAA YLPNSAESTK VDTASRQFRS
VRTIPGFVVY DRQGGLTAQD KAKITADVRS FAGIHGVDAG QLGPPQFARN GAVAAVAVPL
VAQDGGREVK GDELVDVEKA VVAAARDGAP AGLAVHPAGP GGLLVAFIEA FSGLDGLLLL
AAGLVVVGIL LLVYRSPVLW FFPLFSAVLA LGVSALIIYP LAKNNVLTLN GQGQGILSVL
VIGAGTDYAL LLVSRYREEL HAYPSRIQAM IVAWRGAAPA ITASAVTVIL GLLCLSLGEL
NSTRSLGPVA AIGIACTALI MLIFLPVFLV IAGRWIFWPR IPRVDHQADL AGHGLWARFA
GGLVRRARWA WIVTTVVLLA CSSLIVTLKV DGLSTTDSLT GRPEAIVGQE IFDANFSQGQ
GAPAVIITNT VAAADVIAAV RKVDGVATAP GSVCVEVDYA KLAARLASGG RPAARGSDGC
APKSVQVAPV DGRIAIDAAI VHRYDTAEAY NTITAIRRVV REVPGANALV GGQSAINLDT
QNASRHDRNL IIPIVLVVIL LVLGVLLRAL LAPVLLIATV VLSFAATLGV SAVVFNHVFG
FANADPGFPL FAFIFLVALG IDYNIFLMTR VREETLIHGT RSGIVRGLAV TGGVITSAGI
VLAGTFAVLG VLPLVFLAQV GFSVAFGVLL DTVLVRSVLV PALSHDLGPK IWWPSKLS