Gene Francci3_1679 details

Gene Information

Locus tag: Francci3_1679
Symbol:
ID: 3903066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2015011
End bp: 2017251
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 74%
IMG OID: 637879017
Product: hypothetical protein
Protein accession: YP_480784
Protein GI: 86740384
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
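
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2015011
end_bp = 2017251
gene_length_bp = 2241
protein_length_aa = 746

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# A CDS of N nucleotides holds N // 3 codons; the last one is the stop codon,
# which contributes no residue to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 2241 747 746
```

Under these assumptions the reported gene length matches the coordinate span, and the reported protein length matches the codon count minus the stop codon.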


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.548222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGC AGGGAAGTGG GACGGACGGT CAGGCGTCCG ACGCCCGGTC GACGAACGGC 
GACGCCGCGG CCAATCCCGA AACCGGAGTA CCCCGGACCG GGCAGAGCGG TGGGACCACG
GAGAAGACCG CACCGCGCCC GGGGGCCGGG AGGAACGCGG CCGCCACGTC CGCCCGTGCG
CCGAATCCGA CCTCCTCCCA GAGCGGACGA GCAGCGAGCG CGAGTTCGGC CGATGCGGAT
CCGGCCGGCC GGAGCGGTGC GGAGCGGGCC GGAAGCCCGC AGGGCACGAC AAAGACAGAG
AATCCCGGAA AGCCGGGAAA CTTCGGAAAG CCGGGAAACC CCGGGAGCGC GGGGAGCGCG
GTTGCCGCCC CGCGGCAGGG TACCGGCCGT ATCCCGCCGA CGTCGGTTCC ACCCTTGGGA
GGAGCTTCAC CGGCGCGGTC CGTGGGCGGC GGGCAGCGGT CGGTGCCGCC GGCTCCCGCC
CGGCCAGCCG CGGCGGACAC CCCGGAATCA TCGAAGCGGT CGACGCCACC GAAGACCGAC
ACACCCGGCC AGCCTGCTTC CGGCCAGGGG CGCGGCGCCG TCCGACGGCC CCCGGCTTCG
TGGCCCGGGC CGCCCGTCTC ACCCCCGCCC ACCCCGGCGG CGGCGCCACC GGCCGCGAGC
CGTTCGGGTA GTCCCGCGGC ACCGCCTCCC CCGAGTCGCG CGGGGAATCC GGTCAGTTCG
GCCGGAAGCG CGCGTTCGAC GCCCGGCCGT ACGAGTGCCT CCCCACCGCC GGTCCCCGCG
CCGCTGCCGG CACCGGCGCC GACCCGGGCT CGCCCGCCGG CGCCGTCCGA GCTGTCCGCC
AGCCCGTCCC ATCCCGCCGG ACAGTCCCCG ATCCGGGAAC GGATCAGCCC GTCGTCGCCC
GACGGTCCAC TCCCACGTCG CGGGGACGAG GGCGGCCGCG GATCTCGTCC CGACCGGCCG
GTACCGCGAT CCGGGGCGTC CGGTTCGCCG ACGCCGACGC CGACGCCGAC GCCGACGCCG
ACGCCGACGC CGACACCGAC ACCGACACCG ACACCGACAC CAACGCCGAC GCCGACGCCG
ACACCGACGC CATCCCGACC CGCGTGGTTC GAGCGCGACG TCTCGAGGAC CTCCGCCGGT
TCCGGTGCCA GGGTGCCACC CGTTCCCGGC GTTGAGGAGG GTCCTCGGCC GCGAGGAGTA
CCGGCGCCCG TTCCCGTCCG GGATCATCGA GAACCTGGAA GGGATCCCCT GGAGGCCGCG
GCCGGGCGTC GGATGGCCGC CCGTACCGAC GGGGATGATC ATGTGCTCGA TCATGATCGT
GGTCGTCCAC GGGCGTCGCG TTCCACCGAC CCGCCGACCA TGCAGGTGCG CCGACAGCTC
AGGCCCGATC TCGAACCGGA TCCGATGCCC TCCGGCACGA GGGACGAGGC GTGGCCCCTA
CCGGTCTCCG GCGCGGGGCC GGGCGGCGAG CCGGTGACCG ATGCCTATCC CTACCCCGAC
CCGGCCTTTG ACCGCCGGCT GCACGCGTAT CCCGGGCGGA ACCGGGGCGC GGGTGACGAC
CATGACCTGG AGGGTCAGGA ACGGCGTGCG GACCCGGGGC CGCGGCCGGT TGTCCCGCGT
CCGCAGGCCT TGCCCGAGAC GCATCGTCCG TCGACGCCGT CCGAACAACC GACGCCGTCC
GAACAACCGA CGCCGTCCGA ACAACCGACG CCGTCCGAAC AACCGACGCC GTCCGAACAA
CCGACGCCGT CCGAACAACC GACGCCGTCC GAACAACCGA CGCCGTCCGC TTTCCCGCCG
GCCACCGCGC CCCGGCCGGT CGCCGCGCTG ATGGTGGTGC TGGCGGCCGT CGGCGCCGGC
ATCGGTTCCG TCCTGCCGTG GAGCGAGATG TCCAGCGGCG ACGAGACACG TACGTTCAGC
GGTCTTGTGG TCGGAGACGG GCGCATCGTC GGTGTCCTGG CGGTCACACT CGGTGCCATC
GGGGTGGGAC GGTTGGTGCG TCGGCCGCTT GCCGGTGCGA TCGATGTCGC CCTCGCCCGA
ATCATCGCAG TTCTGATCGT GATCATCACC GCTCTGGACC GGGTCTACGG GCCGCCGACC
CTCGCATCGT TCCGCGCGAT TTCCGCGGAT GCAATCTCGA TCCGTCCACA GGCAGGGATC
ACGGTGTGCC TCGGCGCCGG CCTCCTCGCC CTGATCGGGG CGATGCTGCT CCAGCCGAGG
ACGAAGCCCC CTCGAAGATG A
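
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation such as GC content. The following is a minimal sketch of that step; `clean` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for illustration, so the value it prints is not the whole-gene GC content reported in the record.

```python
def clean(seq_block: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (illustration only).
first_line = clean(
    "ATGGCCGAGC AGGGAAGTGG GACGGACGGT CAGGCGTCCG ACGCCCGGTC GACGAACGGC"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Passing the full cleaned gene sequence to `gc_percent` would give a figure comparable to the GC content field.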
 
Protein sequence
MAEQGSGTDG QASDARSTNG DAAANPETGV PRTGQSGGTT EKTAPRPGAG RNAAATSARA 
PNPTSSQSGR AASASSADAD PAGRSGAERA GSPQGTTKTE NPGKPGNFGK PGNPGSAGSA
VAAPRQGTGR IPPTSVPPLG GASPARSVGG GQRSVPPAPA RPAAADTPES SKRSTPPKTD
TPGQPASGQG RGAVRRPPAS WPGPPVSPPP TPAAAPPAAS RSGSPAAPPP PSRAGNPVSS
AGSARSTPGR TSASPPPVPA PLPAPAPTRA RPPAPSELSA SPSHPAGQSP IRERISPSSP
DGPLPRRGDE GGRGSRPDRP VPRSGASGSP TPTPTPTPTP TPTPTPTPTP TPTPTPTPTP
TPTPSRPAWF ERDVSRTSAG SGARVPPVPG VEEGPRPRGV PAPVPVRDHR EPGRDPLEAA
AGRRMAARTD GDDHVLDHDR GRPRASRSTD PPTMQVRRQL RPDLEPDPMP SGTRDEAWPL
PVSGAGPGGE PVTDAYPYPD PAFDRRLHAY PGRNRGAGDD HDLEGQERRA DPGPRPVVPR
PQALPETHRP STPSEQPTPS EQPTPSEQPT PSEQPTPSEQ PTPSEQPTPS EQPTPSAFPP
ATAPRPVAAL MVVLAAVGAG IGSVLPWSEM SSGDETRTFS GLVVGDGRIV GVLAVTLGAI
GVGRLVRRPL AGAIDVALAR IIAVLIVIIT ALDRVYGPPT LASFRAISAD AISIRPQAGI
TVCLGAGLLA LIGAMLLQPR TKPPRR
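
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the sequence is given in the coding orientation starting at the first codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order; translation table 11
# uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna_block: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, or T.
    """
    dna = "".join(dna_block.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGCCGAGC AGGGAAGTGG GACGGACGGT"))  # MAEQGSGTDG
print(translate("ACGAAGCCCC CTCGAAGATG A"))           # TKPPRR
```

The two outputs match the first ten and last six residues of the protein sequence; the final line can be translated on its own because the preceding 2220 bases are a multiple of three, so it starts in frame.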