Gene Francci3_2911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2911 
Symbol 
ID3903975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3428334 
End bp3430919 
Gene Length2586 bp 
Protein Length861 aa 
Translation table11 
GC content72% 
IMG OID637880232 
Producthypothetical protein 
Protein accessionYP_481998 
Protein GI86741598 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein
[COG2308] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.69744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.672466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CACGAATCGT GGCCGTCGGC GTCGAGGAGG AATTTCACAT CCTCGATCTC 
ACTACCCGTC AGCTCGTTCC CAGGGCCGAG GAGGTCCTCC GTCGACTCGA CGGGGACTCG
TTCTCTCCAG AGCTATTGAA ATCGGTTGTC GAGACGAACA GTCAGCCGAC GGCGGATCTG
CTGGAGCTGC GGACGAACCT GCTCGATCTG CGGCGGCGAC TCGCGGAGGT GACCGGCGAG
CTCGGGCTCG GGCCGGCCGC CGCGGGCACC GTGCCGATCG TGGACATGGA TCTGCTCGAC
GTCTCCCGGG ACGACCGTTA CGAGCAGATG ACCGAGGACT ACCAGATCGT CGCGCGCGAG
CAGCTCATCT GCGGCGCCCA AGTGCACGTC GACGTGGCCG ACCGCGACCT CGCGATGGCC
GTCGTCGCCT GGACCGCGCC GTGGCTGCCG ATGCTGCTGG CGCTCTCGGC CAGCTCGCCG
TTCTGGATGG GCGCCGACAG CGGCTACGCG AGCATGCGCA CGCTGGTGTG GCAGCGCTGG
CCGACCGCGG GGGTCGCCGG GTCCTTCCGG ACCGCCGCGG AGTACGACCA GCTCGTGGCA
GACCTGATCA AGTCCGGGGT GATCAGCGAC CCGGGGATGG TCTACTTCGA CGTCCGCCCG
TCGGCCCACC TGCCCACCGT CGAGCTGCGC ATCTGCGATG CCTGCCCGGA CGTCGACAAC
GTCATCCTTA TCGCCGGGCT GTTCCGGGCG CTGGTGTGCC AGGCCATCGA GGAGATCGAG
GCCGGCGGCC AGGCCCCGCC GCCACGCGCG GAGCTGCTGC GCGCGGCGAC CTGGCGGGCG
GCCCGGTCCG GCCTGGAGGG CGACCTCGTC GACATCCTCG GTGCCGGCCC GATACCGGCC
CAGGCCATGC TGCGCCGGCT GCTCACCGAG GTCCGCCCGC AGCTCGAACG GTTCGACGAC
TGGGAGCTCA TCGACAACCT GGCCGAACAG GCCGTCGGCC GGGGCAGCTC GGCGCACCGC
CAGCGCCGGG CCTTCGCCCG TCGGGGTCTG CTCACCGACG TCGCGGATCT GGTGCTGGCC
GAGACCCGCG ACGTCCCGCC CGCGGGGGCG GCGGCCGCGC TGGGGAGCGC GCCCGCTGTG
TCCGCCTCCG ACCAGATCGC GCCGCGGCTG CTCGAGCGCT ACCAGCCGAC CGGGTACGAC
GAGATCGTCG ACGCCCGCGG CGCGGTCCGT CCCCAGTACC GGGCCGTCAT GCGCACCTTG
GAACGGCTCG GCCCGGGCAT CCTCGACGAA CGGGTCGGGA CGCGCGAGGC CGAGCAGAAC
GACCGGGGGA TCGTCTTCCG GGCCTCGGGG GACTCCGCCA GCCGTCCCTT TCCGTTCGAC
CTGGTCCCGC GGATCATCGC CGCCGACGAC TGGACGACCC TGACGACCGG GCTGAGCCAG
CGGGTCCGGG CCCTGGAGGC GTTCCTGCAC GACATCTACG GCGAGCGGGC GGCCGTCGCC
GACGGGATCG TGCCGGCCTG GGTGGTCAAC GACGCGCCGA GCCTGCGCCA CGGCGGTCGC
GCCGTCGGCC CGGACGCCAT CCGGGTGACC GTCGCCGGCA TCGACCTGGT CCGCGGCGGC
GACGGCGGCT GGCTGGTGCT GGAGGACAAC CTGCGCGTGC CGTCCGGTAT CGCCTACGCC
ATGGAAGGCC GGCGGCTCGC CGAGTCGGTG CTGCCGGAGC TCGGTCCGCC GGCCGGCATC
CTGCGACTGG ACGGCATACC CGCCCTGCTG CACGAGGCGC TGGTCGCGGC GGCCCCGGCG
GCGGCGACGG GGGACCCGGC GGTGGCCGTG CTGACCGGCG GGAAGACCGA CGCGGCGTAC
TTCGAGCATT CCCTGCTCGC CGAGAAGATG GGCGTCGCGC TGGTGGAACC GGCGGATCTG
CTGGTCGACG ACAACGACGT CGTCTACCGG ATCGACGGCT CCCGGCGGTG CCGCGTCGAC
GTCCTCTACC GCCGGATGGA CGAGGACGAC CTGTTCGGCG CCCTCGGAGC CGCCGGCACC
CCGCTGGGGC TCCCGCTGCT GCGCGCGATC CGGGCCCGCC GGGTCGGCAT AGCGAACGCT
CTGGGTAACG GGGTGGGTGA CGACAAGGTC GTCTACGCCT ACGTCCCGCG GATGGTCACC
TACTACCTGG GCGAGCAGCC GGTGCTCGAT GACGTGCCGA CCTATGTCTG CGGCGACCCG
GAGCAGTGTG AACACGTGCT GGACAACCTC GATCAACTGG TGGTGAAACC GGTCGACGGG
TACGGCGGGT CCGGCGTGGT CATCGGGCCG CACGCCGAGC CCTACCGGCT TACCGAGGTC
CGCGAGCGGA TCCTGGCCAA TCCCCGGCAG TGGATCGGCC AGGAACTGGT GTCGCTGTCG
ACCCATCCGA CCTGGCATGA CAGCCATCTC GAACCCTGCG CGGTAGATCT GCGGGTCTTT
GTCTACGCCG GGCGGGAGCC GCGGGTGGTG CCCGCGGCGC TCAGCCGGGT GGCGCCTCCC
GGCAGTCTGA TCGTCAACTC CTCGCAGGGC GGGGGATCCA AGGACACCTG GATTCCCCGC
CGTTAG
 
Protein sequence
MSDARIVAVG VEEEFHILDL TTRQLVPRAE EVLRRLDGDS FSPELLKSVV ETNSQPTADL 
LELRTNLLDL RRRLAEVTGE LGLGPAAAGT VPIVDMDLLD VSRDDRYEQM TEDYQIVARE
QLICGAQVHV DVADRDLAMA VVAWTAPWLP MLLALSASSP FWMGADSGYA SMRTLVWQRW
PTAGVAGSFR TAAEYDQLVA DLIKSGVISD PGMVYFDVRP SAHLPTVELR ICDACPDVDN
VILIAGLFRA LVCQAIEEIE AGGQAPPPRA ELLRAATWRA ARSGLEGDLV DILGAGPIPA
QAMLRRLLTE VRPQLERFDD WELIDNLAEQ AVGRGSSAHR QRRAFARRGL LTDVADLVLA
ETRDVPPAGA AAALGSAPAV SASDQIAPRL LERYQPTGYD EIVDARGAVR PQYRAVMRTL
ERLGPGILDE RVGTREAEQN DRGIVFRASG DSASRPFPFD LVPRIIAADD WTTLTTGLSQ
RVRALEAFLH DIYGERAAVA DGIVPAWVVN DAPSLRHGGR AVGPDAIRVT VAGIDLVRGG
DGGWLVLEDN LRVPSGIAYA MEGRRLAESV LPELGPPAGI LRLDGIPALL HEALVAAAPA
AATGDPAVAV LTGGKTDAAY FEHSLLAEKM GVALVEPADL LVDDNDVVYR IDGSRRCRVD
VLYRRMDEDD LFGALGAAGT PLGLPLLRAI RARRVGIANA LGNGVGDDKV VYAYVPRMVT
YYLGEQPVLD DVPTYVCGDP EQCEHVLDNL DQLVVKPVDG YGGSGVVIGP HAEPYRLTEV
RERILANPRQ WIGQELVSLS THPTWHDSHL EPCAVDLRVF VYAGREPRVV PAALSRVAPP
GSLIVNSSQG GGSKDTWIPR R