Gene Francci3_2575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2575 
Symbol 
ID3904067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3036647 
End bp3040372 
Gene Length3726 bp 
Protein Length1241 aa 
Translation table11 
GC content75% 
IMG OID637879900 
Producthypothetical protein 
Protein accessionYP_481666 
Protein GI86741266 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.633416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGGCC TGAATCTGGA GATCGACGCG CTGGTCGGCC CGGCGCTGGA TCTACTGGCC 
ACCGCGGAGC GCGACGGGGA CCCGGTTGCC GCCCGGGCCG CCGTCGGCCT GCTGGCTCAG
GCATATGACC TCGCGCCCGG CCACTCCGGC GCGCCGGGCC TGGCGATGCT CATCGCCGAG
TGGGAGGGCG AGCTCGCCCA GGCGGATCTG GGCAGCTGGG ACAGTGTGGT GCTGTGGTGG
CGCCGGGGCT GGCCGGCGCC CCGGCCCGGC GGCCGCCCGT CCCTGAGCAT TGCCGAACTC
GATGCCATGG AAGGCGCGCT GCACGGCGAG CGCTACGTCA ACGGTCGCGA TCCCGCCGAG
CGGGACGCCG CCATGCGCCT GCTCGCCCCG CTGGCCGACG CGGGCAAGCT CGGTCCGCAG
GACCTACTCG GCTACGCAGC GCTCCTGATC GTCTGGTACG AGCAGGACGC GGTGCCGGCC
GCCCGCGACA CCGCGCTCGA CCTGCTGGCG CCCATGGTCA CCGCGGGCAC CGCCGACCCG
GGCATCGTGG CCGACTACGC AGAGCTCGTC TTCGACCGCG CCGAACAGAG CGGTACCACC
CGGGATTGGG CCCAGGCCGT CCACTGGCTG CGCGAATCCC TCCGCATCGG CGGCGCCCAA
AACGTCGCCG ACACCTGGGA CAGCCTCGGC CACGCCTACT GGGCGTGGAG CCGGCTCGAT
GACGACGACG CCCTGTTCGA CGAGGCGGTC GACTGTTTCA CCGCGGCGGT ACGGCACGGT
CAGCCCGCCG AGCTCCTCGC GCCGGCAGCG CTGGTGTTCG TCGCGGACCG GCTGCGCGAG
CAGCGGCGCA GCGCGGGCGG CGTGGACGAC GCGCTGCTGG CGGACATCCG CGCCGTGCTG
GCACTCGCGG ACCAGATCGT CGACCGGCCG GAACTGGATA CCGAGATCCG CGCGACGGTG
GCCATGGAGA TTCTCACGCT GCAGTACCTC ATCGTCGACG GGAACGAGAT CACCCGCCGG
CTGGTGCAGG AGGGCCTGGA GGCGTTCCCG CTGGCCGGGG ACAGGTTTGA CCGCCTGCTC
GGCTACGCCG ACGCGCACCC CGACCCGCCG CAGGGCTGGT CGAGCAGCCG GGCCGCGATG
CGGGCGATGA CCAGTCTGCT GCGCGCCATG TTCGGCACCG GGCCCCGGCC CGGCGCGGAC
TTCGGCCTGC AGCATGCCGC CGAGGCCATG CGCACGGCCG AGCACTCGGA CGACGCCGCC
GCGTTGTTCT CCCTCATCTC CGCTGTCGCC GGTGGCTTCT CCGGCGACCT GGGCCGGGTG
AAGGCCGCCG AGCAGCTGGT CGGGACCTCC GCGGACCCGA GCGCCACGGC GATCTCCCTG
TTCGCCCGAC TGGGTCAGAT GCTGCTGGTC GGCATGCCGG CCGACGGCGC CCGCACCGCC
GTCGAGCTGC TGTCCGTCTT GGCCCGGTCG CTGGCACAGG TTCTGGGCGG TGGCCTTCGG
GGCAGGCTGC AGCCACCTGC CGCCGACACG GGCAAGCCCG GTCACCTCAT CGAGGTCAGC
CAGTACCTCG ACGTCAGCCG GCGCCTCGAC GTCGGTCAGC TCCGGGAGAT CGCCGACGTC
TTCGCCGCGT TCCCGGGCTG CGACGCCCTT GTCGCGCAGG CAGCGGCCGC GGAAGTGATC
GCCGGCATCA TCGCGGCGAC TTCAGCCGGC GCGCCCGCGG ATCGCCTCGC CGAGTTGCGC
GCCCGCCTCG CGGACGCCGA CCAGGCCTGG GCCGACAGCA GTGCGCAGCT ACCGCCACCC
CCGCGGGCCA TGGGCACGCA GCTGTTCGCC CACGGCTGGC AGCTGTTCGC GGGAGTCACC
GGGGACCGGT CCGCCGCCCG GCTCGCCGTG GCACGCTCCC AGGCGGTCCT GGACTGGTTC
GACGGTCCTG ACCACCCGAT GTGGCCGATG GTCGTCATGA CCGCCGCCCG CGCCCGGCGA
CTGCGCGGCG AGCCCGGCGA CCTGCCCGGA GGTCGCCGCC TGGCGCTGCG AGCCCTGCGC
GGGCACGCCT GGCTCGTGCT GCTGCAGTCG GGCACCCGAG ACGCGCTGGA GGTCGCCCGC
GGGGCGGCGG CGGATGCCCG GGTGGTGCTG GAGTGGTGTG TCGAGGACCT GCGGGGCGGC
GACGAGCAGG CCCGCGACGA CCTGGTGACC GCCGTGGAGG CCGGCCGCGG CCTCGTCCTG
CACGCGGCGG TGACCACCCG GTCGGCCGCG GAGCAGCTGC GTGAGCTTGG GCATGCCGAC
CTCGCGCAGG CCTGGGCAGC CGCAGGCCTC GAACCGGCCG AGGCCGACCC GGCTGCGCAA
GGCCCCGGCT GGTTCGCCGA CGGGGCCGAA CGGTCCGCGC TGCGCCGGGC TGTGCTGGCG
GCGCTGAGCG AGTCCGCCGC CGGACCCGAC AGCCTGCTGC TTCCACCCAG CATCGGCGAG
ATCCGCGCAG CGGTGGGCAC GCACGGCTCC GACGCACTCG TCTATCTGGT CCCCGGCGTC
GACCGGCCCG GGCATATCAT CATCATCCCG GCTGACGGGA CCAGACCGGT GGAACATGTC
GAGCACCCGG GGCTCGTCAG CGCCGACGAC TCCCCCATGG GCCGGTACCT GGCGGCGTAC
GCGGGTTGGC GCCAGGCTGC AGGCGGTGAG GGTGCCAGGC AGGGTGTGGC CGAGCCAGCG
GCCGCGTTCG CGGCGTCGCG CGCCGCGCTG GCGGCGTGGC GCGCTGCGCT GGCGGAGCTC
ACCGAGTGGG CGGGTCGCAT CGCCGAGGTG CTGCTGGACC GGGTCTCGGC TGTCGCGGCC
GACACCACCT GGCCGACCTT CGTAATCACC TCGGTGGGGG TGCTCGGCCT AGTGCCCTGG
CACGCCGCCG TGCTGGACGT CGAGGCGCCC GGTCATCAAA CCGGCGGCGG CACCCGGCTC
GCCCAGCGGG CGACGGTGTC CTACATCCCC TCGGCCCGGC TGTTGTGCCG GACGGTCGCG
ATGCCCGCCG CTGCCGACGG CGACGTGCTG ATCGTCGGCA ATCCGGTGGG CGCGGCGTCC
CACCCTGCCG GTGACGCCGC CTGGCATCTA CCATCGGCCT TCTACCCCCG CGCCATGCTG
CTGGGCGGGG GGCCGTCCCG GTCGGGCCGC GAACCGGCCA CGGTGGCGGG CGCGGGGGCG
TCCGGGGACG GGACGCCGGC CGAGGTGCTC GCCTGGCTGC GCCGGCCAGG CGCCCGCCGC
CTAGTGCTCC ACCTGGCGTG CCACGGCTTC GCCGTCCCCG CGGACCCGGC GGCGTCGCAT
CTGGAGCTCG CCGACGGACG GCCGCTGACC GCCCGCGAGC TGCTGGCCGA GCAGAGCGCC
GCCACCCGCG TCGAGCGGGT GTTCCTCGCT GCCTGCTCGA CCAACGTCAC CGGTGCGGAC
TACGACGAGG CGTTCAGCAT CGCGACCGCG TTCCTCGCCG CCGGCGCCCG CACCGTGTAC
GGCTCTCTGT GGGATCTGCC GGACAGGTAC ACGGCCACCA TGATGTTCGT GTTGCACCAC
AACCTGGTCG TGGAGGACTG TTCACCGGCC GAGGCCATGC AGGCCGCCCA GGACTGGGCG
CTGGACCCCT ACCGGACGGC GCCGGCCACA ATGCCCGGCC CGGTCGCCGC GGCCATGGTG
GACGACGCGT GGCTCCCCTT CGTTGACCCG GCCGCCTGGG CCGGGCTCGT CCACCTTGGC
GCCTGA
 
Protein sequence
MDGLNLEIDA LVGPALDLLA TAERDGDPVA ARAAVGLLAQ AYDLAPGHSG APGLAMLIAE 
WEGELAQADL GSWDSVVLWW RRGWPAPRPG GRPSLSIAEL DAMEGALHGE RYVNGRDPAE
RDAAMRLLAP LADAGKLGPQ DLLGYAALLI VWYEQDAVPA ARDTALDLLA PMVTAGTADP
GIVADYAELV FDRAEQSGTT RDWAQAVHWL RESLRIGGAQ NVADTWDSLG HAYWAWSRLD
DDDALFDEAV DCFTAAVRHG QPAELLAPAA LVFVADRLRE QRRSAGGVDD ALLADIRAVL
ALADQIVDRP ELDTEIRATV AMEILTLQYL IVDGNEITRR LVQEGLEAFP LAGDRFDRLL
GYADAHPDPP QGWSSSRAAM RAMTSLLRAM FGTGPRPGAD FGLQHAAEAM RTAEHSDDAA
ALFSLISAVA GGFSGDLGRV KAAEQLVGTS ADPSATAISL FARLGQMLLV GMPADGARTA
VELLSVLARS LAQVLGGGLR GRLQPPAADT GKPGHLIEVS QYLDVSRRLD VGQLREIADV
FAAFPGCDAL VAQAAAAEVI AGIIAATSAG APADRLAELR ARLADADQAW ADSSAQLPPP
PRAMGTQLFA HGWQLFAGVT GDRSAARLAV ARSQAVLDWF DGPDHPMWPM VVMTAARARR
LRGEPGDLPG GRRLALRALR GHAWLVLLQS GTRDALEVAR GAAADARVVL EWCVEDLRGG
DEQARDDLVT AVEAGRGLVL HAAVTTRSAA EQLRELGHAD LAQAWAAAGL EPAEADPAAQ
GPGWFADGAE RSALRRAVLA ALSESAAGPD SLLLPPSIGE IRAAVGTHGS DALVYLVPGV
DRPGHIIIIP ADGTRPVEHV EHPGLVSADD SPMGRYLAAY AGWRQAAGGE GARQGVAEPA
AAFAASRAAL AAWRAALAEL TEWAGRIAEV LLDRVSAVAA DTTWPTFVIT SVGVLGLVPW
HAAVLDVEAP GHQTGGGTRL AQRATVSYIP SARLLCRTVA MPAAADGDVL IVGNPVGAAS
HPAGDAAWHL PSAFYPRAML LGGGPSRSGR EPATVAGAGA SGDGTPAEVL AWLRRPGARR
LVLHLACHGF AVPADPAASH LELADGRPLT ARELLAEQSA ATRVERVFLA ACSTNVTGAD
YDEAFSIATA FLAAGARTVY GSLWDLPDRY TATMMFVLHH NLVVEDCSPA EAMQAAQDWA
LDPYRTAPAT MPGPVAAAMV DDAWLPFVDP AAWAGLVHLG A