Gene Francci3_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4401 
Symbol 
ID3907376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5261528 
End bp5264041 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content75% 
IMG OID637881732 
Producthypothetical protein 
Protein accessionYP_483476 
Protein GI86743076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.433686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTTG TGGACTGGTT GCGTGCGCTT CCGGATCAGG CGTTGACCCG GCTCTTCCTC 
CTACGGCCGG ACCTGGCGAC GCCGCCCCCA CCGGACTTCG AGGTGCTTGC GGCCCGCCTG
GAGATCCGGG TCAGCGTGGC CCGCGCGTTG GAACGTCTCG ACACGGGCGC CTGCGAGCTC
CTGGAGGCGT TGACGATCCT TCCCCGGCCG GTGTCCCTGG CGGAGCTGAC AAGGTTCTGC
GGGCATTCGG AGGTGTCCGA GCCGGTGGGC CGGCTTCGTG ACCTCGGGCT CATCTGGGGC
GACGATGCCG AGCTGCGGGT CACCGGGATG GTCGTCGACC TCCTCGGCGT GCGCCCGCTG
GGGCTCGGCC GTCCCGTGCG AGCCTGCCTG ACGACGTACC GGCAGTCCCA GCTCGCCCGG
CTCCTGACCG CGGCCGGCCT GCCGGTGCCC CGGGGCGCGG TCGACGAGAC GACGACCGGG
GTCGCCGTCC GGGAAGCGAT GATCGACGCG TTGAGCGACG CCTTCGCGGA CGCGGATCGG
GTGCGGGAGT GGATCGAGAC CTGCTCCTCC CGGGCGCGGC AGCTGCTGGA CCGGCTGGCC
GCGGGTCCGG CGCTCGGCGC CACCACGGAG GCGGGCCGGT TGCTGGGGGT GTCCGCGGCC
CGAACCCCGG TGGAGGAACT ACTCGCCCGC GGCCTGCTCG TCGGATTGGA CGAGGAGACC
GTCGAGCTGC CGCGCGAGGT TGGGCTGGTC GTGCGCGGCG AGCATCCGGC CGGGGTCCTG
CACCCGCATC CGCCCGCCGT CGAGGGAGAG GTCGTCGGGG TGGCGGCGGT TGACGCCGTG
GCGGCCCTGG CGGCGGACAC CCTCGTCCGG TGGGTCACCA CCCTGCTGGA CGCTTGGGGC
GCGAGCCCGG TCACTCCGCT GCGCACCGGC GGGTTGAGCG TGCGGGACCT GCGGGCCGCC
GCGAAACTGC TCGACGTTGA CGAGCGGACC GCGGCCTTCG TGGTGGAGCT GGCCGCGGCG
GCCGGCCTGG TTGACGCGAC CGCGGGGGTT GACGTCCAGT ACGTGCCGAC CACGGCCTAC
GACCGCTGGG GCACCGACCT GGTGGCGGCC CGGTGGGCCG TACTGGTCGA GGGCTGGCTG
CGCTCGCCGA CCGCCGCCTG GCTGGTCGGC GAGCGGGACG AACGGGGCCG GCAGATCGCT
CCCCTGTCGC TGGACGTGCG GCGGCCGGCC GTGCAGGACC TGCGTGCCCA GGTCCTGCGG
GCGCTGGCCG CGGCGCCCGC AGGGGTGGTG CCGACCGCGG AGTCGTTGCG TGCCCTGCTG
ACCTGGCGAG CGCCCCGGCG CGGCGGAATG CTGCTCGCCG GGATGATCGA CGGAACGATC
GTCGAGGCCG AGCTGCTCGG GCTGACCGGC CGGGGCGCGC CGAGCACGGT CGGTCGGTTA
CTCGCCGCCC AGCTCGACGC GGAGAACGCC GATCCGACGC GGAACGTGGG CGTGCCGAGC
CGATCGGCGG CTGGTGATCC AGGGCTGTGC GTCAGGCTCG CCGACGCGCT TGCGCCGCTG
CTGCCCGAGC CGGTCGAGGA GCTGCTGCTG CAGGGGGATC TCACCGCGGT TGCCCCGGGC
CCGCTGGTAC CCCGGGTGGC TGCGGAGCTG GCGCGGATGG CGGATGTCGA GTCCGCGGGC
GCGGCCACCG TCTATCGGTT CTCCGAGGCC TCGCTGCGTC GCGGCCTGGA CTCGGGCCTG
GTAGCGGAGG ACATCCACGC GATGCTCGTC CGTCTCGGTC GAGGCGGGGT GCCGCAGGCC
CTCACCTATC TGATCGACGA CACCGCCCGG CGGCACGGCC GGTTGCGGGC CGGTCCCGCC
GCGAGCTACC TGCGCTGCGA GGACACGGCG CTGCTCGCCG AGGTGGTCGC CAACCGGCGT
ACCCAGGCGT TGGGGCTGCG CCGGATCGCG CCGACGATCG TCATCTCCCC ACTGCCGACC
GCGGAGATGC TCGCCGGCCT GCGCAACGCC GGGTTCGCCC CGGTCGCGGA GGCCCCGGAC
GGGCGGGTGG TGCTGCGCCG GCCCGCCGCC CATCGGACGC CGGCGCGTCC CCGCGCGGCG
TCGGTCGACG TGGCCGACAC CCGGATCAAC CAGCTGCGCG ACGTGGTCCG GCTGGTCCGC
CGGGGCGACG ACAGCGCGCG GGCGGTCCGG GCGGCCGGCG AGGTTCCCGG CATCGAGGTC
GGGCTCACCC GGTCCGCGCC GTTGATCCTG GTGCTGCTGC AGGGAGCCGT CCGGGACCGG
CGCCGGGTGC TGCTGGGCTA CGTTGACCAG CAGGGTGCGC CGAGCGACCG GATCGTGCGG
CCCACCGTCG TCGAGGGCGG CTGGCTCACC GCGTGGGACG AACTCAGCGC CGGCCCGCGT
CGTTTCGTCC TGCACCGGGT GACCGGCGTC GCTGACATCG ACGACGCGTT CGGCGGACCG
CCGGTGGCGG CGGACTGGTC GGCCGCCGCG GAGGATCTCG CCGATCCGCC ATGA
 
Protein sequence
MTLVDWLRAL PDQALTRLFL LRPDLATPPP PDFEVLAARL EIRVSVARAL ERLDTGACEL 
LEALTILPRP VSLAELTRFC GHSEVSEPVG RLRDLGLIWG DDAELRVTGM VVDLLGVRPL
GLGRPVRACL TTYRQSQLAR LLTAAGLPVP RGAVDETTTG VAVREAMIDA LSDAFADADR
VREWIETCSS RARQLLDRLA AGPALGATTE AGRLLGVSAA RTPVEELLAR GLLVGLDEET
VELPREVGLV VRGEHPAGVL HPHPPAVEGE VVGVAAVDAV AALAADTLVR WVTTLLDAWG
ASPVTPLRTG GLSVRDLRAA AKLLDVDERT AAFVVELAAA AGLVDATAGV DVQYVPTTAY
DRWGTDLVAA RWAVLVEGWL RSPTAAWLVG ERDERGRQIA PLSLDVRRPA VQDLRAQVLR
ALAAAPAGVV PTAESLRALL TWRAPRRGGM LLAGMIDGTI VEAELLGLTG RGAPSTVGRL
LAAQLDAENA DPTRNVGVPS RSAAGDPGLC VRLADALAPL LPEPVEELLL QGDLTAVAPG
PLVPRVAAEL ARMADVESAG AATVYRFSEA SLRRGLDSGL VAEDIHAMLV RLGRGGVPQA
LTYLIDDTAR RHGRLRAGPA ASYLRCEDTA LLAEVVANRR TQALGLRRIA PTIVISPLPT
AEMLAGLRNA GFAPVAEAPD GRVVLRRPAA HRTPARPRAA SVDVADTRIN QLRDVVRLVR
RGDDSARAVR AAGEVPGIEV GLTRSAPLIL VLLQGAVRDR RRVLLGYVDQ QGAPSDRIVR
PTVVEGGWLT AWDELSAGPR RFVLHRVTGV ADIDDAFGGP PVAADWSAAA EDLADPP