Gene Francci3_4389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4389 
Symbol 
ID3907363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5246519 
End bp5247976 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content73% 
IMG OID637881720 
Producthypothetical protein 
Protein accessionYP_483464 
Protein GI86743064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0131212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCGG ACCCGTCGAG CCGCCTCCCG GTGCCGCGGC GCGCCGGGGG GCCGGCTTTG 
TCCGCACCCG GACGGCCGGC CGCGCTCGCT CGTCCCGGAG CCGGCACCCC CGACCGGTCA
GCCCGGCGCC TCTGGGCATG GTGGCTGGTC AGCCGTGCCG TCCTCGTCAC CCTGGTGGTG
ACCGACCAGG CGTTCGGCAT GCAGCAGAGC GTGCTCGGCG ATCTACACAT CTATGCGGAG
TGGGGCCGCG GGCTCAGTCG CGGCGAGGGG ATGCCCACCG GTGACGACCG TTGGCAGTAT
CCGCCGGGCG CGGCAGTTGT GTTCGTGTTG CCCGACGTAA CTCACGCCTA TCTGGGATTG
CCCTATCGAG CGGTGTTCGT CGGGATGATG GTGGCCGTCG ACGCCGTCCT GACCGCCGCG
CTGGTGCGTC GCTCGATCCC GGCGGCGGGG TTCTGGGTGG CGGGTCTCAC CGCGTTGGGA
CCGGTGGCGC TGGCCCGTTT CGATCTGCTG CCGGCCGCGG CGGCGGTCGC GGCCGTGCTC
GCGCTCGCCG CGGGCCGGTT CGGCCGGGCG GGATTTCATC TCGCCGCGGG TGCGCTGCTC
AAGATCTGGC CGGTCCTGCT GCTGATCGCC CTGTCCCGTC CGCCGCTCCG GTCCCCGGGT
CGGTCGGCGT CCATCGATCC GCCCCCCGAT CCGCCCCCCG GCCGGCCCAC CGGTCGATTC
CCGGCGCCCT CGCCGGACTG GTTGCGTCTT GCCGCCGGCG TCCTGTGCGC CGTCGGGGGG
CTCGCGGCGC TGTTCGCGCT CACCGGGTGG TGGCGGGACG CGTTCGGGTT TCTCGGCGCG
CAGCAGGCCC GCGGACTACA GGTGGAGGCG GTGGCCGCGA CGCCGTTCGT CGTCGCCCAC
ATGCTCGGCC ATGGAACGGC ACCCGGCTAT TCCTACGGTT CGCTCCAGTT CGACACCCCG
CTGGCCCGCG GCGTGGCCAC CGCATGCTCG TTGACCGAGA TCGTAGTGAT CGCCATTTCG
GCCCTGTGGT GGTGGGGGTG GGCGCGTCCG CCCGCGGCGG CGGACGGCGC GGGCCGGGCC
GGTGTGCTCG TGGGGCGCAC GCTCGCGCTG CTGCTTGTCG TGGTCGTGAC CGCCCGCGTG
CTCAGCCCGC AGTACCTCGT GTGGATCATG GCGCTCGTCG CGGTCTGGAT CGCGATGGCC
CGGCCCGCCG GCTCCGGCGG CCTGACCGTC CCGGCTGGTG GACCGGCTGC CCCCGTCCCG
GCCGGTGGAC CGTCGTGGCG GTGGGTGGCG CTGCTGCTGA TCTGCGTTGT GCTCTCCCAG
GTCGTCTATC CGTGGCGGTA CAACGACGTC GTCCAGGGAC GGATCATGAT GAGCCTGGTC
CTGGTGGTCC GCAACGTCGG ACTGGTCACC GTCTGCGGGT GGGCGTGGCG TGCCGCCGCC
GGATCGCGAC CAGCGTGA
 
Protein sequence
MRADPSSRLP VPRRAGGPAL SAPGRPAALA RPGAGTPDRS ARRLWAWWLV SRAVLVTLVV 
TDQAFGMQQS VLGDLHIYAE WGRGLSRGEG MPTGDDRWQY PPGAAVVFVL PDVTHAYLGL
PYRAVFVGMM VAVDAVLTAA LVRRSIPAAG FWVAGLTALG PVALARFDLL PAAAAVAAVL
ALAAGRFGRA GFHLAAGALL KIWPVLLLIA LSRPPLRSPG RSASIDPPPD PPPGRPTGRF
PAPSPDWLRL AAGVLCAVGG LAALFALTGW WRDAFGFLGA QQARGLQVEA VAATPFVVAH
MLGHGTAPGY SYGSLQFDTP LARGVATACS LTEIVVIAIS ALWWWGWARP PAAADGAGRA
GVLVGRTLAL LLVVVVTARV LSPQYLVWIM ALVAVWIAMA RPAGSGGLTV PAGGPAAPVP
AGGPSWRWVA LLLICVVLSQ VVYPWRYNDV VQGRIMMSLV LVVRNVGLVT VCGWAWRAAA
GSRPA