Gene Francci3_3673 details


Gene Information

Locus tag: Francci3_3673
Symbol:
ID: 3905357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4404196
End bp: 4405866
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 69%
IMG OID: 637880999
Product: heat shock protein 70
Protein accession: YP_482754
Protein GI: 86742354
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
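
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4404196, 4405866)
print(length)                  # 1671
print(protein_length(length))  # 556
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.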


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.192305
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.510271
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGGCA CCAAGGTGTT CGGGATCGAC CTGGGCACGA CCTACTCCTG CATTGCCCAG 
GTCGACGAGT ACGGCCGACC GGAAGTGATC CGCAACATCG AGTCCCAGCC GACGACGCCT
TCGGTCGTCC TGTTCGACAC TGGGGCGGAG GGCCCAACCT CTTTCGTGGT GGGAACCCAG
GCCAAGCGCC AGGCTCGCAT CCGTCCCGAC GATGTCGCCC GGCTGGTCAA GCGGCACATG
GGCGCGTCGG ACTGGCGGTT CGTGGCGCAT GACGAGGAGT ACAGCGCCGC CGCGGTGTCG
AGCCTGGTGC TCAAGGCGCT CGCTGCGGAC GCCGAACGCG CGACCAGTGT CCCGGTCACC
GATGTAGTGA TCACCGTGCC CGCGTACTTC GGTGACGAGG AGCGCAAGGC GACGAAGCTG
GCCGGCGAGC TCGCCGGCCT CAACGTGGTC GACATCATCA ACGAGCCGAC CGCCGCCGCC
TTCGCCTACG GATTCGGCCA GGACGGGGCC GAGGAGTCGA CCGTGCTGGT CTACGACCTC
GGCGGTGGCA CATTCGACAC CACGGTCATC AGGCTGAGCG AGGGCGCGAT CACCGTGGTC
GCCACGGATG GCGACCACGA GCTGGGCGGT GCGGACTGGG ACAACGAACT CGTCCGCTAC
CTGGCGCAGA AGTTCACCGA GGCGCAGCCC GACGCGGGCG ACCCGCTCGA CGACGTCTAC
GACGAGCAGG AGCTGCTGGC CGCGGCCGAG GACGCGAAGC TGGCGTTGTC CGGCCGGGAC
AGCGTCGACG TGCTGGTCGT GCACAACGGC AGGCGCACAA GCGTGCCGGT GACCCGGACC
GTCTTCGAGG AGATCACCGG CCCGCTGCTG CGGCGCACCA TCGACCTGAC CGGCTCGGTG
CTGGCGCGAG CCCGGGAGAA GGGTGTCGAG AAGATCGACT TGTGCCTGCT GGTGGGGGGC
ATGAGCAAGA CACCGGCGGT GGCCCGCCGG CTGCAGGAGT CCTTCGGGCT CACCTCTCGG
CTTGCCGATC CCGATCTCGC CGTCGCCAAG GGCGCCGCGG TGTACGGGCA GAAGAAGGCA
CTGGAGCGCG AGGTCCACGC GGACCTGGTC GCCAGCGGGC ATCTACGTCC CGACCAGGAA
CTCGCCGCAG CCGACGCCGT GGACGTGGAG AAGGCGGCCG CAGCCAGCGC CGAGGAGGCG
GGACTCTCGA CCGCCTCCGT GGTCGATCTC GTCCGCACGA AAGTGACGAA CGTCACGTCA
CGTGGCTTCG GGATCTTCGC CGAGGACCGG GGCACACCGG TCGCGGCCTT CCTCGCCCAC
CAGAACGACC CGCTGCCTAT CGCCGTCACC CGGACCTTCT ACACCGTCGT CGACGATCAG
GCCGAGGTGG ACATCCGGGT CTTCGAGCAG GGCACCACCG CCGAGTCCAC GGCGATCGAC
GACAACAAGG TGATCGTCGC CGGCTCCATC AGCGGGATTC CGCCTGGCCA CCCGTTGGGC
ACCCCCGTTG AGGTGACCTT CACGATGGGC GGGGACCAGA CGATCCAGGT CACCGCCTCG
CACGAGGGCG CGGCCACCCC CTTGGTGCTC GAGGTGCGCG CCGGGGTCGG CTCGGAGGAG
ATGCGGGCCG TCGAGTCGGC GAAGGTCAGC CTGCTCAAGC AGCGGGACTG A
 
Protein sequence
MAGTKVFGID LGTTYSCIAQ VDEYGRPEVI RNIESQPTTP SVVLFDTGAE GPTSFVVGTQ 
AKRQARIRPD DVARLVKRHM GASDWRFVAH DEEYSAAAVS SLVLKALAAD AERATSVPVT
DVVITVPAYF GDEERKATKL AGELAGLNVV DIINEPTAAA FAYGFGQDGA EESTVLVYDL
GGGTFDTTVI RLSEGAITVV ATDGDHELGG ADWDNELVRY LAQKFTEAQP DAGDPLDDVY
DEQELLAAAE DAKLALSGRD SVDVLVVHNG RRTSVPVTRT VFEEITGPLL RRTIDLTGSV
LARAREKGVE KIDLCLLVGG MSKTPAVARR LQESFGLTSR LADPDLAVAK GAAVYGQKKA
LEREVHADLV ASGHLRPDQE LAAADAVDVE KAAAASAEEA GLSTASVVDL VRTKVTNVTS
RGFGIFAEDR GTPVAAFLAH QNDPLPIAVT RTFYTVVDDQ AEVDIRVFEQ GTTAESTAID
DNKVIVAGSI SGIPPGHPLG TPVEVTFTMG GDQTIQVTAS HEGAATPLVL EVRAGVGSEE
MRAVESAKVS LLKQRD
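
The gene and protein sequences above can be cross-checked with a short script. This is a sketch only, not the database's own method: it assumes the amino-acid assignments of the standard genetic code (which translation table 11 shares), treats a first codon of ATG, GTG or TTG as methionine (a simplification of the table 11 start-codon set), and strips the spaces used to group the sequence into blocks of ten. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) mapped to the standard amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified set of start codons that are read as Met at position 1 (assumption).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGCAGGCA CCAAGGTGTT CGGGATCGAC"))  # MAGTKVFGID
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields of the record.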