Gene Information |
Locus tag | Franean1_3800 |
Symbol | |
ID | 5672164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4506842 |
End bp | 4509484 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242679 |
Product | heat shock protein 70 |
Protein accession | YP_001508099 |
Protein GI | 158315591 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0443] Molecular chaperone |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.122258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACG CGCTCGGCAT CGACGTCGGC ACGACGTTCA CCGCCGGTGC CATCTGGCGG GATGGCCGGG CCGAGGCGTT CGGGCTCGGC ACGCACTCCA CCGCGGTCCC CAGCGTGCTG TTCCTGCGGG ACGACGGCGT GATGCTGGTC GGGGAGGCAG CCGAGCAGCG GGCGGTGACC GAGCCGTCCC GGGTCGCCCG CGAGTTCAAG CGGCGGTTCG GCGACGACGT GCCCGTCCTG CTCAGCGACA CCTGGGTGAC CGCCACCGAA CTGTTCGCCG ACATGATCCG CTTCGTGGTC GGGAAGGTGA CCGAACGCGA GTCGCAGGCC CCCGGCTACG TCATGCTGAC CTGCCCGGCC ACCTGGTCCG ACCACCGCAG GGGGCTGATG GAGGACGCCG CCGGCCTGGC CGGGCTCGGC CAGGTGGGCC TGGTGGCCGA GCCGACCGCC GCGGCGATGT ACTACGCCGC CCAGGAAAGG CTCGAGCCGG GCGCCCTGCT CGGCATCTAT GACCTCGGCG GCGGGACCTT CGACGCGACC GTGCTGCGCA AGACCGCCGG CGGATTCGAG TTGTGCGGCG ATCCCGGCGG CGACGACGAG ATCGGCGGGG TGGACGTCGA CCAGGCGGTC GTCGACCACA TCGCGCGGGC GGTCGGCCCG TCCTGGCACG AGCAGGACAC CTCCGACCCG GCGACCGCGC GAGCGCTCGC GGCCGTCCTC GCCGCCGCGG TCACCGCCAA GGAGACCCTG TCGCAGGACC TCCAGGCGGA GATCCCGGTC ATACTGCCCG GCTGCAACAA GGTCGTCCGG ATCACCCGTG ACGATCTCGA GGACGCGGTG CGCATCCTGG TCCTACGCAC CGTGGACGCC TTCCGCCGGA CCGTCCGCGC CGCCGGCGTG GAGGTGTCGG ACCTCGCACG CGTCCTGCTG GTCGGCGGTT CCAGCCGGAT CCCGCTGATC GCCCGCATGA TCGAGGACGA CCTGCGGGTG CCGGTTGCCG TCGACACGCA TCCGAAGCTC GCGGTCTGTC TCGGTGCGGC GATCGCCGCC GGCCCCCGGG TGGCCACCGG GGCGCTCGGG GCGGCGGCCC CAGGGACCGC CGCCGGTCCC GCACCGTGGA CCGCCCCGCC GGTGGGCACG CCACCGTCGC CCCGGCCGGC CGACGCGCCG GCGCCGCGGC CCGGCGCTCC GCAGCCCGCG CCGTGGCCGG ACGGCACGCC CGCGGGGGTT CCGACCGGTG GCCCGGGGGG CGAGCCGACC GACGTTCCGC GGCGCGTGCC GGAACCGGTC GAGGCGACCG CCGCCCGCCG CGCCGCCGAC CTGGTGGCGC CGGCACCGGC CGGTCCGCCC GGCGGCGCCC GCTCGGAGGA ACAGGTCCGG CTCGACGTCG ACCTGGCCGG CGCCGGCCTG GCCGAACCGT CCGACCAGCC GCTACGCCCC GCGGTCATGC CGACGCGCGC GGTCCGGCTG GCCGACCGCG ACGTCCCCCT CGTCGTCCGG ACCGCCGGCG ACGCGTCCTA CCGGCAGGCC GGCCGCCGCA CCGCGGCCGT GCTGGGGGCG GTCGCGGTTG TCGCGGTGCT CGCCGCCGCG GCGATCGGCG TCCTGCTCGG CCTCGGCGGC GGCTCGGCCG GACCGGAGCC CCCGCCCCGC ACGACGGCGC CAGGCGCGGC CAGCACCGCG GCGGCGGCGA CCGCCCGAGT GGCCCGGCTG GCCGGCGCTC CGCTACCGGC CGGCGGGAGT GGCGGTGGCT CGGTGGGCGC CGCCCTCGCC GTCGCCGCCC GACCCGGTGG CGGGCTGGTG 
GCCGTCGGCG CCGCCGGATC GCCAGACCCC GCTGGACGCA CGCCATCCGC ATGGTGGACC GGCGACGGTA CGACCTGGCG GCTGGCGGCG GTGCCACTGC CGGCCGGCAC GACCGTGGGC ACGATGAGCG GCCTGGCCTC CACCGGGGGA CGCCTGGTGG CGGTCGGCTG GGTTGGTTCC GGGGATACCA CCAGCGCCGC GGTCTGGGTC TCGGACGACG GGCAGGCCTG GCGGGCCGGG TCCGTGGGCG GGGCCGCGTC GTCGAGCATG CGTGACGTCG TCGCCCGTGC CGGCGGGCTG CTTGCCGTGG GTCAGGACGA CGGTTCGGAC CCTGAGGGCG ACGGCGCGGT GTGGACGTCG GCGGACGGCA GCGACTGGCA GCGGGTCGGC ATCTCCGGGG CAGACGGGCT CGGCACGCAG ACCCTGCATC GGGTCGTTTC CCTGGCCGGT GGCGGCCTGC TCGCCACGGG GCAGGAACCG GAGGGCGCCG GCACCGTCGC ACGCGTCCGG CAATCGGCGG ACGGGTCGAG TTGGACCGGG GTGGAGACCG ACCTGCCGCT CGACGCCGAG GTGACCGGGC TTGCCATACT GCCGGACGGC CGGCTGGTCG GGGCCGGGTC GGTCCCGCAC GCCGGCGGGC GGCAGCAACA AATCTGGGTG GCGGATGCGA CCGGCCGCTC GTGGGCACCC CAGGACGCGC TGACCGCAAC GGGCCAGTCG GGGACCGGGA TCGACATCAC CGGGGTGGCC GTGGCGGGCA CGCTGGTCGC TGCCGGCAGC ATCGACGGCA CGGACGGACC CGCCGCGGCC TCCTGGTCCG TCACCCTCGA CCAGCCGCGC TGA
|
Protein sequence | MAYALGIDVG TTFTAGAIWR DGRAEAFGLG THSTAVPSVL FLRDDGVMLV GEAAEQRAVT EPSRVAREFK RRFGDDVPVL LSDTWVTATE LFADMIRFVV GKVTERESQA PGYVMLTCPA TWSDHRRGLM EDAAGLAGLG QVGLVAEPTA AAMYYAAQER LEPGALLGIY DLGGGTFDAT VLRKTAGGFE LCGDPGGDDE IGGVDVDQAV VDHIARAVGP SWHEQDTSDP ATARALAAVL AAAVTAKETL SQDLQAEIPV ILPGCNKVVR ITRDDLEDAV RILVLRTVDA FRRTVRAAGV EVSDLARVLL VGGSSRIPLI ARMIEDDLRV PVAVDTHPKL AVCLGAAIAA GPRVATGALG AAAPGTAAGP APWTAPPVGT PPSPRPADAP APRPGAPQPA PWPDGTPAGV PTGGPGGEPT DVPRRVPEPV EATAARRAAD LVAPAPAGPP GGARSEEQVR LDVDLAGAGL AEPSDQPLRP AVMPTRAVRL ADRDVPLVVR TAGDASYRQA GRRTAAVLGA VAVVAVLAAA AIGVLLGLGG GSAGPEPPPR TTAPGAASTA AAATARVARL AGAPLPAGGS GGGSVGAALA VAARPGGGLV AVGAAGSPDP AGRTPSAWWT GDGTTWRLAA VPLPAGTTVG TMSGLASTGG RLVAVGWVGS GDTTSAAVWV SDDGQAWRAG SVGGAASSSM RDVVARAGGL LAVGQDDGSD PEGDGAVWTS ADGSDWQRVG ISGADGLGTQ TLHRVVSLAG GGLLATGQEP EGAGTVARVR QSADGSSWTG VETDLPLDAE VTGLAILPDG RLVGAGSVPH AGGRQQQIWV ADATGRSWAP QDALTATGQS GTGIDITGVA VAGTLVAAGS IDGTDGPAAA SWSVTLDQPR
|
| |
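The coordinate, length, and sequence fields above are consistent with one another, and the relationship can be checked with a short sketch. The helper names below (`gene_length`, `protein_length`, `translate`) are hypothetical and not part of any database API. The sketch assumes inclusive 1-based coordinates, a CDS that ends in a single stop codon, and that the displayed gene sequence is already the coding strand (it begins with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so a standard codon table is used here.

```python
# Illustrative helpers only; names are hypothetical, not a database API.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming exactly one terminal stop codon."""
    return gene_len_bp // 3 - 1


def translate(dna):
    """Translate a coding-strand sequence, ignoring whitespace,
    and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(gene_length(4506842, 4509484))                 # 2643
print(protein_length(2643))                          # 880
print(translate("ATGGCCTACG CGCTCGGCAT CGACGTCGGC"))  # MAYALGIDVG
```

The first thirty bases of the gene sequence translate to the first ten residues of the listed protein sequence, and the computed lengths match the Gene Length and Protein Length fields.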