Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3003 |
Symbol | |
ID | 3905500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3558990 |
End bp | 3561797 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880323 |
Product | heat shock protein 70 |
Protein accession | YP_482089 |
Protein GI | 86741689 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.376537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.234156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTCCGG TCGCCGTGAT GATCGCGTGG GTTTCGGGAG ACGCGGTGAG CTATCAGCTG GGTATCGACC TCGGGTCCGC GAACACGATC GTCGGGGTGG CGGACGGAGG CTGGCCCCGC GTGCTCGAAC TGAGCGGTCA GCGTCGCCTG CCGTCGGTGA TCTACGCCGC CCCGGCCGGT GGCCTGCAGT TCGCGCGCTC GGCGGACCGC CGGTCGCTGG TCGACCCCGA GCGTGCCGCA ACCGACCTCC TGCGCCCGCT CGGGGACAGC GGTCCCGTTC TGCTCGGGGG TGCCGCCTAC AGCCGGGAGG GGCTGATTGC CCGGTTGGTG TCCTACCTCG TCACGGCCGC CACCGAGCAG CTCGGGTCGG AACCGGACCA GGTCGTCGTG ACGTTCCCGA CGTTCTGGTC GGCGGCCCGG CGCGAGGTGT TCGCGGACGC CGTGAGCCAG CTCTCCGATG TCACGGTGCC CGTCGCGGCG TTCCCGGCCG CCGACGCGAT CGGAACCCTG CTGTCTCACG GCTCCTCGAC CCAGACCGTC GAGTTGGTCG GATGGTACGA CTTCGGCGCC GGCTTCCTCG ACACGGGGAT CCTGTCCTTC TCGCCGTTCG GGTTCCAGCT GCTCGGCTTG GCCGCGGGTC TGCGCCACGG CGCCGGCCTC GAGCTCGACG AGCTCCTGGT CGACCGGGTG CTGACCGGCG CGGGTGTCGA GCGTGCGGCG CTCGACCGTT CCGACCCGAT GATCCGGACC GCGCTCGGCA GGCTGCGGCG GGAATGCGCG CAGGCGAAGG AGATCCTTTC CGGTGAGGAC GAGGTTGACG TTCCGGTCTC CCTTCCCGGA GCCGATACCT TCGTCACGCT GAGCCGGGCG GAGCTGGAGA CCCTCATCGG CCCGATTGTC GATGACACAA TCATGACCTT TCGCCGGGCG CTCCGCTCGG TGCCGGTGAC CTCCGCCGAG CTGTCCCGGA TCCTGTTGTC CGGCGGGATC TCCCACCTTC CGCTGGTGGC CCGCCGGCTG CGGGAGACCT TCAGCGGGAT CGGCCGGATC GACCACGGTT CGGACGCGGA TGTGGCGATG GGCGCGGCAC TGCTCGCCGC CGACCTCGCT GACCGGTTCG GCGCCGTCGG CGCCGGGAGT GCACCGGGTG CACCGGGTGC ACCGGGTGTT CCGGGTGTTC CGGGTGTTCC GGGTGTTCCG GGTGCGCCCG CGGCCTTTGG AGCCGTCGGG GCGGTGACCG CCGGCGGTTT CGGCGCGTCG GGCGAGGTCG GTGACGCCCA GGAACCCTCT GCTCAGGAAC CCTCCGACGT CACCGCGGTG ATCCGGCCGC CGGACGCCGC GGACCTGGTC TCGCGGCCGC CCTTCCTGTC CGGGCCGAGG GGGCTGTCCG GGCCGAGGGG GTTCGCGGGC GCGGCGGGTC CGGCCGGGAT CCGAGGCGTG CCGGAGGCGG ACGGCGTGAC GGCCGCGGCG ACGGTGCTCA GCGGCCCGGA CGGCTCCGCG CCGCACGGCT TGGGCGCCGG GGACGACACG ATGATCTCCT CGGGTGGCGC GGCGGATTCC CCGCCGCCGG GGATCGTCGT CGGCGGGCCG GGCCCGTCCG ACACGTCCGG TCACCGGATC CCACCCGGCC TCATGACGGT GGGCGCCGAT CGGGACCACG TTCTGCTCGG GCAGCCGGGG AAGCTCGACC TGCCCTCGGA CTTCTCGACG GGCGGTGTGG GCGGCCCGCA CGGTGCCTCC GCCGGGGACG GCCGGGGAAT CTTCGGCTCC CGGCGGGCCG CGGTCGTCGC CGCCATTGTC GTGGTCCTCT TCCTGGCTCT CGGGACGACC TTCGCCGTGG TGCTGACCGG CCGTGACTCC GGGGCGGGCT CCGGGGCGGA TGCGGTGGCC GCCCCGGCGG TCACCGCGTC GCCGGCCCCA TCCGCCACCG GCCCGACCGG GCCACCCGCC TCCGCCGCGA ACCTCGTCCG GGTGGCGGGA TCCTCGGAGG TCGCCCCGAT TACCGAGACC GCCTATAACG AATTTCGTCA GGTCCAGCGC AACGTCACCG TCAGCATCGA GTCCACGACG ACCGAGGACG GTTTCGCCGC GCTGTGCAGC GGCAAGGCCG ACATCGCCGA CGCCTCGTTC GAACTCAATC CCGGGTTCAT CAAGAACCCC GACTGCGAGA AGAAGGTCGT CGGGTTCGAG GTGGCGCACC ACACCCTGCC GATCGTGGTC AACCCGCGGA ACACCTGGCT GCACTGTCTG ACCCTGCAGC AGGTCAAGCA GGTGTGGGGG GCCGGTTCCG CCGTCACCCG GTGGAGCCAG ATCGACCCGT CGTTCCCGGA CGAGCCGATC ACGTTTGTCG GGCCGCCGCG CGGCTCCGTG CAGGCGCAGG TGTTCAACGC CACGATCAGT GACGCCAGTG ACCGGTCCCG CGACTACCGG CAGACCGATC TCAGCGGGGT CGCCAACGAC GTGGCCGCCG ACCGCTCGGC CATCGGCTAC CTCGACTTCC CCACCTACGA GACCTTCGGC ACCAAGCTGC GCGGTGTCGA AATCAACAAT GGTGACGGGT GCGTCGCGCC GAACGCCGTG TCGGTCGGTA CCGGCCTCTA CCTGCCGCTG TGCAAGCCGC TGTTCGTCTA CGCCCGCACG GACGCCCTGC GCCGGCCAGC GACCGCCGCC TTCCTGCGCT ACTACCTGGC GAACGGCCGG AAGATCGCCT TCGACGCGCA CTACGTCCCC CGCAACGACG ACACGGTCGG GGAGAACGTC GCCAAGCTCG CGAGCCTGAC GGCCGGCGTG GGACCCGTAC CGGCCTAG
|
Protein sequence | MAPVAVMIAW VSGDAVSYQL GIDLGSANTI VGVADGGWPR VLELSGQRRL PSVIYAAPAG GLQFARSADR RSLVDPERAA TDLLRPLGDS GPVLLGGAAY SREGLIARLV SYLVTAATEQ LGSEPDQVVV TFPTFWSAAR REVFADAVSQ LSDVTVPVAA FPAADAIGTL LSHGSSTQTV ELVGWYDFGA GFLDTGILSF SPFGFQLLGL AAGLRHGAGL ELDELLVDRV LTGAGVERAA LDRSDPMIRT ALGRLRRECA QAKEILSGED EVDVPVSLPG ADTFVTLSRA ELETLIGPIV DDTIMTFRRA LRSVPVTSAE LSRILLSGGI SHLPLVARRL RETFSGIGRI DHGSDADVAM GAALLAADLA DRFGAVGAGS APGAPGAPGV PGVPGVPGVP GAPAAFGAVG AVTAGGFGAS GEVGDAQEPS AQEPSDVTAV IRPPDAADLV SRPPFLSGPR GLSGPRGFAG AAGPAGIRGV PEADGVTAAA TVLSGPDGSA PHGLGAGDDT MISSGGAADS PPPGIVVGGP GPSDTSGHRI PPGLMTVGAD RDHVLLGQPG KLDLPSDFST GGVGGPHGAS AGDGRGIFGS RRAAVVAAIV VVLFLALGTT FAVVLTGRDS GAGSGADAVA APAVTASPAP SATGPTGPPA SAANLVRVAG SSEVAPITET AYNEFRQVQR NVTVSIESTT TEDGFAALCS GKADIADASF ELNPGFIKNP DCEKKVVGFE VAHHTLPIVV NPRNTWLHCL TLQQVKQVWG AGSAVTRWSQ IDPSFPDEPI TFVGPPRGSV QAQVFNATIS DASDRSRDYR QTDLSGVAND VAADRSAIGY LDFPTYETFG TKLRGVEINN GDGCVAPNAV SVGTGLYLPL CKPLFVYART DALRRPATAA FLRYYLANGR KIAFDAHYVP RNDDTVGENV AKLASLTAGV GPVPA
|
| |