Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1946 |
Symbol | |
ID | 5670347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2338856 |
End bp | 2341729 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240867 |
Product | heat shock protein 70 |
Protein accession | YP_001506289 |
Protein GI | 158313781 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component [COG0443] Molecular chaperone |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.253458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.635918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTTACC AGCTTGGCAT CGACGTCGGA TCGGCCACCA CAGTCGTCGC CGCCACGGAC GGGGGCTGGC CCGCCGTGCT GACCCTGGGC GGCGCCCGAG CCGTGCCGTC CGTCCTGTAC ATGCCCCAGA CGGGCGGTGT GCTGTTCGGG CGCTCGGCTG AGAGGCGGGC CCGCACCGAT CCCGACCGGG CCGCCCGGGG GTTCCTGCGC CGGCTCGGCG AGCCCGGCCA CCTGCTGGTC GGCGGCGCCG CGTACAGCCC GGACGGGCTG CTCGCCCGGC TGGTCGGCCA CCTGGTCGGG CAGGTCGTCG CGGCCCGGGG GGAGGAGCCC GAGCAGATCG TCGTCGCGCA TCCCGCCTTC TGGCCCGCAC ACCGCCGCGA GGTGTTCGCG TCCGCGGTCA GCCAGCTCTC GGACGTCTCT GCACCCGTCG CGACCTGCGC GGCCGCCGAC GCGATCGGCA CCCTGCTCGC CCGCCGCTCG GGGACCCGCA CGGTCGACCT CGTCGGGGTC TACGACTTCG GCGCCGGGCA CTTCGACGCG GCGGTGCTCT CGTTCAGCCC GTTCGGGTTC CAGCAGCTGG GCACATCGGT CGGAGTGAAC CACGCCGGCG GGGCCGACTT CGACGAGCTG CTGGTCGAGC GGGTGCTGGC CGAGGCCGGC GCCGGCCGGG AGCGGCTCGA CCGCTCCGAT CCCGCGGTCA CCGCGGCACT CGCCCGGCTG CGTGAGGAGT GCGCCCAGGC GAAGGAGTAC CTCGCGGAGG AGGACGAGAT CGAGGTCGCC CTCGCGCTCC CCGGCCTGCC CGCGACGTCG GTGGTGCTGC GCCGCGCGGA TCTTGAGACC CTCGTCGCGC CGGTCGTCGA CGACACGGTC CGGGCGTTCC GGCGGACCCT CCGAACGGGC GAGGCGACCC CCGAGGATCT CTCCTCCGTG CTCCTCTACG GCGGCGCGGC GCGGATGCCG ATCGTCGCCG CCCAGATGCG GGCGGCGTTC CCGAGCGTCG GCCGGTGGGA GTACGGCTCG GACGACGACA TCGCGACAGG CGCGGCGCTG ATCGCCGCGC GGCTGGCCGC CCAGTCCTCC CGCGAGGAGG TCACCTCGGT CATCCGGCCG CCGGACAGCA GCCCGCCGAT CCTCTCCGCG CCGCCGCTGA GCTCTGTCCC GCCCCTGAGC CCGGTCCCGC CGGTCGGGTC AGTGCCGCCG GTCGGGTCAG TGCCGCCCGG CGCGGTGGCG TCGACCGGCG GGCCGGCCGC CGGTGACGGC GACACGTCCA GCGGGCCGGT TCCCCCCGGC GGGGCGGTGT TCGGGGGAGC GGCCGCCGGC GTCAGCGCCT CCAGCCTGGG CTACACCTCA TGGCCCGACC GGACGGGACA GCCGCCCGCC GCCGACCCCG ACGCCACGGC GATCGGCCAC CACGGCGGCT CGGCGGACGA CACCCTGATC TCCCGCGGCG GCCACGGAAC ACCGCCACCC ACCACGGCCC CGCCACCCAC CGGGCCGACG TCCACAGGCG CGACGTCCAC CGGGGAGCCG GCTGGCGCGG GGTACCAGGG CGGTTGGGGG AGCAGCCCGT ACCCGTACAG CGACGCCGCG CACACCGTGG CGGCCGGCTC GTCGGCCGGC GGCGACCGCA CCCAGGCGGT CGCCACGCCC GGCGGCCCCG GGACGACAGG AGCGCCCGCG ACCACCGGAG TCACCGGAGT CACCGGGATC GCTGGGACCG CCGGCGGCGG TGATCCTCAG TCGGCCGCGC CGACACCGTC CCAGGCCGGA ACGTTCGGCG GGAGCTCCGC CGGCACCCGG GCCGTCCCGC GCGGCGGGCT GTTCGGCGGC TGGTCCCGCG CGACGATCGC CGCGGCCGTG GCCGCGATCG TCTTCGTCGC CGCCGGCACC ACCCTGGGCA TCGTGCTGAC CGGAGGCGGC GGCGACACGC CGTCCAACGG CATCGTGCCG ATCGCCGCGC CCGCCGCCAC GCTGCCGCCG CCCGCGGCGA CGACCGCGCC GCCCGCGGCC CCGACACCCG GACCCAACAC GGTGCTCGTC GCGGGCTCCA GCGAGGTCGC GCCGATCACC GAGACCGCCT ACGCCGGGTT CCGCAAGGTT CAGCAGAACG TGACCGTGAA CGTCGAGGCG TCGACGACGG AGGACGGCTT CGCCAAGCTC TGCGCGGGCG GCGCCGACAT CGCCGGCGCG TCCTTCGAGT TCGACCCGTC CTTCTCGAAG GACCCCGGCT GCGCCGACCA GATCGTCGGG TTCGAGGTCG CGCACCACAC ACTGCCGATC GTGGTGAACC CGCAGAACAC CTGGGCGCGC TGCATGACCC TCGACCAGGT GCGCAAGGTC TGGGACGCCG GCTCGACGAT CAACCGGTGG AACCAGATCG ACCCGTCCTT CCCGGACGAG CCGATCACGT TCGTGGGGCC GTCCCGGAAC ACCGTCCAGG CGCAGGTGTT CAACTCGACG GTGAACGACT CCAGCTCCCG GTCCCGGCAG TACCAGGAGA CCGACCTGAG CGGGGTCGCG AACGACGTCG CCGGTGACCG GTTGGCCATG GGCTTCCTGG ACTTCCCGAC GTTCGAGACC TTCGGGCCCC GACTGAGGGG CCTGGAGATC GACAACGGTG AGGGATGTGT CGAACCGAAC GCGGTGACGG CCGGAACGGG CTTCTACCTC CCGCTGTGCA AGCCGGTGTT CGTCTACGCC CGTAAGGACT CGCTGCAGAA GCCCGCGGCC GCCGCGTTCA TGCGCTACTA CATGGAGAAC GGCGAGGAGA TCGCCTTCGA CGCGCACTAC GTCCCGCGGA CCAAGAGCAC GATCGACGAG AACGTGGCCC GCGTCGACGA GCTGACAAAG GGAGTACCAC CCGTCACGGC CTGA
|
Protein sequence | MGYQLGIDVG SATTVVAATD GGWPAVLTLG GARAVPSVLY MPQTGGVLFG RSAERRARTD PDRAARGFLR RLGEPGHLLV GGAAYSPDGL LARLVGHLVG QVVAARGEEP EQIVVAHPAF WPAHRREVFA SAVSQLSDVS APVATCAAAD AIGTLLARRS GTRTVDLVGV YDFGAGHFDA AVLSFSPFGF QQLGTSVGVN HAGGADFDEL LVERVLAEAG AGRERLDRSD PAVTAALARL REECAQAKEY LAEEDEIEVA LALPGLPATS VVLRRADLET LVAPVVDDTV RAFRRTLRTG EATPEDLSSV LLYGGAARMP IVAAQMRAAF PSVGRWEYGS DDDIATGAAL IAARLAAQSS REEVTSVIRP PDSSPPILSA PPLSSVPPLS PVPPVGSVPP VGSVPPGAVA STGGPAAGDG DTSSGPVPPG GAVFGGAAAG VSASSLGYTS WPDRTGQPPA ADPDATAIGH HGGSADDTLI SRGGHGTPPP TTAPPPTGPT STGATSTGEP AGAGYQGGWG SSPYPYSDAA HTVAAGSSAG GDRTQAVATP GGPGTTGAPA TTGVTGVTGI AGTAGGGDPQ SAAPTPSQAG TFGGSSAGTR AVPRGGLFGG WSRATIAAAV AAIVFVAAGT TLGIVLTGGG GDTPSNGIVP IAAPAATLPP PAATTAPPAA PTPGPNTVLV AGSSEVAPIT ETAYAGFRKV QQNVTVNVEA STTEDGFAKL CAGGADIAGA SFEFDPSFSK DPGCADQIVG FEVAHHTLPI VVNPQNTWAR CMTLDQVRKV WDAGSTINRW NQIDPSFPDE PITFVGPSRN TVQAQVFNST VNDSSSRSRQ YQETDLSGVA NDVAGDRLAM GFLDFPTFET FGPRLRGLEI DNGEGCVEPN AVTAGTGFYL PLCKPVFVYA RKDSLQKPAA AAFMRYYMEN GEEIAFDAHY VPRTKSTIDE NVARVDELTK GVPPVTA
|
| |